• “创新从来都是九死一生”(人民论坛) 2019-02-14
  • 端午假期广州铁路运客640.5万人次 创历史新高 2019-02-14
  • 19次生态输水让塔河下游生机勃勃 2018-11-22
  • 男篮再胜伊朗迎热身赛两连胜 任骏飞19+11陶汉林18分 2018-11-22
  • 小卒子,你南街村的代言人啊?扮豬不咋像呢!你滴,大大滴,明白? 2018-11-22
  • 女性之声——全国妇联 2018-11-21
  • 新华网评:凝聚打赢脱贫攻坚战的强大合力 2018-11-21
  • 栗战书:执法检查要直面问题不搞评功摆好 让法律制度成为不可触碰的高压线 2018-11-21
  • 这些水果越新鲜越不能吃 放一放更好吃 2018-11-21
  • 生产资料公有制不会也不可能涉及生产资料的分配,这完全是你杜撰的,是强词夺理的。从这点看,你的所谓逻辑是幼稚可笑的。哈哈哈哈! 2018-11-20
  • 践行“两山论”是一场发展的革命 2018-11-20
  • 女教师舍身保护学生被撞身亡感动各界 2018-11-20
  • Welcome to

    Before you start posting please read the forum rules. By posting to this forum you agree to abide by the rules.

     

    Go Back   Doom9's Forum > Announcements and Chat > General Discussion

    Reply
     
    Thread Tools Search this Thread Display Modes
    Old 30th January 2019, 16:55   #41  |  Link
     
    Groucho2004's Avatar
     
    Join Date: Mar 2006
    Posts: 4,040
    Quote:
    Originally Posted by almosely View Post
    Tested QuickSync + FFT3dFilter 2.5 (sigma 1.0, sharpen 0.16) in MT-mode: Every trial was more slowly (2,4,6 threads, with and without requestlinear;19-21 fps)
    Multi-threading should speed this up. Post your complete script.
    Groucho2004 is offline   Reply With Quote
    Old 30th January 2019, 20:57   #42  |  Link
    Registered User
     
    Join Date: Dec 2006
    Location: Germany
    Posts: 42
    Single-Threaded-Script:
    -----------------------

    DGSourceIM("clip.dgi", engine=1)

    Trim(515, 4291)

    Crop(0, 0, -0, -8)

    FFT3DFilter(sigma=1.0, beta=1.0, bw=32, bh=32, sharpen=0.16, scutoff=0.27, plane=0, bt=3)

    return last


    Multi-Threaded-Script:
    ----------------------

    DGSourceIM("clip.dgi", engine=1)

    RequestLinear(rlim=50, clim=50) # tested with and without; with has been faster

    Trim(515, 4291)

    Crop(0, 0, -0, -8)

    FFT3DFilter(sigma=1.0, beta=1.0, bw=32, bh=32, sharpen=0.16, scutoff=0.27, plane=0, bt=3)

    Prefetch(2) # tested with 1,2,3,4,6

    return last


    And I am using the "mtmodes.avsi" from here: //publishwith.me/ep/pad/view/ro.rDkwcdWn4k9/latest


    Results: AVSMeter 2.8.9 (x64):
    ------------------------------

    1) ST: 44.18 fps (CPU usage: 25%)

    2) MT(2): 58.98 fps (CPU usage: 50%) # with RequestLinear(50,50)

    3) MT(2): 57.73 fps (CPU usage: 50%) # with RequestLinear(100,100)

    4) MT(3): 55.90 fps (CPU usage: 74%) # with RequestLinear(50,50)

    5) MT(3): 60.83 fps (CPU usage: 74%) # with RequestLinear(100,100)

    6) MT(4): 57.42 fps (CPU usage: 95%) # with RequestLinear(100,100)


    Results: Simple x264/x265 Launcher (64-Bit) 2.89.1138:
    ------------------------------------------------------

    1) 22.49 fps

    2) 21.51 fps

    5) 20.53 fps


    The corresponding x264.exe line:
    --------------------------------
    --output-depth 8 --crf 18.0 --preset medium --tune film --trellis 2 --direct auto --me umh --partitions all --vbv-maxrate 24000 --vbv-bufsize 30000 --b-adapt 2 --bframes 3 --merange 16 --ref 3 --keyint 240 --subme 10 --aq-mode 1 --sar 1:1 --rc-lookahead 40 --output "clip.mkv" --frames 3777 --demuxer y4m --stdin y4m -


    And sometimes, the MT-Job crashes within Simple Launcher. To be up-to-date, I updated the Simple Launcher a few hours ago, and with this new version, the fps got less too (from 22.67 fps to 22.49 fps).

    I let AVS+ autoload all plugins and sripts; I put everything I need into the corresponding "plugins64"-folder:

    addgrain.avs
    AddGrainC.dll
    avstp.dll
    CheckTopFirst.avsi
    colormatrix.dll
    CompTest.avsi
    DGDecodeIM.dll
    DGDecodeNV.dll
    dither.avsi
    dither.dll
    fft3dfilter.dll
    FFT3dGPU.dll
    fft3dgpu.hlsl
    libmfxsw64.dll
    masktools2.dll
    mt_xxpand_multi.avsi
    mtmodes-rev.850.avsi
    RgTools.dll
    TIVTC.dll

    These are the installed filter and script-versions:

    AddGrainC 1.7.1 (25-11-2013)
    ColorMatrix 2.5 (20-03-2010)
    DGDecNV 2052 (30-07-2016)
    DGDecodeIM beta50 (10-10-2015)
    Dither tools 1.27.2 (30-12-2015)
    FFT3DFilter 2.5 (02-07-2018)
    FFT3dGPU 0.8.4 (21-11-2018)
    FFTW 3.3.8 (28-05-2018)
    MaskTools2 2.2.18 (05-09-2018)
    RgTools 0.97 (02-07-2018)
    TIVTC 1.0.11 (23-03-2018)

    Last edited by almosely; 30th January 2019 at 21:07.
    almosely is offline   Reply With Quote
    Old 30th January 2019, 21:22   #43  |  Link
     
    Groucho2004's Avatar
     
    Join Date: Mar 2006
    Posts: 4,040
    I ran a few tests with your script, it seems that fft3dfilter doesn't scale well in mt setups. I vaguely remember that it has adaptive mt mode registration depending on parameter values. The bottleneck could also be the FFTW library.
    Groucho2004 is offline   Reply With Quote
    Old 30th January 2019, 22:27   #44  |  Link
    Registered User
     
    Join Date: Dec 2006
    Location: Germany
    Posts: 42
    Quote:
    Originally Posted by FFT3DFilter 2.5 (02-07-2018) (x64) Documentation
    Version 2.3 - February 21, 2017
    - apply current avs+ headers
    - 10-16 bits and 32 bit float colorspace support in AVS+
    - Planar RGB support
    - look for libfftw3f-3.dll first, then fftw3.dll
    - inline asm ignored on x64 builds
    - pre-check: if plane to process for greyscale is U and/or V then returns original clip
    - auto register MT mode for avs+: MT_SERIALIZED
    - autoscale sigma and smin/smax parameter from 8 bit scale if colorspace is different
    Version 2.4 - June 08, 2017
    - some inline asm (not all) ported to simd intrisics, helps speedup x64 mode, but some of them faster also on x86.
    - intrinsics bt=0
    - intrinsics bt=2, degrid=0, pfactor=0
    - intrinsics bt=3 sharpen=0/1 dehalo=0/1
    - intrinsics bt=3
    - Adaptive MT settings for Avisynth+: MT_SERIALIZED for bt==0 (temporal), MT_MULTI_INSTANCE for others
    - Copy Alpha plane if exists
    - reentrancy checks against bad multithreading usage
    Note: for properly operating in MT_SERIALIZED mode in Avisynth MT, please use Avs+ r2504 or better.
    Version 2.5 - July 02, 2018
    - Change 32 bit float formar: U/V chroma center to zero instead of 0.5 to match Avisynth+ r2728
    Yes, it has. I use bt=3, so mt-mode 2 will be used.

    But I think I have to use AVS+ in 32-bit-mode or migrate back to AVS 2.6.0 MT (SEt), because the FFT3dFilter 2.5 (and 2.4 and 2.3) is messing with the luma, even when the filter is only in the filter-chain without adjustments. It looks like FFT3dFilter dithers and brightens and darkens the image, just when in the filter-chain included - it seems to be an issue with colorspace- or bit-depth-conversion to me. Maybe the old 2.1.1 version (2007) from Fizick is working right and I can use that one (but I did not find any 64-bit version of it and don't know if it's working with AVS+). FFT3DGPU does work fine, regarding that. And AVS 2.6.0 did not crash with this script, wether in ST- nor in MT-Mode.

    Last edited by almosely; 30th January 2019 at 22:33.
    almosely is offline   Reply With Quote
    Old 30th January 2019, 23:56   #45  |  Link
     
    Groucho2004's Avatar
     
    Join Date: Mar 2006
    Posts: 4,040
    Quote:
    Originally Posted by almosely View Post
    Maybe the old 2.1.1 version (2007) from Fizick is working right and I can use that one (but I did not find any 64-bit version of it and don't know if it's working with AVS+).
    I made a 64 bit build of 2.1.1 some time ago. You're welcome to try it. It does work just fine with AVS+.
    Groucho2004 is offline   Reply With Quote
    Old 31st January 2019, 00:19   #46  |  Link
    Registered User
     
    Join Date: Dec 2006
    Location: Germany
    Posts: 42
    Cool, thank you! :-) ... but, I just tried: Still not working right. Must be a problem of AVS+ (x64) :-(

    -edit-

    Quote:
    Originally Posted by almosely View Post
    I observed a big difference when comparing fft3dfilter against fft3dgpu within AvsPmod at first sight (histogram "luma" activated).
    Grml ... Obviously it's a difference in general between FFT3dFilter and FFT3DGPU :-( At that point I had'nt installed AVS+ and still tested with AVS 2.6.0. But how could it be, that nobody realised that problem before? Perhaps I should check every parameter - maybe a default-value is set wrong?

    Last edited by almosely; 31st January 2019 at 00:47.
    almosely is offline   Reply With Quote
    Old 10th February 2019, 20:28   #47  |  Link
    Registered User
     
    Join Date: Dec 2006
    Location: Germany
    Posts: 42
    So, after a long term of testing AVS 2.6.0 MT (SEt) (x86) vs. AviSynth+ 0.1.0 r2772 MT (x64) I came to the conclusion, that AVS+ is faster in general - and, at least with my filter-collection - same/more stable.

    With the newest available versions of my filters, VC Redist 2017 and AVS+, the encoding frame-rate went up from 16.76 fps to 16.99 fps and the one from AVSMeter 2.8.9 from 40.37 to 43.49 fps (with one common test-clip).

    But I discovered something more and that I will post within the corresponding AVS+ thread in a few minutes:

    //www.zs-x.com/showthread.p...68856&page=225
    almosely is offline   Reply With Quote
    Reply

    Tags
    fftw, fftw3.dll


    Posting Rules
    You may not post new threads
    You may not post replies
    You may not post attachments
    You may not edit your posts

    BB code is On
    Smilies are On
    [IMG] code is On
    HTML code is Off

    Forum Jump


    All times are GMT +1. The time now is 06:24.


    Powered by vBulletin® Version 3.8.11
    Copyright ©2000 - 2019, vBulletin Solutions Inc.
  • “创新从来都是九死一生”(人民论坛) 2019-02-14
  • 端午假期广州铁路运客640.5万人次 创历史新高 2019-02-14
  • 19次生态输水让塔河下游生机勃勃 2018-11-22
  • 男篮再胜伊朗迎热身赛两连胜 任骏飞19+11陶汉林18分 2018-11-22
  • 小卒子,你南街村的代言人啊?扮豬不咋像呢!你滴,大大滴,明白? 2018-11-22
  • 女性之声——全国妇联 2018-11-21
  • 新华网评:凝聚打赢脱贫攻坚战的强大合力 2018-11-21
  • 栗战书:执法检查要直面问题不搞评功摆好 让法律制度成为不可触碰的高压线 2018-11-21
  • 这些水果越新鲜越不能吃 放一放更好吃 2018-11-21
  • 生产资料公有制不会也不可能涉及生产资料的分配,这完全是你杜撰的,是强词夺理的。从这点看,你的所谓逻辑是幼稚可笑的。哈哈哈哈! 2018-11-20
  • 践行“两山论”是一场发展的革命 2018-11-20
  • 女教师舍身保护学生被撞身亡感动各界 2018-11-20