FFT3DGPU

From Avisynth wiki
(Difference between revisions)
Jump to: navigation, search
m (link fix)
(add precision note and fix a typo)
 
(9 intermediate revisions by 2 users not shown)
Line 1: Line 1:
#REDIRECT [[External_filters#Spatio-Temporal_Denoisers]]
+
{{FilterCat6|External_filters|Plugins|Plugins_x64|Denoisers|Spatial-Temporal_Denoisers|Deep_color_tools}}
{{FilterCat|External_filters|Denoisers|Spatial-Temporal_Denoisers}}
+
{{Filter3
 +
| {{Author/tsp}}, {{Author/pinterf}}
 +
| v0.8.6
 +
|[https://github.com/pinterf/FFT3dGPU/releases FFT3dGPU-v0.8.6-pfmod.7z]
 +
| Denoisers
 +
| [https://www.gnu.org/licenses/gpl-2.0.txt GPLv2]
 +
|6=[https://forum.doom9.org/showthread.php?t=89941 Doom9 Forum (original)]
 +
}}
 +
 
 +
== Description ==
 +
FFT3dGPU is a GPU version of Fizick's [[FFT3DFilter]]. The algorithm (Fast Fourier Transform, denoising) is the same for the most part. Currently the following is not implemented: support for noise pattern.
 +
 
 +
In this version the next frame is processed while waiting for the GPU to end it's work. Meaning the filters before fft3dGPU are working concurrently with it. From v0.8.4, the filter automatically registers itself as [[MT_SERIALIZED]] for AviSynth+.
 +
<br>
 +
<br>
 +
 
 +
== Requirements ==
 +
* [x86]: [[AviSynth+]] or [https://sourceforge.net/projects/avisynth2/ AviSynth 2.6.0]
 +
* [x64]: [[AviSynth+]]
 +
* Supported color formats: [[Y8]], [[YV12]], [[YV16]], [[YV24]], [[YV411]]
 +
** AviSynth+: All [[planar]] formats (8/10/12/14/16/32-bit, YUV/RGB with or without alpha) are supported.
 +
<br>
 +
 
 +
== [[Script variables|Syntax and Parameters]] ==
 +
{{Template:FuncDef|FFT3DGPU (clip, float "sigma", float "beta", int "bw", int "bh", int "bt", float "sharpen", int "plane", int "mode", int "bordersize", int "precision", bool "NVPerf", float "degrid", float "scutoff", float "svr", float "smin", float "smax", float "kratio", int "ow", int "oh", int "wintype" , int "interlaced", float "sigma2", float "sigma3", float "sigma4", bool "oldfft") }}
 +
 
 +
<br>
 +
::{{Par2| |clip| }}
 +
:::Input clip
 +
<br>
 +
::{{Par2|sigma|float|2.0}}
 +
:::Given noise value for all (or highest) frequencies. Only values greater that 0.0
 +
<br>
 +
::{{Par2|beta|float|1.0}}
 +
:::Noise margin (float>=1.0, default=1.0 for no noise left): control noise amount left in the restoration for Wiener filter, so max filter noise attenuation = <code>(beta-1)/beta</code>.
 +
<br>
 +
::{{Par2|bw|int|32}}
 +
::{{Par2|bh|int|32}}
 +
:::Block width and height. It should be a power of 2 ie valid values is 4,8,16,32,64,128,256,512 (note that <code>bw</code> should be greater than 4 for best result).
 +
<br>
 +
::{{Par2|bt|int|3}}
 +
:::Block temporal size, number of frames.
 +
:::*-1 : sharpen only (2D)
 +
:::*0 : all previous frames (switch Kalman filter mode)
 +
:::*1 : only current frame (spatial 2D Wiener filter)
 +
:::*2 : previous and current frame (3D Wiener filter)
 +
:::*3 : previous, current and next frame (3D Wiener filter)
 +
:::*4 : two previous, current and next frame (3D Wiener filter)
 +
<br>
 +
::{{Par2|sharpen|float|0.0}}
 +
:::Sharpening strength, positive values sharpens the image, negative values blurs the image. 0 disables sharpening.
 +
<br>
 +
::{{Par2|plane|int|0}}
 +
:::Processed color plane:
 +
:::* 0 : luma(Y)
 +
:::* 1 : chroma U
 +
:::* 2 : chroma V
 +
:::* 3 : chroma planes U and V
 +
:::* 4 : both luma and chroma
 +
:::For a greyscale clip. 4 means luma (the only plane), 1,2 and 3 gives error message.
 +
:::For RGB clips (planar RGB only!) operation is always 4=all planes, regardless of this parameter.
 +
<br>
 +
::{{Par2|mode|int|1}}
 +
:::Processing mode:
 +
:::* 0 : only overlaps 1:1. This is faster but produces artifacts with high sigma values.
 +
:::* 1 : block overlaps 2:1. This is slower but produces fewer artifacts.
 +
:::* 2 : again 1:1 overlap but with a additional border. This reduces border artifacts seen with mode=0.
 +
:::The speed is between mode 0 and 1. <code>Kalman(bt=0)</code> works well with <code>mode=0</code>.
 +
<br>
 +
::{{Par2|bordersize|int|1}}
 +
:::Only used with <code>mode</code> 2. Defines the size of the border.
 +
<br>
 +
::{{Par2|precision|int|0}}
 +
:::*0 : to use 16 bit floats(half precision).
 +
:::*1 : to use 32 bit float(single precision) for the fft and 16 bit float for the wienner/kalman and sharpening.
 +
:::*2 : always use 32 bit floats.
 +
:::Using 16 bit float increases the performance but reduces precision. With a Geforce 7800GT precision=0 is ~1.5 times faster than than mode 2.
 +
:::Note: in FFT3dGPU v0.8.6, for 10-32 bit clips, the parameter "precision" is ignored and is set to 2: always use 32 bit float internally.
 +
<br>
 +
::{{Par2|NVPerf|bool|false}}
 +
:::Enables support for NVPerfHUD (http://developer.nvidia.com/object/nvperfhud_home.html).
 +
<br>
 +
::{{Par2|degrid|float|1.0}}
 +
:::Enables degriding. Only works well with <code>mode=1</code>. Doesn't degrid the Kalman filter (but it does degrid the sharpening (if enabled) after kalman filter). default 1.0 for <code>mode=1</code>, 0.0 for <code>mode=0</code> or 2.
 +
<br>
 +
::{{Par2|scutoff|float|0.3}}
 +
::: Sharpening cutoff frequency, relative to max.
 +
<br>
 +
::{{Par2|svr|float|1.0}}
 +
:::Sharpening (and dehalo) vertical ratio (to horizontal) (default=1.0 - same as horizontal, 0 - no vertical sharpening)
 +
<br>
 +
::{{Par2|smin|float|4.0}}
 +
::{{Par2|smin|float|20.0}}
 +
::: Minimum and maximum limit (approximate noise margin) for sharpening stage.
 +
<br>
 +
::{{Par2|kratio|float|2.0}}
 +
:::Ratio of threshold to sigma to reset Kalman filter. Variation threshold = <code>sigma*kratio</code>, good value is about from 1.5 to 3.0
 +
<br>
 +
::{{Par2|ow|int|bw/2}}
 +
::{{Par2|oh|int|bh/2}}
 +
:::Overlap width and height. This only works with <code>mode=1</code>. This specifies how big the overlap between the blocks are. Overlap size must be less than or equal to half the blocksize. <code>ow</code> must be even.
 +
<br>
 +
::{{Par2|wintype|int|0}}
 +
:::weighting windows type:
 +
:::*0 : same analysis and synthesis half-cosine window, used in all versions before 1.4.
 +
:::*1 : intermediate between 0 and 2
 +
:::*2 : flat analysis window, rised cosine (Hanning) synthesis window.
 +
<br>
 +
::{{Par2|interlaced|bool|false}}
 +
:::Set to true for separate filtering for each field.
 +
<br>
 +
::{{Par2|singma2|float|sigma}}
 +
::{{Par2|singma3|float|sigma}}
 +
::{{Par2|singma4|float|sigma}}
 +
:::Given noise value at second, third, and lowest scale level frequencies. Only values greater that 0.0
 +
<br>
 +
::{{Par2|oldfft|bool|false}}
 +
:::Set to true to use the old fftcode (used in version 0.6.2 and lower) false to use new fft code. If not defined fft3dgpu will use the fastest code.
 +
<br>
 +
 
 +
== Changelog ==
 +
Version      Date            Changes<br>
 +
v0.8.6      2021/01/04      - Support 10-32 bit formats. Parameter ranges (smin, smax, sigmas) are the same for all bit depths.
 +
                              - For 10-32 bit clips parameter "precision" is ignored and is set to 2: always use 32 bit float internally
 +
                              - Note: for 8 bits "precision" default is 0: calculation is done in 16 bit floating point type (s10m5) internally<br>
 +
v0.8.5      2020/11/01      - Avisynth+ 3.6 additional support: preserve frame properties
 +
                              - Fix bt=0 Kalman mode warning in fft3dgpu.hlsl<br>
 +
v0.8.4      2018/11/21      - New color spaces besides YV12 and YUY2:
 +
                                Y8, YV16, YV411, YV24 (all 8 bits), 8 bit Planar RGB (AviSynth+)
 +
                              - FFT3dGPU filter registers itself as [[MT_SERIALIZED]] automatically
 +
                                for AviSynth+: no need for SetFilterMTMode.<br>
 +
v0.8.3      2018/11/18      - project hosted on https://github.com/pinterf/FFT3dGPU
 +
                              - source and project files updated to build under Visual Studio 2017
 +
                              - (15.9) using DX SDK June 2010
 +
                              - Moved to Avisynth Interface 6 (v2.6) - using headers from the AviSynth+ project
 +
                              - added x64 platform<br>
 +
v0.8.2a      2006/08/24      - Last release by tsp, see included docs for full changelog
 +
<br>
 +
== Archived Downloads ==
 +
{| class="wikitable" border="1"; width="600px"
 +
|-
 +
!!width="100px"| Version
 +
!!width="150px"| Download
 +
!!width="150px"| Mirror
 +
|-
 +
!v0.8.2a
 +
|[https://www.dropbox.com/s/b25nik2r4qkujf9/fft3dgpu0.8.2a.exe?dl=1 fft3dgpu0.8.2a.exe]
 +
|[https://web.archive.org/web/20200613030343if_/https://files.videohelp.com/u/223002/fft3dgpu0.8.2a.exe fft3dgpu0.8.2a.exe]
 +
|-
 +
!v0.8.2 (x64)
 +
|[https://web.archive.org/web/20200613010756if_/https://files.videohelp.com/u/223002/FFT3DGPU_3-15-2010.rar FFT3DGPU_3-15-2010.rar]
 +
|[http://www.mediafire.com/file/2chnt1jkwwm/FFT3DGPU_3-15-2010.rar/file FFT3DGPU_3-15-2010.rar]
 +
|-
 +
!v0.8.2
 +
|[http://www.avisynth.nl/users/tsp/fft3dgpu0.8.2.7z fft3dgpu0.8.2.7z] /// [http://www.avisynth.nl/users/tsp/fft3dgpu0.8.2.exe fft3dgpu0.8.2.exe]
 +
|[https://web.archive.org/web/20170616074504if_/http://avisynth.nl/users/tsp/fft3dgpu0.8.2.7z fft3dgpu0.8.2.7z] /// [https://web.archive.org/web/20170914090422if_/http://www.avisynth.nl/users/tsp/fft3dgpu0.8.2.exe fft3dgpu0.8.2.exe]
 +
|}
 +
*v0.8.2 x64 version compiled by Joshy D.
 +
<br>
 +
== External Links ==
 +
<br>
 +
<br>
 +
-----------------------------------------------
 +
'''Back to [[External_filters#Spatio-Temporal_Denoisers|External Filters]] &larr;'''

Latest revision as of 16:48, 18 September 2021

Abstract
Author tsp, pinterf
Version v0.8.6
Download FFT3dGPU-v0.8.6-pfmod.7z
Category Denoisers
License GPLv2
Discussion Doom9 Forum (original)

Contents

[edit] Description

FFT3dGPU is a GPU version of Fizick's FFT3DFilter. The algorithm (Fast Fourier Transform, denoising) is the same for the most part. Currently the following is not implemented: support for noise pattern.

In this version the next frame is processed while waiting for the GPU to end it's work. Meaning the filters before fft3dGPU are working concurrently with it. From v0.8.4, the filter automatically registers itself as MT_SERIALIZED for AviSynth+.

[edit] Requirements


[edit] Syntax and Parameters

FFT3DGPU (clip, float "sigma", float "beta", int "bw", int "bh", int "bt", float "sharpen", int "plane", int "mode", int "bordersize", int "precision", bool "NVPerf", float "degrid", float "scutoff", float "svr", float "smin", float "smax", float "kratio", int "ow", int "oh", int "wintype" , int "interlaced", float "sigma2", float "sigma3", float "sigma4", bool "oldfft")


clip   =
Input clip


float  sigma = 2.0
Given noise value for all (or highest) frequencies. Only values greater that 0.0


float  beta = 1.0
Noise margin (float>=1.0, default=1.0 for no noise left): control noise amount left in the restoration for Wiener filter, so max filter noise attenuation = (beta-1)/beta.


int  bw = 32
int  bh = 32
Block width and height. It should be a power of 2 ie valid values is 4,8,16,32,64,128,256,512 (note that bw should be greater than 4 for best result).


int  bt = 3
Block temporal size, number of frames.
  • -1 : sharpen only (2D)
  • 0 : all previous frames (switch Kalman filter mode)
  • 1 : only current frame (spatial 2D Wiener filter)
  • 2 : previous and current frame (3D Wiener filter)
  • 3 : previous, current and next frame (3D Wiener filter)
  • 4 : two previous, current and next frame (3D Wiener filter)


float  sharpen = 0.0
Sharpening strength, positive values sharpens the image, negative values blurs the image. 0 disables sharpening.


int  plane = 0
Processed color plane:
  • 0 : luma(Y)
  • 1 : chroma U
  • 2 : chroma V
  • 3 : chroma planes U and V
  • 4 : both luma and chroma
For a greyscale clip. 4 means luma (the only plane), 1,2 and 3 gives error message.
For RGB clips (planar RGB only!) operation is always 4=all planes, regardless of this parameter.


int  mode = 1
Processing mode:
  • 0 : only overlaps 1:1. This is faster but produces artifacts with high sigma values.
  • 1 : block overlaps 2:1. This is slower but produces fewer artifacts.
  • 2 : again 1:1 overlap but with a additional border. This reduces border artifacts seen with mode=0.
The speed is between mode 0 and 1. Kalman(bt=0) works well with mode=0.


int  bordersize = 1
Only used with mode 2. Defines the size of the border.


int  precision = 0
  • 0 : to use 16 bit floats(half precision).
  • 1 : to use 32 bit float(single precision) for the fft and 16 bit float for the wienner/kalman and sharpening.
  • 2 : always use 32 bit floats.
Using 16 bit float increases the performance but reduces precision. With a Geforce 7800GT precision=0 is ~1.5 times faster than than mode 2.
Note: in FFT3dGPU v0.8.6, for 10-32 bit clips, the parameter "precision" is ignored and is set to 2: always use 32 bit float internally.


bool  NVPerf = false
Enables support for NVPerfHUD (http://developer.nvidia.com/object/nvperfhud_home.html).


float  degrid = 1.0
Enables degriding. Only works well with mode=1. Doesn't degrid the Kalman filter (but it does degrid the sharpening (if enabled) after kalman filter). default 1.0 for mode=1, 0.0 for mode=0 or 2.


float  scutoff = 0.3
Sharpening cutoff frequency, relative to max.


float  svr = 1.0
Sharpening (and dehalo) vertical ratio (to horizontal) (default=1.0 - same as horizontal, 0 - no vertical sharpening)


float  smin = 4.0
float  smin = 20.0
Minimum and maximum limit (approximate noise margin) for sharpening stage.


float  kratio = 2.0
Ratio of threshold to sigma to reset Kalman filter. Variation threshold = sigma*kratio, good value is about from 1.5 to 3.0


int  ow = bw/2
int  oh = bh/2
Overlap width and height. This only works with mode=1. This specifies how big the overlap between the blocks are. Overlap size must be less than or equal to half the blocksize. ow must be even.


int  wintype = 0
weighting windows type:
  • 0 : same analysis and synthesis half-cosine window, used in all versions before 1.4.
  • 1 : intermediate between 0 and 2
  • 2 : flat analysis window, rised cosine (Hanning) synthesis window.


bool  interlaced = false
Set to true for separate filtering for each field.


float  singma2 = sigma
float  singma3 = sigma
float  singma4 = sigma
Given noise value at second, third, and lowest scale level frequencies. Only values greater that 0.0


bool  oldfft = false
Set to true to use the old fftcode (used in version 0.6.2 and lower) false to use new fft code. If not defined fft3dgpu will use the fastest code.


[edit] Changelog

Version      Date            Changes
v0.8.6 2021/01/04 - Support 10-32 bit formats. Parameter ranges (smin, smax, sigmas) are the same for all bit depths. - For 10-32 bit clips parameter "precision" is ignored and is set to 2: always use 32 bit float internally - Note: for 8 bits "precision" default is 0: calculation is done in 16 bit floating point type (s10m5) internally
v0.8.5 2020/11/01 - Avisynth+ 3.6 additional support: preserve frame properties - Fix bt=0 Kalman mode warning in fft3dgpu.hlsl
v0.8.4 2018/11/21 - New color spaces besides YV12 and YUY2: Y8, YV16, YV411, YV24 (all 8 bits), 8 bit Planar RGB (AviSynth+) - FFT3dGPU filter registers itself as MT_SERIALIZED automatically for AviSynth+: no need for SetFilterMTMode.
v0.8.3 2018/11/18 - project hosted on https://github.com/pinterf/FFT3dGPU - source and project files updated to build under Visual Studio 2017 - (15.9) using DX SDK June 2010 - Moved to Avisynth Interface 6 (v2.6) - using headers from the AviSynth+ project - added x64 platform
v0.8.2a 2006/08/24 - Last release by tsp, see included docs for full changelog


[edit] Archived Downloads

Version Download Mirror
v0.8.2a fft3dgpu0.8.2a.exe fft3dgpu0.8.2a.exe
v0.8.2 (x64) FFT3DGPU_3-15-2010.rar FFT3DGPU_3-15-2010.rar
v0.8.2 fft3dgpu0.8.2.7z /// fft3dgpu0.8.2.exe fft3dgpu0.8.2.7z /// fft3dgpu0.8.2.exe
  • v0.8.2 x64 version compiled by Joshy D.


[edit] External Links




Back to External Filters

Personal tools