Resize

From Avisynth wiki
(Difference between revisions)
Jump to: navigation, search
(formatting, links, phrasing - see User:Raffriff42/sandbox for history details)
m (Spline based resizers: - the more sample points used, the more accurate the resampling (?))
Line 192: Line 192:
 
'''Spline16Resize''', '''Spline36Resize''' and '''Spline64Resize''' are three [[Wikipedia:Spline_interpolation|Spline based]] resizers. They are the (cubic) spline based resizers from [http://sourceforge.net/projects/panotools/ Panorama tools] that fit a spline through the sample points and then derives the filter kernel from the resulting blending polynomials. See [http://forum.doom9.org/showthread.php?t=147117 this thread] for the technical details.
 
'''Spline16Resize''', '''Spline36Resize''' and '''Spline64Resize''' are three [[Wikipedia:Spline_interpolation|Spline based]] resizers. They are the (cubic) spline based resizers from [http://sourceforge.net/projects/panotools/ Panorama tools] that fit a spline through the sample points and then derives the filter kernel from the resulting blending polynomials. See [http://forum.doom9.org/showthread.php?t=147117 this thread] for the technical details.
  
The rationale for '''Spline''' is to be as sharp as possible with less ringing artifacts than [[#LanczosResize|LanczosResize]] produces. '''Spline16Resize''' uses <tt>sqrt(16)=4</tt> sample points, '''Spline36Resize''' uses 6 sample points, etc ... The more sample points used, the sharper your clip will get. Several resizer comparison pages are given in the [[#External Links|External Links]] section.  
+
The rationale for '''Spline''' is to be as sharp as possible with less ringing artifacts than [[#LanczosResize|LanczosResize]] produces. '''Spline16Resize''' uses &radic;16 or 4 sample points, '''Spline36Resize''' uses &radic;36 or 6 sample points, etc ... The more sample points used, the more accurate the resampling. Several resizer comparison pages are given in the [[#External Links|External Links]] section.  
  
*'''Spline64Resize''' is considered the most accurate filter for enlarging or reducing by moderate amounts (less than double/half).  
+
*'''Spline64Resize''' may be the most accurate of the '''Resize''' filters.  
 
*'''Spline16Resize''' is sharper and [[Wikipedia:Ringing_artifacts|rings]] just a bit, which may be desirable with soft sources.  
 
*'''Spline16Resize''' is sharper and [[Wikipedia:Ringing_artifacts|rings]] just a bit, which may be desirable with soft sources.  
 
*'''Spline36Resize''' is somewhere in between the other two.
 
*'''Spline36Resize''' is somewhere in between the other two.

Revision as of 04:18, 1 February 2016

Scales the input video frames to an arbitrary new resolution, and optionally crops the frame before resizing with sub-pixel precision.

There are trade-offs to be considered between preservation (or augmentation) of image detail and possible artifacts (i.e., oversharpening).

Contents


Common Parameters

int  target_width, target_height =

Width and height of the returned clip.

int  src_left, src_top = 0, 0

See cropping discussion below.

Cropping of the left and top edges respectively, in pixels, before resizing.

int  src_width, src_height = (source width, height)

See cropping discussion below.

As with Crop, these arguments have different functionality, depending on their value:

  • If  > zero, these set the width and height of the clip before resizing.
  • If <= zero, they set the cropping of the right and bottom edges respectively, before resizing.

Note, there are certain limits:

  • clip.Width must be >= (src_left + width)
  • clip.Width must be >  (src_left + right)
  • clip.Height must be >= (src_top + height)
  • clip.Height must be >  (src_top + bottom)

...otherwise it would enlarge ("un-crop") the clip, or reduce width or height to 0, which is not allowed.


Cropping

  • All resizers have an expanded syntax which crops the frame before resizing:
BilinearResize(100, 150, src_left=10, src_top=10, src_width=200, src_height=300)

...or more succinctly:

BilinearResize(100, 150, 10, 10, 200, 300)

The operations are the same as if you put Crop before the Resize:

Crop(10, 10, 200, 300).BilinearResize(100, 150)
  • Note the cropping parameters are all floating point. This allows any Resize filter to be used as a sub-pixel shifter. [1]
  • Note that Crop gives a hard boundary, whereas the Resize filters interpolate pixels outside the cropped region – depending on the resizer kernel – bilinear, bicubic etc, and not beyond the edge of the image.
  • As a general rule,
    • Crop any hard borders or noise; Resize cropping may propagate the noise into the output.
    • Use Resize cropping to maintain accurate edge rendering when excising a part of a complete image.


BilinearResize

BilinearResize(clip clip, int target_width, int target_height [,
     float src_left, float src_top, float src_width, float src_height ] )

BilinearResize uses standard bilinear filtering and is a good choice for smoothing overly sharp sources.


BicubicResize

BicubicResize(clip clip, int target_width, int target_height [, float b, float c,
     float src_left, float src_top, float src_width, float src_height ] )

BicubicResize is similar to BilinearResize, except that instead of a linear filtering function it uses the Mitchell-Netravali two-part cubic. The parameters b and c can be used to adjust the properties of the cubic; they are sometimes referred to as "blurring" and "ringing" respectively.

If you are enlarging your video, you will get sharper results with BicubicResize than with BilinearResize. However, if you are shrinking it, you may prefer BilinearResize as it performs some antialiasing.

parameters b and c

float  b, c = 1/3

With b=0 and c=0.75 the filter is exactly the same as VirtualDub's precise bicubic. The default for both b and c is 1.0/3.0, which were the values recommended by Mitchell and Netravali as yielding the most visually pleasing results. Larger values of b and c can produce interesting op-art effects – for example, try b=0 and c=-5.0.

You have to set [b + 2c = 1] for the most numerically accurate filter. This gives for b=0 the maximum value of 0.5 for c, which is the Catmull-Rom spline and a good suggestion for sharpness.

As c exceeds 0.6, the filter starts to "ring" or overshoot. You won't get true sharpness – what you'll get is exaggerated edges. Negative values for b (although allowed) give undesirable results, so use b=0 for values of c > 0.5.


BlackmanResize

BlackmanResize(clip clip, int target_width, int target_height [,
     float src_left, float src_top, float src_width, float src_height, int taps ] )

BlackmanResize is a modification of LanczosResize that has better control of ringing artifacts for high numbers of taps.

parameter taps

int  taps = 4

See LanczosResize for an explanation of the taps argument (default 4, range 1-100).


GaussResize

GaussResize(clip clip, int target_width, int target_height [,
     float src_left, float src_top, float src_width, float src_height, float p ] )

GaussResize uses a gaussian resizer, which unlike the bicubics, does not overshoot – but perhaps does not appear as sharp to the eye.

parameter p

float  p = 30.0

Sharpness. Range from about 1 to 100, with 1 being very blurry and 100 being very sharp.


LanczosResize

LanczosResize(clip clip, int target_width, int target_height [,
     float src_left, float src_top, float src_width, float src_height, int taps ] )

Lanczos4Resize(clip clip, int target_width, int target_height [,
     float src_left, float src_top, float src_width, float src_height ] )

LanczosResize is a sharper alternative to BicubicResize. It is NOT suited for low bitrate video; the various Bicubic flavours are much better for this.

Lanczos4Resize is a short hand for LanczosResize(taps=4). It produces sharper images than LanczosResize with the default taps=3, especially useful when upsizing a clip.

parameter taps

int  taps = 3

Basically, taps affects sharpness. Default 3, range 1-100. Equal to the number of filter lobes (ignoring mirroring around the origin).

Note: the input argument named taps should really be called "lobes". When discussing resizers, "taps" has a different meaning, as described below:

So when people talk about Lanczos2, they mean a 2-lobe Lanczos-windowed sinc function. There are actually 4 lobes -- 2 on each side...

For upsampling (making the image larger), the filter is sized such that the entire equation falls across 4 input samples, making it a 4-tap filter. It doesn't matter how big the output image is going to be - it's still just 4 taps. For downsampling (making the image smaller), the equation is sized so it will fall across 4 *destination* samples, which obviously are spaced at wider intervals than the source samples. So for downsampling by a factor of 2 (making the image half as big), the filter covers 8 input samples, and thus 8 taps. For 3X downsampling, you need 12 taps, and so forth.

The total number of taps you need for downsampling is the downsampling ratio times the number of lobes, times 2. And practically, one needs to round that up to the next even integer. For upsampling, it's always 4 taps.

Don Munsil (avsforum post) | mirror.


PointResize

PointResize(clip clip, int target_width, int target_height [,
     float src_left, float src_top, float src_width, float src_height ] )

PointResize is the simplest resizer possible. It uses a Point Sampler or Nearest Neighbour algorithm, which usually results in a very "blocky" image. So in general this filter should only be used, if you intend to have inferior quality, or you need the clear pixel drawings. Useful for magnifying small areas for examination.


Spline based resizers

Spline16Resize(clip clip, int target_width, int target_height [,
     float src_left, float src_top, float src_width, float src_height ] )

Spline36Resize(clip clip, int target_width, int target_height [,
     float src_left, float src_top, float src_width, float src_height ] )

Spline64Resize(clip clip, int target_width, int target_height [,
     float src_left, float src_top, float src_width, float src_height ] )

Spline16Resize, Spline36Resize and Spline64Resize are three Spline based resizers. They are the (cubic) spline based resizers from Panorama tools that fit a spline through the sample points and then derives the filter kernel from the resulting blending polynomials. See this thread for the technical details.

The rationale for Spline is to be as sharp as possible with less ringing artifacts than LanczosResize produces. Spline16Resize uses √16 or 4 sample points, Spline36Resize uses √36 or 6 sample points, etc ... The more sample points used, the more accurate the resampling. Several resizer comparison pages are given in the External Links section.

  • Spline64Resize may be the most accurate of the Resize filters.
  • Spline16Resize is sharper and rings just a bit, which may be desirable with soft sources.
  • Spline36Resize is somewhere in between the other two.


SincResize

SincResize(clip clip, int target_width, int target_height [,
     float src_left, float src_top, float src_width, float src_height, int taps ] )

SincResize uses the truncated sinc function. It is very sharp, but prone to ringing artifacts.

parameter taps

int  taps = 4

See LanczosResize for an explanation of the taps argument (default 4, range 1-20).


Examples

  • Cropping:
Crop(10, 10, 200, 300).BilinearResize(100, 150)

which is nearly the same as:

BilinearResize(100, 150, 10, 10, 200, 300)
  • Load a video file and resize it to 240x180 (from whatever it was before)
AviSource("video.avi").BilinearResize(240,180)
  • Load a 720x480 (Rec. 601) video and resize it to 352x240 (VCD), preserving the correct aspect ratio
AviSource("dv.avi").BilinearResize(352, 240, 8, 0, 704, 480)

which is the same as:

AviSource("dv.avi").BilinearResize(352, 240, 8, 0, -8, -0)
  • Extract the upper-right quadrant of a 320x240 video and zoom it to fill the whole frame
BilinearResize(320, 240, 160, 0, 160, 120)


Notes

  • AviSynth has completely separate vertical and horizontal resizers. If input is the same as output on one axis, that resizer will be skipped. The resizer with the smallest downscale ratio is called first; this is done to preserve maximum quality, so the second resizer has the best possible picture to work with. Data storing will have an impact on what mods should be used for sizes when resizing and cropping; see Crop restrictions.


External Links


Changelog

v2.60 Added SincResize.
v2.58 Added BlackmanResize, Spline64Resize.
v2.56 Added Spline16Resize, Spline36Resize, GaussResize and taps parameter in LanczosResize; added offsets in Crop part of xxxResize.
v2.55 Added Lanczos4Resize.
Personal tools