public class SDImage extends SDOps
| Modifier and Type | Method and Description |
|---|---|
SDVariable |
adjustContrast(SDVariable in,
double factor)
Adjusts contrast of RGB or grayscale images.
|
SDVariable |
adjustContrast(String name,
SDVariable in,
double factor)
Adjusts contrast of RGB or grayscale images.
|
SDVariable |
adjustHue(SDVariable in,
double delta)
Adjust hue of RGB image
|
SDVariable |
adjustHue(String name,
SDVariable in,
double delta)
Adjust hue of RGB image
|
SDVariable |
adjustSaturation(SDVariable in,
double factor)
Adjust saturation of RGB images
|
SDVariable |
adjustSaturation(String name,
SDVariable in,
double factor)
Adjust saturation of RGB images
|
SDVariable |
cropAndResize(SDVariable image,
SDVariable cropBoxes,
SDVariable boxIndices,
SDVariable cropOutSize)
Given an input image and some crop boxes, extract out the image subsets and resize them to the specified size.
|
SDVariable |
cropAndResize(SDVariable image,
SDVariable cropBoxes,
SDVariable boxIndices,
SDVariable cropOutSize,
double extrapolationValue)
Given an input image and some crop boxes, extract out the image subsets and resize them to the specified size.
|
SDVariable |
cropAndResize(String name,
SDVariable image,
SDVariable cropBoxes,
SDVariable boxIndices,
SDVariable cropOutSize)
Given an input image and some crop boxes, extract out the image subsets and resize them to the specified size.
|
SDVariable |
cropAndResize(String name,
SDVariable image,
SDVariable cropBoxes,
SDVariable boxIndices,
SDVariable cropOutSize,
double extrapolationValue)
Given an input image and some crop boxes, extract out the image subsets and resize them to the specified size.
|
SDVariable |
extractImagePatches(SDVariable image,
int[] kSizes,
int[] strides,
int[] rates,
boolean sameMode)
Given an input image, extract out image patches (of size kSizes - h x w) and place them in the depth dimension.
|
SDVariable |
extractImagePatches(String name,
SDVariable image,
int[] kSizes,
int[] strides,
int[] rates,
boolean sameMode)
Given an input image, extract out image patches (of size kSizes - h x w) and place them in the depth dimension.
|
SDVariable |
hsvToRgb(SDVariable input)
Converting image from HSV to RGB format
|
SDVariable |
hsvToRgb(String name,
SDVariable input)
Converting image from HSV to RGB format
|
SDVariable |
imageResize(SDVariable input,
SDVariable size,
boolean preserveAspectRatio,
boolean antialis,
ImageResizeMethod ImageResizeMethod)
Resize images to size using the specified method.
|
SDVariable |
imageResize(SDVariable input,
SDVariable size,
ImageResizeMethod ImageResizeMethod)
Resize images to size using the specified method.
|
SDVariable |
imageResize(String name,
SDVariable input,
SDVariable size,
boolean preserveAspectRatio,
boolean antialis,
ImageResizeMethod ImageResizeMethod)
Resize images to size using the specified method.
|
SDVariable |
imageResize(String name,
SDVariable input,
SDVariable size,
ImageResizeMethod ImageResizeMethod)
Resize images to size using the specified method.
|
SDVariable |
nonMaxSuppression(SDVariable boxes,
SDVariable scores,
int maxOutSize,
double iouThreshold,
double scoreThreshold)
Greedily selects a subset of bounding boxes in descending order of score
|
SDVariable |
nonMaxSuppression(String name,
SDVariable boxes,
SDVariable scores,
int maxOutSize,
double iouThreshold,
double scoreThreshold)
Greedily selects a subset of bounding boxes in descending order of score
|
SDVariable |
randomCrop(SDVariable input,
SDVariable shape)
Randomly crops image
|
SDVariable |
randomCrop(String name,
SDVariable input,
SDVariable shape)
Randomly crops image
|
SDVariable |
rgbToHsv(SDVariable input)
Converting array from HSV to RGB format
|
SDVariable |
rgbToHsv(String name,
SDVariable input)
Converting array from HSV to RGB format
|
SDVariable |
rgbToYiq(SDVariable input)
Converting array from RGB to YIQ format
|
SDVariable |
rgbToYiq(String name,
SDVariable input)
Converting array from RGB to YIQ format
|
SDVariable |
rgbToYuv(SDVariable input)
Converting array from RGB to YUV format
|
SDVariable |
rgbToYuv(String name,
SDVariable input)
Converting array from RGB to YUV format
|
SDVariable |
yiqToRgb(SDVariable input)
Converting image from YIQ to RGB format
|
SDVariable |
yiqToRgb(String name,
SDVariable input)
Converting image from YIQ to RGB format
|
SDVariable |
yuvToRgb(SDVariable input)
Converting image from YUV to RGB format
|
SDVariable |
yuvToRgb(String name,
SDVariable input)
Converting image from YUV to RGB format
|
public SDImage(SameDiff sameDiff)
public SDVariable cropAndResize(SDVariable image, SDVariable cropBoxes, SDVariable boxIndices, SDVariable cropOutSize, double extrapolationValue)
image - Input image, with shape [batch, height, width, channels] (NUMERIC type)cropBoxes - Float32 crop, shape [numBoxes, 4] with values in range 0 to 1 (NUMERIC type)boxIndices - Indices: which image (index to dimension 0) the cropBoxes belong to. Rank 1, shape [numBoxes] (NUMERIC type)cropOutSize - Output size for the images - int32, rank 1 with values [outHeight, outWidth] (INT type)extrapolationValue - Used for extrapolation, when applicable. 0.0 should be used for the defaultpublic SDVariable cropAndResize(String name, SDVariable image, SDVariable cropBoxes, SDVariable boxIndices, SDVariable cropOutSize, double extrapolationValue)
name - name May be null. Name for the output variableimage - Input image, with shape [batch, height, width, channels] (NUMERIC type)cropBoxes - Float32 crop, shape [numBoxes, 4] with values in range 0 to 1 (NUMERIC type)boxIndices - Indices: which image (index to dimension 0) the cropBoxes belong to. Rank 1, shape [numBoxes] (NUMERIC type)cropOutSize - Output size for the images - int32, rank 1 with values [outHeight, outWidth] (INT type)extrapolationValue - Used for extrapolation, when applicable. 0.0 should be used for the defaultpublic SDVariable cropAndResize(SDVariable image, SDVariable cropBoxes, SDVariable boxIndices, SDVariable cropOutSize)
image - Input image, with shape [batch, height, width, channels] (NUMERIC type)cropBoxes - Float32 crop, shape [numBoxes, 4] with values in range 0 to 1 (NUMERIC type)boxIndices - Indices: which image (index to dimension 0) the cropBoxes belong to. Rank 1, shape [numBoxes] (NUMERIC type)cropOutSize - Output size for the images - int32, rank 1 with values [outHeight, outWidth] (INT type)public SDVariable cropAndResize(String name, SDVariable image, SDVariable cropBoxes, SDVariable boxIndices, SDVariable cropOutSize)
name - name May be null. Name for the output variableimage - Input image, with shape [batch, height, width, channels] (NUMERIC type)cropBoxes - Float32 crop, shape [numBoxes, 4] with values in range 0 to 1 (NUMERIC type)boxIndices - Indices: which image (index to dimension 0) the cropBoxes belong to. Rank 1, shape [numBoxes] (NUMERIC type)cropOutSize - Output size for the images - int32, rank 1 with values [outHeight, outWidth] (INT type)public SDVariable adjustContrast(SDVariable in, double factor)
in - images to adjust. 3D shape or higher (NUMERIC type)factor - multiplier for adjusting contrastpublic SDVariable adjustContrast(String name, SDVariable in, double factor)
name - name May be null. Name for the output variablein - images to adjust. 3D shape or higher (NUMERIC type)factor - multiplier for adjusting contrastpublic SDVariable adjustHue(SDVariable in, double delta)
in - image as 3D array (NUMERIC type)delta - value to add to hue channelpublic SDVariable adjustHue(String name, SDVariable in, double delta)
name - name May be null. Name for the output variablein - image as 3D array (NUMERIC type)delta - value to add to hue channelpublic SDVariable adjustSaturation(SDVariable in, double factor)
in - RGB image as 3D array (NUMERIC type)factor - factor for saturationpublic SDVariable adjustSaturation(String name, SDVariable in, double factor)
name - name May be null. Name for the output variablein - RGB image as 3D array (NUMERIC type)factor - factor for saturationpublic SDVariable extractImagePatches(SDVariable image, int[] kSizes, int[] strides, int[] rates, boolean sameMode)
image - Input image to extract image patches from - shape [batch, height, width, channels] (NUMERIC type)kSizes - Kernel size - size of the image patches, [height, width] (Size: Exactly(count=2))strides - Stride in the input dimension for extracting image patches, [stride_height, stride_width] (Size: Exactly(count=2))rates - Usually [1,1]. Equivalent to dilation rate in dilated convolutions - how far apart the output pixels
in the patches should be, in the input. A dilation of [a,b] means every ath pixel is taken
along the height/rows dimension, and every bth pixel is take along the width/columns dimension (Size: AtLeast(min=0))sameMode - Padding algorithm. If true: use Same paddingpublic SDVariable extractImagePatches(String name, SDVariable image, int[] kSizes, int[] strides, int[] rates, boolean sameMode)
name - name May be null. Name for the output variableimage - Input image to extract image patches from - shape [batch, height, width, channels] (NUMERIC type)kSizes - Kernel size - size of the image patches, [height, width] (Size: Exactly(count=2))strides - Stride in the input dimension for extracting image patches, [stride_height, stride_width] (Size: Exactly(count=2))rates - Usually [1,1]. Equivalent to dilation rate in dilated convolutions - how far apart the output pixels
in the patches should be, in the input. A dilation of [a,b] means every ath pixel is taken
along the height/rows dimension, and every bth pixel is take along the width/columns dimension (Size: AtLeast(min=0))sameMode - Padding algorithm. If true: use Same paddingpublic SDVariable hsvToRgb(SDVariable input)
input - 3D image (NUMERIC type)public SDVariable hsvToRgb(String name, SDVariable input)
name - name May be null. Name for the output variableinput - 3D image (NUMERIC type)public SDVariable imageResize(SDVariable input, SDVariable size, boolean preserveAspectRatio, boolean antialis, ImageResizeMethod ImageResizeMethod)
input - 4D image [NHWC] (NUMERIC type)size - new height and width (INT type)preserveAspectRatio - Whether to preserve the aspect ratio. If this is set, then images will be resized to a size that fits in size while preserving the aspect ratio of the original image. Scales up the image if size is bigger than the current size of the image. Defaults to False.antialis - Whether to use an anti-aliasing filter when downsampling an imageImageResizeMethod - ResizeBilinear: Bilinear interpolation. If 'antialias' is true, becomes a hat/tent filter function with radius 1 when downsampling.
ResizeLanczos5: Lanczos kernel with radius 5. Very-high-quality filter but may have stronger ringing.
ResizeBicubic: Cubic interpolant of Keys. Equivalent to Catmull-Rom kernel. Reasonably good quality and faster than Lanczos3Kernel, particularly when upsampling.
ResizeGaussian: Gaussian kernel with radius 3, sigma = 1.5 / 3.0.
ResizeNearest: Nearest neighbor interpolation. 'antialias' has no effect when used with nearest neighbor interpolation.
ResizeArea: Anti-aliased resampling with area interpolation. 'antialias' has no effect when used with area interpolation; it always anti-aliases.
ResizeMitchelcubic: Mitchell-Netravali Cubic non-interpolating filter. For synthetic images (especially those lacking proper prefiltering), less ringing than Keys cubic kernel but less sharp.public SDVariable imageResize(String name, SDVariable input, SDVariable size, boolean preserveAspectRatio, boolean antialis, ImageResizeMethod ImageResizeMethod)
name - name May be null. Name for the output variableinput - 4D image [NHWC] (NUMERIC type)size - new height and width (INT type)preserveAspectRatio - Whether to preserve the aspect ratio. If this is set, then images will be resized to a size that fits in size while preserving the aspect ratio of the original image. Scales up the image if size is bigger than the current size of the image. Defaults to False.antialis - Whether to use an anti-aliasing filter when downsampling an imageImageResizeMethod - ResizeBilinear: Bilinear interpolation. If 'antialias' is true, becomes a hat/tent filter function with radius 1 when downsampling.
ResizeLanczos5: Lanczos kernel with radius 5. Very-high-quality filter but may have stronger ringing.
ResizeBicubic: Cubic interpolant of Keys. Equivalent to Catmull-Rom kernel. Reasonably good quality and faster than Lanczos3Kernel, particularly when upsampling.
ResizeGaussian: Gaussian kernel with radius 3, sigma = 1.5 / 3.0.
ResizeNearest: Nearest neighbor interpolation. 'antialias' has no effect when used with nearest neighbor interpolation.
ResizeArea: Anti-aliased resampling with area interpolation. 'antialias' has no effect when used with area interpolation; it always anti-aliases.
ResizeMitchelcubic: Mitchell-Netravali Cubic non-interpolating filter. For synthetic images (especially those lacking proper prefiltering), less ringing than Keys cubic kernel but less sharp.public SDVariable imageResize(SDVariable input, SDVariable size, ImageResizeMethod ImageResizeMethod)
input - 4D image [NHWC] (NUMERIC type)size - new height and width (INT type)ImageResizeMethod - ResizeBilinear: Bilinear interpolation. If 'antialias' is true, becomes a hat/tent filter function with radius 1 when downsampling.
ResizeLanczos5: Lanczos kernel with radius 5. Very-high-quality filter but may have stronger ringing.
ResizeBicubic: Cubic interpolant of Keys. Equivalent to Catmull-Rom kernel. Reasonably good quality and faster than Lanczos3Kernel, particularly when upsampling.
ResizeGaussian: Gaussian kernel with radius 3, sigma = 1.5 / 3.0.
ResizeNearest: Nearest neighbor interpolation. 'antialias' has no effect when used with nearest neighbor interpolation.
ResizeArea: Anti-aliased resampling with area interpolation. 'antialias' has no effect when used with area interpolation; it always anti-aliases.
ResizeMitchelcubic: Mitchell-Netravali Cubic non-interpolating filter. For synthetic images (especially those lacking proper prefiltering), less ringing than Keys cubic kernel but less sharp.public SDVariable imageResize(String name, SDVariable input, SDVariable size, ImageResizeMethod ImageResizeMethod)
name - name May be null. Name for the output variableinput - 4D image [NHWC] (NUMERIC type)size - new height and width (INT type)ImageResizeMethod - ResizeBilinear: Bilinear interpolation. If 'antialias' is true, becomes a hat/tent filter function with radius 1 when downsampling.
ResizeLanczos5: Lanczos kernel with radius 5. Very-high-quality filter but may have stronger ringing.
ResizeBicubic: Cubic interpolant of Keys. Equivalent to Catmull-Rom kernel. Reasonably good quality and faster than Lanczos3Kernel, particularly when upsampling.
ResizeGaussian: Gaussian kernel with radius 3, sigma = 1.5 / 3.0.
ResizeNearest: Nearest neighbor interpolation. 'antialias' has no effect when used with nearest neighbor interpolation.
ResizeArea: Anti-aliased resampling with area interpolation. 'antialias' has no effect when used with area interpolation; it always anti-aliases.
ResizeMitchelcubic: Mitchell-Netravali Cubic non-interpolating filter. For synthetic images (especially those lacking proper prefiltering), less ringing than Keys cubic kernel but less sharp.public SDVariable nonMaxSuppression(SDVariable boxes, SDVariable scores, int maxOutSize, double iouThreshold, double scoreThreshold)
boxes - Might be null. Name for the output variable (NUMERIC type)scores - vector of shape [num_boxes] (NUMERIC type)maxOutSize - scalar representing the maximum number of boxes to be selectediouThreshold - threshold for deciding whether boxes overlap too much with respect to IOUscoreThreshold - threshold for deciding when to remove boxes based on scorepublic SDVariable nonMaxSuppression(String name, SDVariable boxes, SDVariable scores, int maxOutSize, double iouThreshold, double scoreThreshold)
name - name May be null. Name for the output variableboxes - Might be null. Name for the output variable (NUMERIC type)scores - vector of shape [num_boxes] (NUMERIC type)maxOutSize - scalar representing the maximum number of boxes to be selectediouThreshold - threshold for deciding whether boxes overlap too much with respect to IOUscoreThreshold - threshold for deciding when to remove boxes based on scorepublic SDVariable randomCrop(SDVariable input, SDVariable shape)
input - input array (NUMERIC type)shape - shape for crop (INT type)public SDVariable randomCrop(String name, SDVariable input, SDVariable shape)
name - name May be null. Name for the output variableinput - input array (NUMERIC type)shape - shape for crop (INT type)public SDVariable rgbToHsv(SDVariable input)
input - 3D image (NUMERIC type)public SDVariable rgbToHsv(String name, SDVariable input)
name - name May be null. Name for the output variableinput - 3D image (NUMERIC type)public SDVariable rgbToYiq(SDVariable input)
input - 3D image (NUMERIC type)public SDVariable rgbToYiq(String name, SDVariable input)
name - name May be null. Name for the output variableinput - 3D image (NUMERIC type)public SDVariable rgbToYuv(SDVariable input)
input - 3D image (NUMERIC type)public SDVariable rgbToYuv(String name, SDVariable input)
name - name May be null. Name for the output variableinput - 3D image (NUMERIC type)public SDVariable yiqToRgb(SDVariable input)
input - 3D image (NUMERIC type)public SDVariable yiqToRgb(String name, SDVariable input)
name - name May be null. Name for the output variableinput - 3D image (NUMERIC type)public SDVariable yuvToRgb(SDVariable input)
input - 3D image (NUMERIC type)public SDVariable yuvToRgb(String name, SDVariable input)
name - name May be null. Name for the output variableinput - 3D image (NUMERIC type)Copyright © 2021. All rights reserved.