Page 2 Back to the top
be guaranteed. In some imaging domains, however, labeled data with pixel- or voxel-level label accuracy
are scarce due to the cost of acquiring them. This problem is exacerbated further in domains like medical
imaging, where there is no single ground truth label, resulting in large amounts of repeat variability in the
labels. In this work, we propose “supervision by denoising” (SUD), a framework that enables us to supervise
reconstruction models using their own denoised output as soft labels. SUD unifies stochastic averaging and
spatial denoising techniques under a spatio-temporal denoising framework and alternates denoising and
model weight update steps in an optimization framework for semi-supervision. [Paper]
Transform quantization for CNN compression. IEEE Trans Pattern Anal Mach Intell, 2022. In this work, we
compress convolutional neural network (CNN) post-training via transform quantization. CNN quantization
techniques often ignore the joint statistics of weights and activations, producing sub-optimal CNN
performance at a given bit-rate, or consider their joint statistics during training only and do not facilitate
efficient compression of already trained CNN models. The proposed transform quantization framework
unifies quantization and dimensionality reduction (decorrelation) techniques in a single framework to
facilitate low bit-rate compression of CNNs and efficient inference in the transform domain. We first
introduce a theory of rate and distortion for CNN quantization, and pose optimum quantization as a rate-
distortion optimization problem. We then show that this problem can be solved using optimal bit-depth
allocation following decorrelation by the optimal End-to-end Learned Transform (ELT). [Paper]
Fast optical flow extraction from compressed video. IEEE Trans Image Process, 2020. We propose the fast
optical flow extractor, a filtering method that recovers artifact-free optical flow fields from HEVC-
compressed video. To extract accurate optical flow fields, we form a regularized optimization problem that
considers the smoothness of the solution and the pixelwise confidence weights of an artifact-ridden HEVC
motion field. Solving such an optimization problem is slow, so we first convert the problem into a
confidence-weighted filtering task. By leveraging the already-available HEVC motion parameters, we
achieve a 100-fold speed-up in the running times compared to similar methods, while producing subpixel-
accurate flow estimates. The fast optical flow extractor is useful when video frames are already available in
coded formats. Our method is not specific to a coder, and works with motion fields from video coders such
as H.264/AVC and HEVC. [Paper]