Rhythm Vohra

BEng(Punjab University, 2019)

Notice of the Final Oral Examination for the Degree of Master of Applied Science

Topic

Single-Class Instance Segmentation for Vectorization of Line Drawings

Department of Electrical and Computer Engineering

Date & location

Tuesday, January 30, 2024
10:00 A.M.
Engineering Computer Science Building
Room 468 and Virtual

Reviewers

Supervisory Committee

Dr. Alexandra Branzan Albu, Department of Electrical and Computer Engineering, University of Victoria (Supervisor)
Dr. Pan Agathoklis, Department of Electrical and Computer Engineering, UVic (Unit Member)

External Examiner

Dr. Miguel Nacenta, Department of Computer Science, University of Victoria

Chair of Oral Examination

Dr. Elisabeth Gugl, Department of Economics, UVic

Abstract

Images can be represented and stored either in raster or in vector formats. Raster images are the most ubiquitous and are defined as matrices of pixel intensities/colours, while vector images consist of a finite set of geometric primitives, such as lines, curves, and polygons. Since geometric shapes are expressed via mathematical equations and defined by a limited number of control points, they can be manipulated in a much easier way than by directly working with pixels; hence, the vector format is much preferred to raster for image editing and understanding purposes. The conversion of a raster image into its vector correspondent is a non-trivial process, called image vectorization. Creating vector images from a given raster image can be time-consuming and requires the expertise of a skilled graphic user. This thesis explores the effectiveness of a Deep Learning (DL) based framework to vectorize raster images comprising line drawings with minimal user interventions. To improve the visual representation of the image, each stroke in the line drawing is represented with a different label and vectorized. In this document, we present an in-depth study of image vectorization, the objective of our research, challenges, potential resolutions, and compare the outcomes of our approach on six datasets consisting of different types of hand drawings. More specifically, this thesis begins by comparing raster images with vector images, the importance of image vectorization, and our objective to convert raster images to vector based representations by accurately separating each stroke from the line drawings. In further chapters of this thesis, a Deep Learning (DL) based segmentation methodology is introduced to perform Single-Class Instance Segmentation (SCIS) of hand drawings to process the input raster image by labeling each pixel as belonging to a particular stroke instance. This segmentation approach is able to leverage the spatial relationships between each stroke instance.

A novel loss function specifically designed to optimize our highly imbalanced datasets by scaling the margins and adding a regularization term to improve its feature selection technique. The weighted combination of our proposed margin regularized loss function is combined with the Dice loss to reduce the spatial overlap and improve the predictions over infrequent labels.

Finally, the effectiveness of our segmentation technique of line drawing vectorization is compared experimentally with the state-of-the-art and our reference method. Our method can successfully handle a wide variety of human drawing styles. The results are comparable in terms of accuracy and way ahead in terms of speed and complexity, with other methods.

Back to oral exams