Small Simplicity

Understanding Intelligence from Computational Perspective

Projects

My previous projects, tutorials, talks


Tutorial

Experimental Machine Learning with Holoviews, Geoviews and PyTorch

1.5hr tutorial session @ PyData LA, 2019 - materials, video

This tutorial introduces how to make your data exploration and model building process more interactive and exploratory by using the combination of JupyterLab, HoloViews, and PyTorch. I will focus on the problem of classfying different types of roads on satellite images, defined as a "multi-class semantic segmentation problem". Starting from the data exploration to the trained model understanding, we will cover different ways to explore the data and models with simple, interactive GUIs in Jupyter notebooks. Specifically, the tutorial covers, with the emphasis on the experimental nature of model building:

  • how to make your data exploration more intuitive and experimental using HoloViews libraries
  • how to turn your model script into a simple GUI that allows interactive hyperparameter tuning and model exploration
  • how to monitor the training process in realtime
  • how to quickly build a GUI tool to inspect the trained models in the same Jupyter notebook
pydata-0 pydata-1 pydata-1-2
pydata-2 pydata-3 pydata-4


MINT NetCDF

website

We proposed a self-describing data format for structured gridded datasets for MINT data catalog and visualization based on the NetCDF and the CF convention. The purpose of this specification is to establish a unified data format within MINT (and in the near future, among World Modelers and broader scientific community) for an efficient data exchange and knowledge discovery. The document proposes three levels of specification ("Mandatory", "Recommended" and "Optional") for metadata related to space, time and domain-specific semantics of the data. It provides a unified convention on which information to document as well as a specific format to represent/store the information. As a result, it facilitates the creation and sharing of scientific data across different scientific domains such as meteorology, oceanography and GIS, and also has a potential to contribute to novel, interdisciplinary discoveries.

In addition to the proposal, we have created an interactive tool for exploring datasets conforming to this specification, called MINT-GeoViz.


MINT-GeoViz

code, demo

MINT-GeoViz is an interactive visualization library for large geospatial datasets that follow our proposed MINT NetCDF convention. Some examples of such datasets include a collection of year-long satellite images in Africa, global oceanographic time series and hydrographic measurements. Using efficient data access (via Dask), parallelized computation (via Numba, DataShader) and accurate visualization techniques (via DataShader, ColorCat), it works with datasets of millions or billions of data points in real-time. For example, a user can visualize the entire earthquake dataset (with 2.1 million seismological events) on a global map. Our tool goes a step further by allowing the user to perform new computations as they explore the visualization, eg. computing aggregated statistics, transforming high dimensional data to a time series.

fldas-demo-1 fldas-demo-1
fldas-demo-2 fldas-demo-3


Generating Gaussian, Pictures, and Stories with Generative Adversarial Networks

Hayley Song*, Adam Yala*

paper

In this paper, we explore the framework of Adversarial Training as introduced in the original paper by Goodfellow et al.. Generative Aversarial Network (GAN) is a semi-supervised training method and has shown promising results in various tasks such as Image Generation, Transfer Learning, Imitation Learning and Text Generation. We aim to expose the issues and suceesses of GANs while experimenting through diverse generation tasks. Specifically, we work through generating a Gaussian distribution, images, and texts. For each experiement, we investigate the effects of parameters (e.g pre-training of the discriminator) on the convergence and the performance of the adversarial nets.

gan


Cost-effective 3D Reconstruction of Human Arms Using RGBD Sensor for Early Diagnosis of Lymphedema

Hayley Song, Julian Straub, Erik Nyguen, Fernando Yordan, Regina Barzilay

Arm lymphedema is an incurable condition characterized by large swelling of the arms and is commonly seen in breast cancer survivors. Early detection is crucial for treatment to be effective in mitigating painful symptoms, and yet there is no method that allows physicians to monitor the progression of lymphedema in a way that is cost effective and reliable. This study proposes a general purpose device that reconstructs a 3D model of objects in view of multiple RGB-D sensors. This device reconstructs a patient's arm over time which directly extends to measuring and tracking volume change over time. The accuracy of our system is comparable to the gold standard of water displacement, and is better than the state-of-the-art Perometer. However it costs significantly less than the Perometer, and is more hygienic than the water displacement method. It contributes to both short-term and long-term care for the lymphedema patients. Medical practitioners will be able to provide more timely treatment via accurate reconstruction and volume calculation. Further the generation of reliable data from our device will enable future studies of lymphedema and its progression.

3d_recon_prototype


Automatic Cell Detection using HOG features and SVM

Nicha Apichitsopa*, Boying Meng*, Hayley Song*

paper, slides (2016 Fall)

The analysis of cell trajectories inside microchannels is a critical part of many microfluidic systems. This task requires automated cell detection and cell tracking algorithms in order to reliably extract cell positions over time. Such algorithms need be robust against shape deformation, variable illumination, and noises from sensors and cell movements. In this report, we prove the ability of our machine-learning detection algorithm based on the Histogram of Oriented Gradients (HOG) and Support Vector Machines (SVMs), and compare its performances to the classic image segmentation method. We also investiage different features extracted by various machine learning algorithms and discuss how they affect the performances of cell detection and tracking.

cell-detection-1 cell-detection-


FreeFlow: Unintrusive Reading Device for Printed Texts

Hayley Song, Suvrit Sra

paper, poster (2016 Spring)

free-flow-demo free-flow-workflow

FreeFlow is a software for the pen-style, hand-held device that allows a user to search for definitions by “clicking” on the printed text. It is the first end-to-end system that performs such functions with high accuracy (95%) under variable illumination and motion blur from hand movements. It is composed of the four main modules: 1) capture, 2) preprocessing, 3) recogtnition and 4) dictionary search. In this paper, we discuss the details of our system and its performances under various real-world settings.


3D Air Gesture Recognizer using Dynamic Time Warping and KNN

Hayley Song*, Chongyuan Xiang*

poster, code and dataset (MIT CSAIL, 2016)

In this project, we use the Dynamic Time Warping method and the Neearest Neighbor to design a recognizer for 3D alphabet gestures drawn in the air. We collected the air gesture data from 11 users, and designed the features using speed, acceleration and rotation in the three dimensional space.

3d-air-gesture-workflow


(*: Equal contribution)