Tuesday, March 29, 2016

ML Meeting Notes & Research Opportunities

 
Introduction & Progress Update
 
We finally had a long-due gathering of machine learning late afternoon yesterday. Even though some of our members had college class, piano class, etc.. We still had eight members attended the meeting including new friends! Mr. Lin went through the basic introduction of Artificial Neural Networks (ANN), Perceptrons, and Multi-Layer Perceptrons (MLP). He also showed the current progress of Machine Learning (ML) research group. It can be summarized as the following:
  • Perceptron has been studied, understood, implemented in C code, and proved to be functioning. It can be used to separate two linearly separable classes.
  • Perceptron algorithm has been used to distinguish hand gestures between thumb and pinky by the Electromyography (EMG) signals acquired from two muscles in the forearm. This "Hand-to-Hand Communication" project is a current project of Advanced STEM Research class.
  • Multi-Layer Perceptron has been studied, partially understood, and implemented in C code. It has been trained (20,000 samples) and tested (2,500 samples) using artificially generated data set including five classes in the 3D space. So far, it can separate the testing data properly, and the error rate is 2.48 %. 
  • The current focus is to apply the MLP to the "Hand-to-Hand Communication" project such that we can recognize hand gestures form all five fingers.
You can view the detailed presentation by clicking the link

Research Opportunities

At this moment, four research opportunities are available for students to ramp up the ML learning curve and develop skill sets in specific fields. 
  1. Subway Signs Recognition: We will start from recognizing train numbers using MLP. Some image pre-processing and feature extraction will be provided to create the feature vectors as the input of the MLP algorithm. Our goal is to recognize all the subway signs by classification.
  2. Drone Tracking: We will use MLP to find the location (3D coordinates) of a flying drone through the positions (2D coordinates) in the image planes of multi-angle cameras by regression
  3. Bioinformatics: Bioinformatics is the application of computer technology to the management of biological information. Computers are used to gather, store, analyze and integrate biological and genetic information which can then be applied to gene-based drug discovery and development. We are in the process of forming a bioinformatics project with the Evolutionary Bioinformatics Lab at Hunter College.
  4. Watson Analytics: IBM is actively developing their Watson-based business. Harnessing the power of Watson to various fields in our lives depends on software applications. We may be able enter the Watson world by learning and developing apps on this commercially available platform.   
So far, a few students have decided to sign up specific activities:
  1. Subway Signs Recognition: Qin Ying, Tai Wei and Alan.
  2. Drone Tracking: (open).
  3. Bioinformatics: Belinda.
  4. Watson Analytics: Michael.
For people who haven't signed up specific projects yet or people who have missed the meeting, you can email Mr. Lin your choice. You are more than welcome to review the presentation and discuss the details of the research opportunities with Mr. Lin. Mr. Lin will soon post further information for each team to start working on their project!

Thursday, March 24, 2016

Invitation: Machine Learning Group Gathering

Hi All,

It has been a while since our last meeting. Even though it seems to be relatively quiet on the blog, some substantial progresses have been made in the past few weeks. The Perceptron has been understood, coded, and applied to investigate a finger gesture recognition problem encountered in the Advanced STEM Research class. The first version of Multi-Layer Perception (MLP) has been implemented. The results of initial training and testing based on artificially generated data sets have been very promising. It will soon be applied to several projects including recognizing finger gestures, locating drone positions, and recognizing subway signs!

I believe that it's a good timing for us to gather together, share what we have learned, and plan for the next step such that every member can be benefited from the information and knowledge accumulated in this research, build skills through hands-on exercises, and contribute to the Machine Learning Group in a substantial way in the coming months.

You are invited to attend the Machine Learning Group Gathering at 4:00 pm on Monday (03/28) in Room 417. You may spread the words to anyone of your friends who is also interested. I am looking forward to seeing you then!

Sunday, March 13, 2016

Machine Learning in Healthcare

I attended a technical symposium yesterday hosted by the CIE-USA (Chinese Institute of Engineers-USA). One of the most popular plenary presentation was delivered by Dr. Jianying Hu (a Distinguished Research Staff Member and Senior Manger of Health Informatics Research at IBM T. J. Watson Research Center, NY) about "Data Driven Healthcare Analytics". Her team leveraged the data existing in the healthcare ecosystem, and developed various machine learning-based tools to help players such as doctors and medical directors in this ecosystem to make better decisions in care delivery and care management. For example, the Care Pathway Explorer is a system that mines and visualizes a set of frequent event sequences from patient EMR (Electronic Medical Records) data. The data of more than 200,000 patients over three years time frame has been analyzed in her research. 
Overview of the Care Pathway Explorer system: It contains four major modules: Event Database, Data Preprocessor, Frequent Pattern Analytics, and a Visual Interface.
"The goal is to utilize historical EMR data to extract common sequences of medical events such as diagnoses and treatments, and investigate how these sequences correlate with patient outcome." Patient similarity has been extracted through machine learning techniques to help decision making process. 
Care Pathway Explorer: A bubble chart and displays events of the most frequent patterns mined (left), and a flow visualization to show the most frequent patterns (right).
Insights of Care Pathway Explorer: Examples of insights reached interactively when mining patterns from patients with a hypertension pre-condition before the initial hyperlipidemi a diagnosis (2,800 patients, featuring 14,979 hyperlipidemia-related diagnoses and 24,898 medication events).
Machine learning has been integrated into the healthcare process and is ready for commercial deployment. I was so impressed to see how this new technology has started impacting the healthcare system. Though the speaker insisted that the system will only be a powerful tool for doctors to improve their efficiency, there were lots of feedback and comments about its potential to dramatically change (or even replace) doctors' roles in the healthcare system in the years to come.

I found a YouTube video and a paper in technical journal include the similar information shared by Dr. Jianying Hu yesterday. If you are interested in this field, you should take a look. Besides healthcare, machine learning has potential to change many other aspects of our lives. It encourages me to spend more time to catch up this fast-growing technology.