I led a team of eight cross-functional specialists to develop and deploy Philips'
Intellispace Genomics product at the nation's tier-one cancer centers. This includes a $6
million USD project that I was responsible for, a clinical trial matching tool for cancer
patients
The problem of clinical trial matching is to extract the relevant eligibility criteria from
more than 40,000 clinical trial protocols and then match them to the given cancer patient's
profile. The relevant matching criteria include cancer type, staging, genetic mutations, the
patient's demographics, and comorbidities. This is an extremely complex NLP problem applied
across big data.
With my team, I evolved a naive, Elasticsearch approach to a pipeline using a hybrid of
Named Entity Recognition (NER) and logical satisfiability theory. We successfully trained a
Long Short-Term Memory neural network (LSTM) with a Conditional Random Field (CRF) output
layer using clinical domain informed corpora as word embedding. As a result of our work, my
team successfully achieved more than 95% accuracy in automated clinical trial matching—as
validated by pathologists and oncologists.
The project yielded impressive results that saved tremendous time for clinicians. It is now
a commercial success, replacing IBM Watson and deployed as part of the Philips Intellispace
Genomics solution at the nation's top cancer institutions such as Dana-Farber, MD Anderson,
Westchester Medical Center, and Boston Children's hospital.