Hi there - I am a PhD candidate in Biomedical Informatics at Stanford University, studying artificial intelligence and clinical informatics. I am currently advised by Serena Yeung, Curtis Langlotz, and Nigam Shah, and was previously advised by Matthew P. Lungren. I am affiliated with Stanford's MARVL lab and the AIMI Center, and have experience working at Google Research, Microsoft Research, Salesforce AI Research, and the Chan Zuckerberg Initiative.
My research focuses on the intersection of multimodal and self-supervised learning, and the application of these methods to improve healthcare.
The traditional process of diagnosing model behaviors in deployment settings involves labor-intensive data acquisition and annotation. Our proposed method, DrML, can discover high-error data slices, identify influential attributes, and rectify undesirable model behaviors, all without requiring any visual data. Through a combination of theoretical explanation and empirical verification, we present conditions under which classifiers trained on embeddings from one modality can be equivalently applied to embeddings from another modality.
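A minimal sketch of this idea, assuming a CLIP-style model whose text and image encoders map into a shared embedding space; the encoders below are random stand-ins (not the DrML implementation) so the snippet runs end to end:

```python
# Sketch: train a classifier on text embeddings, then apply it to image embeddings.
# Assumes a CLIP-style model mapping both modalities into one shared space;
# the encoders here are random stand-ins used purely for illustration.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
DIM = 512

def encode_text(prompts):       # stand-in for a real joint-embedding text encoder
    return rng.normal(size=(len(prompts), DIM))

def encode_image(images):       # stand-in for the paired image encoder
    return rng.normal(size=(len(images), DIM))

def normalize(x):
    return x / np.linalg.norm(x, axis=1, keepdims=True)

# 1. Build a training set from text prompts only -- no images are collected.
prompts = ["a chest x-ray with pneumonia", "a normal chest x-ray"] * 50
labels = np.array([1, 0] * 50)
clf = LogisticRegression(max_iter=1000).fit(normalize(encode_text(prompts)), labels)

# 2. Apply the same classifier directly to image embeddings at test time.
test_images = [f"image_{i}" for i in range(10)]   # placeholders for real images
preds = clf.predict(normalize(encode_image(test_images)))
print(preds)
```

With real encoders, the conditions studied in the paper determine when this text-to-image transfer is valid; the random stand-ins above only show the mechanics.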
In this review, we provide consistent descriptions of different self-supervised learning strategies and conduct a systematic review of papers published between 2012 and 2022 on PubMed, Scopus, and arXiv that apply self-supervised learning to medical imaging classification. With this comprehensive effort, we synthesize the collective knowledge of prior work and provide implementation guidelines for future researchers interested in applying self-supervised learning to the development of medical imaging classification models.
In this review, we provide an evidence-based roadmap for how machine learning technologies in medical imaging can be used to battle ongoing and future pandemics. Specifically, each section focuses on one of the four most pressing issues: needfinding, dataset curation, model development and evaluation, and post-deployment considerations.
The purpose of this work is to develop label-efficient multimodal medical imaging representations by leveraging radiology reports. Specifically, we propose an attention-based framework (GLoRIA) for learning global and local representations by contrasting image sub-regions and words in the paired report. In addition, we propose methods to leverage the learned representations for various downstream medical image recognition tasks with limited labels.
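A rough sketch of the local (word to sub-region) matching that such a framework builds on, with random tensors standing in for encoder outputs; the shapes, temperatures, and pooling are illustrative rather than the exact GLoRIA formulation:

```python
# Sketch of attention-based local matching between report words and image
# sub-regions; tensor shapes and hyperparameters are illustrative only.
import torch
import torch.nn.functional as F

B, W, R, D = 8, 16, 49, 256   # batch, words per report, image sub-regions, dim
words = F.normalize(torch.randn(B, W, D), dim=-1)     # word embeddings
regions = F.normalize(torch.randn(B, R, D), dim=-1)   # image sub-region embeddings

def local_score(w, r, tau=0.1):
    """Attention-pool sub-regions for each word, then average word similarities."""
    attn = torch.softmax(w @ r.transpose(-1, -2) / tau, dim=-1)   # (W, R)
    context = attn @ r                                            # (W, D)
    return F.cosine_similarity(w, context, dim=-1).mean()

# Pairwise report-image scores over the batch; the diagonal holds the true pairs.
scores = torch.stack([torch.stack([local_score(words[i], regions[j])
                                   for j in range(B)]) for i in range(B)])
labels = torch.arange(B)
loss = F.cross_entropy(scores / 0.07, labels)   # contrast each report against all images
print(loss.item())
```

In the full framework this local objective is paired with a global image-report contrastive term, so both whole-image and sub-region representations are learned from the paired reports.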
In this review, we describe different data fusion techniques that can be applied to combine medical imaging with electronic health record (EHR) data, and systematically review medical data fusion literature published between 2012 and 2020. By means of this systematic review, we present current knowledge, summarize important results, and provide implementation guidelines to serve as a reference for researchers interested in the application of multimodal fusion in medical imaging.
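As one illustration of the kind of fusion pattern the review covers, the sketch below concatenates learned imaging features with EHR features before a shared prediction head; the encoders, dimensions, and head are placeholders rather than any specific reviewed model:

```python
# Sketch of feature-level fusion of an imaging encoder with tabular EHR features;
# all architectures and dimensions here are placeholders for illustration.
import torch
import torch.nn as nn

class FeatureFusionModel(nn.Module):
    def __init__(self, img_dim=512, ehr_dim=64, hidden=128, n_classes=2):
        super().__init__()
        self.img_encoder = nn.Sequential(nn.Flatten(), nn.LazyLinear(img_dim), nn.ReLU())
        self.ehr_encoder = nn.Sequential(nn.Linear(ehr_dim, hidden), nn.ReLU())
        self.head = nn.Sequential(nn.Linear(img_dim + hidden, hidden),
                                  nn.ReLU(), nn.Linear(hidden, n_classes))

    def forward(self, image, ehr):
        # Concatenate the two modality representations, then predict jointly.
        fused = torch.cat([self.img_encoder(image), self.ehr_encoder(ehr)], dim=-1)
        return self.head(fused)

model = FeatureFusionModel()
logits = model(torch.randn(4, 1, 64, 64), torch.randn(4, 64))  # toy image + EHR batch
print(logits.shape)   # torch.Size([4, 2])
```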
Designed object embeddings to improve dense semantic understanding in Vision Language Models (VLMs) and curated a multi-image Visual Question Answering dataset to benchmark VLMs' dense semantic understanding
Developed a VLM for generating radiology reports from chest X-rays that achieves state-of-the-art performance
Designed and implemented a multimodal self-supervised framework for prostate cancer long-term outcome prediction.
Created Segmentify, an interactive and general-purpose cell segmentation plugin for the napari image viewer
Developed mmtf-pyspark, a Python package that parallelizes the analysis and mining of protein data using Apache Spark