Haocheng Dai


vitae / linkedin / github / youtube / flickr / instagram

i am an applied scientist at amazon science working on rufus, the next generation llm-based conversational shopping assistant.

i earned my doctoral degree in computer science from the university of utah — the birthplace of computer graphics and the embryo of the current worldwide internet (arpanet). i was fortunate to be mentored by dr. sarang joshi with affiliations to the scientific computing and imaging institute and kahlert school of computing. i also work closely with researchers from fsu, ucla, uva, and yale.

before i joined the u, i received my bachelor degree in computer science from tongji university and have studied at israel institute of technology and institut de mathématiques de toulouse as an exchange student, focusing on image analysis and riemannian geometry, respectively.

currently, i am passionate about developing specialized and trustworthy machine learning tools. my research extends to, but is not limited to: 📄large language models and retrieval-augmented generation; 👨‍⚖️trustworthy machine learning (fairness and robustness); 👁️vision language and diffusion models; 📐geometric deep learning and shape modeling; and 🔭physics-informed machine learning.



publications & preprints

[*=equal contribution]


Refining Skewed Perceptions in Vision-Language Models through Visual Representations.

pmb



The Silent Majority: Demystifying Memorization Effect in the Presence of Spurious Correlations.
  • Chenyu You*, Haocheng Dai*, Yifei Min*, Jasjeet Sekhon, Sarang Joshi, James Duncan.
  • Preprint, 2024
  • 👨‍⚖️👁️ / Paper / Code / Slides / Citation

pmb



High-Fidelity CT on Rails-Based Characterization of Delivered Dose Variation in Conformal Head and Neck Treatments.
  • Haocheng Dai, Vikren Sarkar, Christian Dial, Markus Foote, Ying Hitchcock, Sarang Joshi, Bill Salter.
  • Applied Radiation Oncology (ARO), 2023
  • 👨‍⚖️🧑‍⚕️ / Paper / Code / Slides / Citation

pmb



Detect AI-generated Images Uploaded for Risk Evidence Collection in CSSW.
  • Haocheng Dai, Siwei Chen, Bei Xiao, Yangho Chen.
  • Amazon Machine Learning Conference (AMLC), 2023
  • 👨‍⚖️👁️ / Paper / Citation

ipmi



Neural Operator Learning for Ultrasound Tomography Inversion.
  • Haocheng Dai*, Michael Penwarden*, Mike Kirby, Sarang Joshi.
  • International Conference on Medical Imaging with Deep Learning (MIDL), 2023
  • 🧑‍⚕️🔭 / Paper / Code / Slides / Poster / Citation

midl



Modeling the Shape of the Brain Connectome via Deep Neural Networks.
  • Haocheng Dai, Martin Bauer, Tom Fletcher, Sarang Joshi.
  • International Conference on Information Processing in Medical Imaging (IPMI), 2023
  • Oral Presentation
  • 🧑‍⚕️📐🔭 / Paper / Code / Slides / YouTube / Citation / Media Coverage

ipmi



Understanding Visual Documents from Customer Self-Service Workflow using Multimodal Transformer.
  • Haocheng Dai, Jia-Kai Chou, Siwei Chen, Bei Xiao, Yangho Chen.
  • Amazon Machine Learning Conference (AMLC), 2022
  • 👁️ / Paper / Citation

ipmi



Integrated Construction of Multimodal Atlases with Structural Connectomes in the Space of Riemannian Metrics.
  • Kris Campbell, Haocheng Dai, Zhe Su, Martin Bauer, Tom Fletcher, Sarang Joshi.
  • Machine Learning for Biomedical Imaging (MELBA), 2022
  • 🧑‍⚕️📐 / Paper / Code / Citation

melba



Structural Connectome Atlas Construction in the Space of Riemannian Metrics.

ipmi



services


i have served as a reviewer for several journals and conferences, including acm mm, aistats, acm tist, cvpr, aro, iclr, icml, ieee tnnls, media, melba, miccai, midl, neurips, scientific reports, ai for differential equations in science@iclr, and wicv@eccv.



miscellaneous


i made a handful of notes for better understanding in language models, machine learning, mathematics of imaging, metric estimation, image registration, and solving large systems of linear equations.

my erdős number = 4:
haocheng dai -> sarang joshi -> ulf grenander -> oved shisha -> paul erdős;
haocheng dai -> mike kirby -> frank stenger -> ambikeshwar sharma -> paul erdős.

i am an amateur photographer, vlogger and also a loyal reader of 📰 newspapers, you can find the highlight front pages of the new york times i collect by the years of 2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022, 2023, and 2024; the highlight front pages of the washington post I collect before 2015, 2016 - 2020, and 2021 - 2024.



footprints


ipmi