About Me
I’m a PhD student in Computational Science and Engineering at McMaster University, advised by Prof. Elkafi Hassini and Prof. Saiedeh Razavi.
My research interest is broadly in the convergence of geospatial data science, computational analytics, operations research, deep learning, and GenAI.
Currently, I’m affiliated with the Smart Freight Centre and working as a Research Scientist at Geotab under the Altitude team. In addition to my research, I am a sessional instructor for a statistics course at McMaster University.
Before that, I earned my B.S. in Mathematics (Applied) from the University of Illinois Urbana-Champaign.
Working Papers
- Ma, Y., Foda, A., Hassini, E., & Mohamed, M. Classifying E-Commerce Vehicle Trajectories Using Vision-Based Representation Learning and LLM-Augmented Metadata Labeling.


- Ma, Y., Hassini, E., & Razavi, S. Impact of Bus Rapid Transit on Freight Movement: A Telematics Approach.



- Ma, Y., Hassini, E., & Razavi, S. Evaluation of Emission Trade-offs in Optimized Urban Truck Routes under Dynamic Traffic.

Publications
- Ma, Y., Liu, C. A., Hassini, E., & Razavi, S. (2024). A Network-Based, Data-Driven Methodology for Identifying and Ranking Freight Bottlenecks. Data Science for Transportation, 6(3), 20. https://doi.org/10.1007/s42421-024-00107-z

- Ma, Y., Amiri, A., Hassini, E., & Razavi, S. (2022). Transportation data visualization with a focus on freight: a literature review. In Transportation Planning and Technology (Vol. 45, Issue 4, pp. 358–401). Informa UK Limited. https://doi.org/10.1080/03081060.2022.2111430


Patent
- Ma, Y., & Liu, C. A. (2024). Systems and methods for identifying and ranking traffic bottlenecks. U.S. Patent Application No. US20240161606A1.




Projects
Project | Description |
---|---|
MBC AI Dashboard | LLM-driven pipeline for sermon analytics, featuring a data dashboard with NLP-based feature extraction and visualization toolkit. |
WerewolfGPT | Multi-agent LLM system simulating conversational deduction in Werewolf game, leveraging natural language reasoning. |
TumorRxGPT | LLM fine-tuned on EMRs for tumor phenotyping and treatment recommendation, integrating structured/unstructured data processing. |
MosaicNFT Generator | C++ optimized mosaic generator using kd-trees for spatial indexing and LUV color space for perceptual similarity matching. |
TriCount Proof Protocol | Interactive sum-check protocol implementation with prover/verifier for triangle counting in graphs, leveraging polynomial verification. |