◆ doyled-it.com · résumé

◆ Michael Doyle ◆

AI Researcher & Engineer

download pdf

Summary

An AI researcher with a passion for language, open source, and applying research to important problems.

Work

Lead AI Research Engineer · The MITRE Corporation Jul 2018 — present

Led multiple ML projects. Researched, developed, tested, trained, and deployed machine learning models and applications.

  • Led research project on LLMs, IT Modernization, and code understanding
  • Led research project on training object detectors for neuromorphic cameras
  • Maintained internal GPU servers and services on OpenShift and Linux servers
  • Developed and open sourced fire simulator to be used in RL training for wildfire mitigation
  • Developed and open sourced LLM IT modernization library
  • Developed agentic LLM backend prototype for government sponsor application
  • Directly deployed trained models on government sponsor systems
Firmware Engineering Intern · Space Micro Inc. May 2016 — May 2018

Wrote, simulated, synthesized, and implemented FPGA code in VHDL using ModelSim, Synplify Pro, Vivado, and Libero Designer. Designed PCBs in KiCAD and tested hardware.

Education

BS/BA Dual Degrees in Electrical Engineering · University of San Diego Sep 2013 — May 2018

Skills

Machine Learning
NLP · LLMs · Computer Vision · Acoustics · RL · Simulation · Adversarial
Languages
Python · Bash · SQL · LaTeX · Vue.js · JavaScript · CSS · MATLAB · VHDL
Libraries
NumPy · SciPy · PyTorch · TensorFlow · FastAPI · Typer · LangChain · Chroma · HuggingFace
Technologies
Git · Docker · OpenShift · GitLab CI · GitHub Actions

Projects

PFC-LLM Aug 2024 — Aug 2024
    Janus LLM Jul 2023 — present
      SimFire Sep 2019 — present
        SimHarness Sep 2019 — present

          Publications

          Can LLMs Replace Humans During Code Chunking? 2025
          ArXiv

          Investigates using LLMs to modernize legacy government code by focusing on code-chunking methods to overcome input limitations, showing that LLMs can effectively partition code and generate high-quality documentation.

          Impact of Comments on LLM Comprehension of Legacy Code 2025
          ArXiv

          Presents preliminary findings on the impact of documentation on LLM comprehension of legacy code, leveraging multiple-choice question answering (MCQA) for evaluation.

          Leveraging LLMs for Legacy Code Modernization: Challenges and Opportunities for LLM-Generated Documentation 2024
          ArXiv

          Investigates using LLMs to generate documentation for legacy code in MUMPS and ALC, proposing a prompting strategy and evaluation rubric while highlighting the limitations of current automated metrics.

          Testing the Effect of Code Documentation on Large Language Model Code Understanding 2024
          North American Association for Computational Linguistics (NAACL)

          Empirically analyzes how code documentation quality impacts the code generation and understanding capabilities of Large Language Models (LLMs). It reveals that incorrect documentation significantly hinders LLMs' code comprehension, while incomplete or missing documentation has no significant impact.

          Reinforcement Learning for Wildfire Mitigation in Simulated Disaster Environments 2023
          Neural Information Processing Systems (NeurIPS)

          Presents a reinforcement learning approach to wildfire mitigation in simulated disaster environments. We release two software libraries, SimFire and SimHarness, to facilitate future research in this area.

          Practical Attacks on Machine Translation using Paraphrase 2022
          Association for Machine Translation in the Americas (AMTA)

          Investigated the vulnerability of machine translation systems to adversarial attacks constructed with limited information. A novel attack method was proposed that generates perturbations using paraphrases and evaluates their impact on meaning preservation and translation degradation across various language pairs and systems.

          The vulnerability of UAVs: an adversarial machine learning perspective 2021
          SPIE

          Proposes a methodology to evaluate the vulnerability of unmanned aerial vehicles (UAVs) to adversarial machine learning attacks by analyzing potential attack vectors at each stage of UAV operation.

          [H] home[R] résumé[P] projects[W] words[M] music[C] contact[B] baseball[G] golf[V] movies