About Me

Who Are You?


I’m Sumuk, an undergraduate student at the University of Illinois Urbana-Champaign pursuing a Bachelor’s degree in Computer Science. I expect to graduate in May 2025.

My research interests lie at the intersection of natural language processing, reasoning, and large language models. I am fortunate to be advised by Professor Heng Ji at the BLENDER Lab and Professor Kevin Chenchuan Chang at the Forward Data Lab.

My first-author paper, “Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models”, was accepted to the Findings of EMNLP 2023. I am also working on a few other projects for submission to ACL 2024.

Additionally, I have industry experience from research and software engineering internships at Rivian, Yahoo, and AGCO, where I have worked on data mining, ad analytics, supply chain optimization, and more.


Industry Experience Timeline

  • Sep 2023 - Present: Data Science Intern @ Rivian
    • Root cause analysis on electrical component failures via multi-petabyte data mining
  • May 2023 - Aug 2023: Software Engineering Intern @ Yahoo
    • ML pipeline for ad pricing and performance monitoring on demand-side platform
  • May 2022 - May 2023: Software Engineering Intern @ AGCO
    • Supply chain analytics for risk forecasting
  • Jun 2021 - May 2022: Software Engineering Intern @ QuantIllinois
    • High-frequency trading infrastructure and anomaly detection
  • Jun 2020 - Dec 2020: Data Science Intern @ YLAC
    • Data-based mobilization of advertising funds on Instagram to reverse sentiment
  • Jun 2019 - Mar 2020: Software Engineering Intern @ Cisco
    • Internal platform development with microservices and caching

Can We Talk?

I am constantly looking to collaborate with like-minded individuals. Please feel free to reach out to me to explore potential projects in large language models and related areas!

Last updated: 11/21/2023