LLM Reasoning & Code Generation
Jan 1, 2025
·
1 min read

My research on LLMs spans reasoning evaluation, automated code/test generation, and reliability. Key contributions include:
- AgentTester — A multi-agent framework for automated unit test generation that significantly improves software reliability (AIWare 2025).
- HPCAgentTester — Extending multi-agent test generation to high-performance computing environments.
- LogBabylon — A unified framework for cross-log file integration and analysis using LLMs.
- Hallucination Mitigation — Techniques for reducing AI hallucinations in diagnostic and clinical contexts.
- Evaluating LLM Rationality & Randomness — Benchmarking the quality of reasoning and entropy in LLM-supported tasks.
- Synthesizing Public Opinions — Using LLMs to synthesize democratic discourse (IEEE ICEDEG 2025).
This line of research explores how multi-agent systems can produce more reliable, verifiable AI outputs, with applications from software engineering to healthcare.