A controlled experiment comparing baseline LLM performance with agentic web search on 88 scientific claims. Results show a 19x improvement.
LLM, AI Agent, Citation Verification, Web Search, Hallucination
Exploring the intersection of AI, scientific research, and technology. Empirically grounded, openly shared.
I'm a Master's student in Business Informatics at Friedrich Schiller University Jena, focused on AI research and empirical methods. This blog documents my experiments, findings, and thoughts on technology and science.