VCBench: Benchmarking LLMs in Venture Capital
Benchmarks such as SWE-bench and ARC-AGI demonstrate how shared datasets accelerate progress toward artificial general intelligence (AGI). We introduc...
ARAG is a personalized recommendation framework that extends Retrieval-Augmented Generation by using four specialized LLM-based agents to analyze long-term and session user behavior, assess item relevance, summarize context, and rank items.
Unlike standard RAG, ARAG employs a multi-agent collaboration mechanism for more dynamic, context-aware recommendations and fine-grained alignment with user intent. Experiments across three datasets show that ARAG delivers significant improvements over existing baselines in key recommendation metrics.