Articles tagged with AgenticReasoning
-
-
-
Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?
Self-generated agent context files don't help.
-
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks
Self-generated skills don't help.
-
-
-
Thinking LLMs: General Instruction Following with Thought Generation (Oct 2024)
a prompting and fine-tuning method that enables LLMs to engage in a "thinking" process before generating responses
-
Evaluation of OpenAI o1: Opportunities and Challenges of AGI
a comprehensive evaluation of o1-preview across many tasks and domains.
-