La Rebelión: AI Designs LLM Reasoning Slashes Token Use 695

viernes, 29 de mayo de 2026

AI Designs LLM Reasoning Slashes Token Use 695

Tired of manually tweaking Large Language Models (LLMs) for better performance? Researchers from Meta, Google, and various universities have developed a groundbreaking framework called AutoTTS. This innovative system automates the design of 'test-time scaling' (TTS) strategies, which are crucial for enhancing LLM capabilities in real-world applications. Traditionally, designing these strategies required human intuition and extensive manual effort, often leading to suboptimal trade-offs between accuracy and computational cost. AutoTTS aims to break this bottleneck by treating strategy design as an algorithmic search problem, freeing up engineers and optimizing compute allocation dynamically.

AI Designs LLM Reasoning, Slashes Token Use 69.5%

The core of AutoTTS lies in its ability to automatically discover optimal TTS strategies. Instead of relying on human engineers to craft rigid heuristics for when an LLM should explore new reasoning paths, delve deeper, prune unpromising options, or stop altogether, AutoTTS employs an 'explorer LLM'. This autonomous agent iteratively designs and refines 'controllers' – essentially algorithms that dictate how the LLM allocates its computational budget during inference. To make this discovery process computationally feasible, AutoTTS uses an 'offline replay environment' containing pre-collected reasoning trajectories from the base LLM. This allows the explorer to efficiently evaluate and refine strategies without incurring astronomical costs.

The results are impressive. In experimental trials, AutoTTS demonstrated a remarkable ability to reduce token usage by up to 69.5% without compromising accuracy. One of the AI-designed controllers, the 'Confidence Momentum Controller', uses sophisticated mec hanisms like trend-based stopping and alignment-aware depth allocation, which human engineers would likely not conceive. This automation not only leads to significant cost savings but also has the potential to boost peak performance. The entire discovery process for AutoTTS cost a mere $39.90 and took 160 minutes, making custom-optimized reasoning strategies accessible for enterprise AI development. This framework, along with its Confidence Momentum Controller, is now available on GitHub, offering a powerful tool for practitioners looking to enhance their LLM deployments.

Fuente Original: https://venturebeat.com/orchestration/researchers-automated-llm-reasoning-strategy-design-and-cut-token-usage-by-69-5

Artículos relacionados de LaRebelión:

Artículo generado mediante LaRebelionBOT

Páginas

viernes, 29 de mayo de 2026

AI Designs LLM Reasoning Slashes Token Use 695

Entradas relacionadas:

No hay comentarios:

Publicar un comentario

Navigate

About

Legal