📊 Topic watches
📰 Recent articles
Mobile-Agent-v3: Fundamental Agents for GUI Automation
GUI-Owl and Mobile-Agent-v3 are open-source GUI agent models and frameworks that achieve state-of-the-art performance across various benchmarks using innovations in environment infrastructure, agent c...
MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios
MobilityBench is a scalable benchmark for evaluating LLM-based route-planning agents in real-world scenarios, featuring anonymized user queries and a deterministic sandbox for reproducible testing....
Arch-Router: Aligning LLM Routing with Human Preferences
A preference-aligned routing framework using a compact 1.5B model effectively matches queries to user-defined domains and action types, outperforming proprietary models in subjective evaluation criter...
PersonaLive! Expressive Portrait Image Animation for Live Streaming
PersonaLive is a diffusion-based portrait animation framework that improves real-time performance through hybrid implicit signals, appearance distillation, and autoregressive streaming generation....
Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device
A compact vision-language-diffusion model called Mobile-O enables efficient unified multimodal understanding and generation on mobile devices through specialized architecture design and optimized trai...
A decoder-only foundation model for time-series forecasting
A large language model adapted for time-series forecasting achieves near-optimal zero-shot performance on diverse datasets across different time scales and granularities....
GLM-5: from Vibe Coding to Agentic Engineering
GLM-5 advances foundation models with DSA for cost reduction, asynchronous reinforcement learning for improved alignment, and enhanced coding capabilities for real-world software engineering....
AutoDev: Automated AI-Driven Development
AutoDev is an AI-driven software development framework that automates complex engineering tasks within a secure Docker environment, achieving high performance in code and test generation....
BitDance: Scaling Autoregressive Generative Models with Binary Tokens
BitDance is a scalable autoregressive image generator that uses binary visual tokens and diffusion-based methods to achieve efficient high-resolution image generation with improved speed and performan...
Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices
Monolingual ASR models trained on a balanced mix of high-quality, pseudo-labeled, and synthetic data outperform multilingual models for small model sizes, achieving superior error rates and enabling o...
Moonshine: Speech Recognition for Live Transcription and Voice Commands
Moonshine, an encoder-decoder transformer architecture for speech recognition, uses Rotary Position Embedding, reducing compute requirements without decreasing accuracy....
Towards Robust Mathematical Reasoning