Latest publications

Hugging Face released a Skill for quickly porting Transformers models to MLX
Hugging Face introduced a Skill and a separate test harness to port new models from Transformers to mlx-lm on MLX almost immediately, without a stream of raw AI-generated PRs.

IBM Research analyzed where AI agents break down on APIs, documents, and rules in VAKRA
IBM Research's analysis of VAKRA shows that even strong models lose reliability when they have to combine APIs, documents, multi-step reasoning, and tool constraints.

Hugging Face published Ecom-RLVE, a training environment for e-commerce AI agents
Hugging Face published Ecom-RLVE, an open-source environment where AI agents learn to handle purchase conversations, use tools, and earn a verifiable reward for the cart they actually assemble.

TII introduced QIMMA — an Arabic LLM leaderboard with benchmark quality checks
TII launched QIMMA, an Arabic LLM leaderboard that first checks the benchmarks themselves and only then compares models on 52,000 examples from seven domains.

NVIDIA introduced Nemotron OCR v2: multilingual OCR trained on 12.2 million synthetic documents
NVIDIA showed how it built Nemotron OCR v2: the model was trained on 12.2 million synthetic documents to recognize multiple languages with a single engine and process up to 34.7 pages per second.

NVIDIA showed how Gemma 4 with voice and a webcam runs on Jetson Orin Nano Super
NVIDIA published a demo in which Gemma 4 decides on its own when to activate the webcam and responds by voice — all locally on Jetson Orin Nano Super with 8 GB of memory.

NVIDIA introduces NeMo Retriever — agentic search for complex enterprise data
NVIDIA showcased an agentic pipeline in NeMo Retriever: the system goes beyond semantic search, planning steps, refining queries, and has already taken first place in ViDoRe v3.

Nvidia unveiled the first open dataset and foundation AI models for medical robots
Nvidia and partners on Hugging Face released the first large open dataset for medical robots and two foundation models for surgery, simulation, and future autonomy.

NVIDIA released Nemotron 3 Nano 4B — a compact hybrid model for on-device deployment
NVIDIA made the 4B Nemotron 3 Nano model with a hybrid Mamba-Transformer architecture available: the lowest VRAM usage in its class, 18 tokens/s on Jetson Orin Nano, and open weights.

Hugging Face: Chinese open-source models overtake the US in AI ecosystem downloads
Hugging Face showed that open-source AI nearly doubled in scale over the past year, while Chinese models already account for 41% of downloads and set the pace in releases, adaptation, and local deployment.

AI Model Evaluation Now Costs More Than Training — A New Barrier for Researchers
EvalEval Coalition analyzed the cost of AI-benchmarks: a single agentic test costs $40,000 or more, and academic groups can no longer afford independent evaluation.

IBM reveals how it built Granite 4.1: 15 trillion tokens, 512K context window, and focus on quality
IBM detailed its approach to training Granite 4.1: five pretraining stages, 15 trillion tokens, context window up to 512K, and separate SFT and RL pipelines for quality improvement.

Hugging Face adds DeepInfra to Inference Providers for unified model API
Hugging Face connected DeepInfra to Inference Providers: DeepSeek, Kimi, and GLM models can now be run from Hub pages, via SDK, and through the unified router without separate integration.

NVIDIA Introduced Nemotron 3 Nano Omni for Long Documents, Audio, Video, and AI Agents
NVIDIA introduced Nemotron 3 Nano Omni — an open multimodal model for long documents, audio, video, and GUI scenarios with emphasis on speed and context.

Hugging Face Explains Fine-tuning of Multimodal Embeddings and Reranker Models
Hugging Face released a practical guide on training multimodal embedding and reranker models in Sentence Transformers and demonstrated how domain-specific fine-tuning improves document retrieval.

How Hugging Face Builds Scalable Web Apps with OpenAI Privacy Filter
Hugging Face demonstrated three scenarios for OpenAI Privacy Filter: document reading with PII highlighting, image anonymization, and secure pastebin with public and private versions.

Hugging Face: open-source AI gives defenders the same capabilities as attackers
Hugging Face explains why open models and tools are a structural advantage in cybersecurity, not a threat.

Hugging Face trained an image generation model in 24 hours
The third part of Hugging Face's PRX project shows that a full-fledged text-to-image model can be trained in just 24 hours. This changes perceptions of the accessibility of generative AI.

NVIDIA Nemotron 2 Nano 9B: a new standard for sovereign AI in Japan
NVIDIA unveiled the compact language model Nemotron 2 Nano 9B, optimized specifically for the Japanese language and the concept of sovereign AI.

SyGra Studio: Symbolic AI Attempts to Cure Neural Network Hallucinations
We're all a bit tired of how modern neural networks behave like talented but extremely irresponsible interns.

Holo2 from H Company: interfaces will finally stop scaring users
Interface localization has always been that "final boss" for developers that eats budgets and nerves.

Brazilian Nemotron: Why Silicon Valley No Longer Dictates the Rules
Brazilian Nemotron: Why Silicon Valley No Longer Dictates the Rules Imagine conversing with an incredibly intelligent interlocutor who knows everything in this world, yet views the world exclusively…

AprielGuard: A New Frontier in Protecting LLMs from Threats and Attacks
Modern large language models (LLM) demonstrate impressive capabilities, but they also open new horizons for attackers.