Sources

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL huggingface.co

Reachy Mini goes fully local huggingface.co

References

PULSE paper (arxiv 2602.03839) arxiv.org

PULSESync transmits lossless, sparse BF16 patches rather than full checkpoints, reducing payload sizes by over 100× (e.g., from 14 GB to ~108 MB for a 7B model) while ensuring trainer and worker weights remain bit-identical.

Medium — ‘How Cursor Trains Agentic Models with RL’ medium.com

Because only about 2% of weights typically change between consecutive RL checkpoints, this reduces a 1-terabyte snapshot to a manageable 20-gigabyte delta, enabling synchronization via commodity cloud storage rather than specialized RDMA fabrics… inference workers can apply these weight updates ‘in-flight’ mid-rollout.

hanifleo.com — ‘Anatomy of RL Frameworks’ hanifleo.com

Rank 0 typically has limited device memory, it must often load and broadcast parameters layer-by-layer or even dump weights to disk as checkpoints before relaying them to inference workers… generating long chain-of-thought sequences can account for up to 90% of total training time.

00f.net — ‘Xet intro’ 00f.net

Modifying 1% of a 500MB file resulted in only 5.5MB of data transferred—a 99% bandwidth saving compared to LFS… For a 5GB SQLite database, appending a mere 1MB of data reduced the update time from 13 minutes (via LFS) to just 0.1 seconds under Xet.

Hugging Face blog — ‘Storage Buckets’ huggingface.co

Buckets are non-versioned containers optimized for the mutable side of AI development, such as training checkpoints, optimizer states, and agent traces… Public Buckets start at $12/TB/month, Private Buckets at $18/TB/month, with egress and CDN usage included up to an 8:1 egress-to-storage ratio.

r/reinforcementlearning discussion of PULSE reddit.com

Sparsity is robust up to a staleness of 8-32 steps; exceeding these bounds leads to ‘graceful degradation’ in sparsity, which could eventually bottleneck communication in extremely asynchronous setups. Gradients themselves remain dense; the sparsity only emerges at the weight-update level due to low learning rates.

E2E Networks ASR benchmark (NVIDIA L4) e2enetworks.com

Parakeet-TDT 0.6B WER regresses from 6.32% in batch mode to 9.22% in streaming mode — a 46% relative increase

r/robotics — Reachy Mini behavior iteration thread reddit.com

perceived success rate of only ~30% for fast, relevant answers, with response lag as high as 15 seconds during live interactions

XMOS — CES 2026 Reachy Mini XVF3800 announcement xmos.com

XMOS XVF3800 voice processor provides 360° far-field capture and acoustic echo cancellation so the robot can hear commands while its motors are active

huggingface/speech-to-speech GitHub issues github.com

strong community demand for integrated Acoustic Echo Cancellation (AEC) to prevent the model from ‘hearing’ its own output — currently marked Open; users also report ‘missing first words’ and audio breaks when play_steps_s is set too low

Medium — High-Quality Long-Form TTS with Qwen3 medium.com

the Qwen3-TTS 0.6B model is unstable in long-form text, producing unnatural pauses or artifacts compared to the robust 1.7B version

r/ReachyMiniDev — shipping ETA thread reddit.com

localized RAM shortage has specifically impacted the Wireless variant; the onboard Pi is often insufficient for heavy LLM tasks, so developers offload the ‘brain’ to a workstation while keeping the daemon on the Pi

Sources

References

Jack Sun, writing.