Browsing Tag
deeplearning
31 posts
【红杉播客】AI Neolab–Engram【主攻记忆与持续学习】–分享未来 AI 发展趋势的独特见解
https://www.youtube.com/watch?v=aiR7F4jqjXY 在这期由红杉资本(Sequoia Capital)主持的《Training Data》播客节目中,初创公司 Engram 的联合创始人 Dan Biderman 和 Jessy Lin 深入探讨了 “记忆(Memory)与持续学习(Continual Learning)” 在 AI 领域的核心作用,并分享了他们对未来 AI…
Anthropic’s Fable/Mythos shutdown is the first real model export-control shock
Anthropic’s Fable/Mythos shutdown is the first real model export-control shock The important AI story this week is not…
Como Precificar Opções em Nível Institucional Usando IA (PINNs) e Python
Se você trabalha ou estuda o mercado de derivativos, sabe que a velocidade e a precisão no cálculo…
Did My LoRA Learn Tenacious Style—or Just Memorize Augmented Patterns?
In Week 11 Tenacious-Bench, we trained a LoRA adapter on Tenacious-style B2B sales emails using Supervised Fine-Tuning (SFT).…
Equilibrated adaptive learning rates for non-convex optimization
Train deep learning models faster with a simple tweak: ESGD Struggling to make deep learning train faster? Many…
On the Effects of Idiotypic Interactions for Recommendation Communities inArtificial Immune Systems
How a Body’s Tricks Could Make Your Recommendations Smarter Imagine a suggestion system that borrows ideas from the…
Deep Learning Without Backpropagation
Most modern neural networks learn using backpropagation. It works well, but it has a strange property: learning depends…
Backprop Finally Made Sense When I Reimplemented It in Rust
I never used PyTorch or TensorFlow. My ML background was NumPy and scikit-learn. I could train models, tune…
Fast Transformer Decoding: One Write-Head is All You Need
Faster Transformer Decoding — One Write-Head Changes How AI Replies Imagine your phone trying to build a sentence…
Deep Graph Library: A Graph-Centric, Highly-Performant Package for Graph NeuralNetworks
Deep Graph Library — Faster, lighter tools for learning from networks Imagine a toolbox that helps computers learn…