[D] Moral Uncertainty Around Emerging AI Introspection
Relevant paper to read first: https://transformer-circuits.pub/2025/introspection/index.html

On the Moral Uncertainty Emerging Around AI Introspection

In...
Hey everyone in the ML community, I wanted to start by saying a huge thank...
Imagine your ML development environment running inside a web platform where each tool such as...
Hey all, I am excited to share our new pre-print with you. GRAM: a General-purpose...
Hey all, we have just released our new pre-print on WavJEPA. WavJEPA is an...
Hello everyone, I am training a captcha recognition model using CRNN. The problem now...
The last few months I've been doing a deep-dive into information geometry and I've really,...
ElikaAi AI Trainer v2.0 — Open-Source Sandbox for Teaching Transferable Skills (Apache 2.0) I’ve...
Hey everyone, I've been reading about "World Models" for a while now and wanted to...
Abstract: Flow-based generative modeling provides a powerful framework for reasoning about uncertainty in weight space....
Hey everyone, I'm working on a personal project (AI for agriculture) and I just spent...
I have a small tabular dataset with ~300 elements. I have to build a...
Hi all! I’ve been experimenting with long-term memory for LLM agents under small context budgets,...
Hi everyone, I’m exploring the idea of creating a small, high-signal peer collaboration model for...
Human Action Classification: Reproducible Research Baselines Hey r/MachineLearning! I built reproducible baselines for human action...
Hey guys! I’ve open-sourced mamba2-jax, an experimental but stable JAX/Flax implementation of Mamba2 (“Transformers are...
We tested a small “attractor” layer that updates during inference (no training/backprop). It preserved perplexity...
This is a project I’ve been working on quietly for a while, and I finally...
TL;DR: Fine-tuned GPT-4.1-nano achieved 98% of Claude Sonnet 4's quality (0.784 vs 0.795) on structured...