Activity for mphysicus.bsky.social
Loading activity...
@sifa.id I have been facing an issue where, upon logging in, I am unable to access my profile.
Microsoft sucks... even in space. www.engadget.com/computing/ar... #Microslop
Been using @hf.co’s Accelerate library for a few months now, and I can’t emphasize enough how great it is. It makes launching distributed training effortless. If you’re into training deep learning models, definitely give it a try. GitHub repo: github.com/huggingface/... #ML #DistributedTraining
Congrats to the winners of this year's Turing Award - Charles Bennett and Gilles Brassard #ComputerScience #TuringAward
"Learning to Reason in 13 parameters" An interesting paper by Meta: arxiv.org/abs/2602.04118 Authors propose "TinyLoRA". They showed that Qwen 8B parameter model was able to achieve about 92% accuracy on certain maths datasets with just "13 trainable parameters (26 bytes)" for finetuning. #ML
I found this surprisingly counter-intuitive: so-called “attention sinks” in #LLMs are actually a feature, not a bug. Found this amazing paper on the topic: arxiv.org/pdf/2504.02732 #ML #deeplearning
Interesting piece by @quantamagazine.bsky.social on "Platonic Representation Hypothesis" (Neural networks, trained with different objectives on different data and modalities, are converging to as shared statistical model of reality in their representation spaces). #machinelearning
This year on Bluesky I wrote 22 posts and 11 replies. I received 49 likes, whereas 11 was from my most popular post, and apparently I love saying "galaxies" and 🌌! www.madebyolof.com/bluesky-wrap...