Hacker Newsnew | past | comments | ask | show | jobs | submit | genxy's commentslogin

Where is the belly button?!

I really enjoyed this 'Newel of Knowledge' video on how to deliver praise w/o triggering someones BS detector https://www.youtube.com/watch?v=C-ZE601hf28

tl;dw celebrate don't manipulate.

I have noticed that my daughter responds so much better to compliments like, "I am really impressed how you handled situation X, you stuck up for yourself in a way that doesn't alienate yourself with your friends. I know how hard that is to do well ..." When I give the reason for the praise it doesn't come off, intended or not as "you did good for a kid." I don't do it everytime, but sometimes the praise works to give another reflective or metacognitive pass over the event.



Burn's great, and we cite it in the paper's related work. Different layer, though: Burn is the high-level DL framework (tensor ops, autodiff, backends), whereas cuTile Rust is the kernel-authoring layer underneath. The closer neighbor is actually CubeCL, Tracel's compute layer that Burn sits on, and there's an open issue to integrate cuTile Rust into CubeCL, so we're already looking at how these fit together. See https://github.com/tracel-ai/cubecl/issues/1282

Yeah, I linked to Burn since it was the parent framework, but yes CubeCL. Still blows my mind that they basically implemented a subset of Rust inside of a Rust macro so you can write your compute kernels in Rust.

Nathaniel Simard: "Rust for AI & Accelerated Computing" | RustConf 2025 https://www.youtube.com/watch?v=RaSxyRQ7egU


Yeah, first think I looked for on the post was MLX and it wasn't there.

https://github.com/ml-explore/mlx-lm

Having used half the systems that Vicki mentioned, mlx was the best balance between power and ease of use. Just a pip install away.


Let them eat steak!

The context window is 16 characters. Talking about tokens per second is meaningless.

its not meaningless. there could be usecases like spell correction.

It is only interesting as an academic exercise in EDA design. Just like microGPT. For something with an n^2 complexity and advertising perf is clickbait.

This is the same way you get people to do bad stuff as well. Make the task small enough so that the moral curvature of the topology is flat and even though they know it is a not-good part of a larger bad part they just shrug. Look at all the wonderful people we know who are working at Amazon and Meta? Corporatism has already jailbroken society.

IIRC that is how Uber implemented their "Greyball" system, which was designed to prevent government employees from actually hailing rides, without completely locking them out of the system (same idea as "shadowbanning"). One team works on "figure out where people work" with the pitch that you can improve routing and ride-share capacity for predictable demand. Another team works on "Display fake data to users" with the pitch being "This is for testing the mobile app in new markets with no drivers yet". Another team works on "mark a user as unable to successfully hail rides" so you can test the failure paths in the app. Then, only the people at the top have the full picture and can put the pieces together to shadowban the regulators.

I don't think people working at Amazon "know that it is a part of a larger bad", it's one of the most trusted American institutions.

https://www.theargumentmag.com/p/why-everyone-loves-amazon


You really don't think know they are selling endless counterfeit products? Don't know they are taking part in massive return fraud against small sellers? You don't think they know they totally ignore sellers with problems even if their livelihood depends on it?

Pretty awesome that Fox now has access to all Roku user's watch history.

Bears, Sharks, they didn't link to reddit. The result is load-bearing.

It should be called something else, maybe Ensemble? It doesn't fuse anything.

Yeah the Rio thing is a better candidate for that word, where they averaged the weights for two models:

https://news.ycombinator.com/item?id=48528371


Ensemble or 'mixture of experts' (before MoE model architecture was a thing).

The good names all get applied to the wrong things. It would be awesome if we could stop every couple years and define in-domain terms we can all agree on. Like how hallucination has won out over confabulation. The precise and accurate definition almost never wins out. :(

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: