londons_explorer
londons_explorer t1_j03uyis wrote
People are talking about malicious clients...
But I wonder more about accidentally evil clients. For example, someone was modding the code and accidentally connects to the public swarm and returning NaN for every request.
Can we have an API which returns the git hash that it's running from and detects if the working directory is clean?
londons_explorer t1_iyc4m3k wrote
Reply to comment by AlSweigart in Google has to pay $9.4 million because it paid people to say they liked the Pixel 4 by RunOrDieTrying
More like "If you rate this 5 stars, we'll give you a full refund, an extra $10, and you get to keep the product for free"
londons_explorer t1_ixyoyde wrote
Reply to comment by Steinrikur in Tesla recalls more than 15,000 Australian electric vehicles over faulty tail lights by ninjascotsman
Lots of cars never get internet access at all.
Lots of owners can't be bothered to connect their car to WiFi. They just want to drive it places.
Those cars will never get this update, and will continue to be a risk to other drivers and the public.
londons_explorer t1_ixpnmnb wrote
Is this fixed with a software update like Tesla?
Or will 634,000 people have to take the day off work to take their car to a garage?
londons_explorer t1_ixgndbj wrote
When you said 'avoid high level frameworks', I thought you were going to ask about building matrix operations with for loops in C++...
I'm here to say that compilers aren't as good as you imagine, and the built in intrinsics in pytorch will perform far better than anything you will write in C/C++/assembly (without months of effort), even before you get to GPU stuff...
londons_explorer t1_ixakobh wrote
Reply to Suggestions for a socially valuable project that would welcome an unpaid contributor [D] by AnthonysEye
Rebuild models from published papers opensource.
You learn a lot and it's very valuable for follow up research by others
londons_explorer t1_iwp5r0a wrote
Reply to [R] Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning - Epochai Pablo Villalobos et al - Trend of ever-growing ML models might slow down if data efficiency is not drastically improved! by Singularian2501
There is a lot more data that could be used in the form of private communications (for example all iMessage chats), if only the ethical and legal side could be sorted out.
londons_explorer t1_ivkbf68 wrote
Reply to comment by JohnTM3 in TIL: That in 2019, some criminals were caught and 36 BTC was confiscated. The authorities valued the worth at €127,000 at the time and was sold 2 years later for €1.5 million. The criminals were returned €1.3 million due to the fact that the amount was expressed in Swedish Krona, not BTC by Wendals87
The police will choose not to investigate, and the da will choose not to prosecute.
londons_explorer t1_itqixfz wrote
Before concatenating them, I would want to be sure that the mean and variance of both embedding vectors is normalized....
londons_explorer t1_itkmja3 wrote
Reply to comment by tickettoride98 in NASA poised to break sound barrier without the sonic boom by Ssider69
It's fairly clear that humans and nature can never life in perfect harmony. The only way is to separate the humans and the nature - for example we could all move to Mars and leave earth for the nature.
londons_explorer t1_is9c1xs wrote
Reply to [R] Mind's Eye: Grounded Language Model Reasoning through Simulation - Google Research 2022 by Singularian2501
This seems to basically be injecting a tiny amount of rule-based decision making into a language model...
The physics model is so limited that it can only work in a very tiny number of cases, and the results might as well be hardcoded prompts to inject.
londons_explorer t1_is11x8p wrote
Reply to comment by shahaff32 in [R] Wavelet Feature Maps Compression for Image-to-Image CNNs by shahaff32
Sure this work was aimed at that, but these same techniques can be used to make a datacenter-scale inference machine into an even more powerful one.
And presumably if a way can be found to do backpropagation in 'wavelet domain', then training could be done like this too.
londons_explorer t1_is0rn7l wrote
This is the kind of research that makes companies with hardware accelerators (google, nvidia, tesla, etc.) suddenly have to redesign and re-buy their very expensive hardware accelerators...
londons_explorer t1_j0642cz wrote
Reply to comment by ReginaldIII in [Project] Run and fine-tune BLOOM-176B at home using a peer-to-peer network by hx-zero
I suspect there will be far more accidentally malicious clients (because someone is experimenting with the code) than deliberately evil clients.
In fact, as long as the percentage of deliberately evil clients is small, and the weight updates they send have capped magnitudes, I suspect you can just ignore the problem - they won't have a substantial effect on the results.