londons_explorer

londons_explorer t1_j0642cz wrote

I suspect there will be far more accidentally malicious clients (because someone is experimenting with the code) than deliberately evil clients.

In fact, as long as the percentage of deliberately evil clients is small, and the weight updates they send have capped magnitudes, I suspect you can just ignore the problem - they won't have a substantial effect on the results.

1

londons_explorer t1_j03uyis wrote

People are talking about malicious clients...

But I wonder more about accidentally evil clients. For example, someone was modding the code and accidentally connects to the public swarm and returning NaN for every request.

Can we have an API which returns the git hash that it's running from and detects if the working directory is clean?

1

londons_explorer t1_ixgndbj wrote

When you said 'avoid high level frameworks', I thought you were going to ask about building matrix operations with for loops in C++...

I'm here to say that compilers aren't as good as you imagine, and the built in intrinsics in pytorch will perform far better than anything you will write in C/C++/assembly (without months of effort), even before you get to GPU stuff...

12

londons_explorer t1_iwp5r0a wrote

There is a lot more data that could be used in the form of private communications (for example all iMessage chats), if only the ethical and legal side could be sorted out.

2