Search

50 results for openai.com:

qalis t1_j6mbu5s wrote

www.youtube.com/watch?v=5D315JD8kYg) and [GPT-3 paper](https://arxiv.org/pdf/2005.14165.pdf) to learn about GPT-3 \- [InstructGPT page](https://openai.com/blog/instruction-following/) and [InstructGPT paper](https://arxiv.org/pdf/2203.02155.pdf) to learn about InstructGPT, the sibling model of ChatGPT ... understand, this is the same as "GPT-3.5" \- [ChatGPT page](https://openai.com/blog/chatgpt/) to learn about differences between InstructGPT and ChatGPT, which are relatively small as far as I understand; it is also sometimes called ... reinforcement learning with human feedback (RLHF) \- RLHF is based on Proximal Policy Optimization algorithm \- [PPO page](https://openai.com/blog/openai-baselines-ppo/) and [PPO paper](https://arxiv.org/pdf/1707.06347.pdf)

3

Submitted by shitty-greentext t3_11rc02e in MachineLearning

Research blog: [https://openai.com/research/gpt-4](https://openai.com/research/gpt-4) Product demo: [https://openai.com/product/gpt-4](https://openai.com/product/gpt-4) Research report: [https://cdn.openai.com/papers/gpt-4.pdf](https://cdn.openai.com/papers/gpt-4.pdf) API waitlist: [https://openai.com/waitlist/gpt-4-api](https://openai.com/waitlist/gpt-4-api) Twitter announcement: [https://twitter.com/OpenAI/status/1635687373060317185](https://twitter.com/OpenAI/status/1635687373060317185) OpenAI developer livestream: [https://www.youtube.com/watch?v=outcGtbnMuQ](https://www.youtube.com/watch?v=outcGtbnMuQ&ab_channel=OpenAI)

1

PlaysForDays t1_iw04wp9 wrote

public benefit company while putting enough content behind walls that [Microsoft is willing to pay](https://openai.com/blog/openai-licenses-gpt-3-technology-to-microsoft/) to knock them down. Even stuff that's free-as-in-beer is not free

11

SoylentRox t1_iyyvlhg wrote

Reply to comment by Head_Ebb_5993 in bit of a call back ;) by GeneralZain

environments, some accurate enough to *immediately* use in the real world - see here for an example [https://openai.com/blog/solving-rubiks-cube/](https://openai.com/blog/solving-rubiks-cube/) \- to force an agent to develop intelligence. (2) neuroscientists have known for years that the brain

7

the-sun-is-gone t1_izeztv6 wrote

Open AI explaining how they invented their own legal structure because nothing else worked for them: [https://openai.com/blog/openai-lp/](https://openai.com/blog/openai-lp/) Stable Diffusion release info with no mention of “artists”: [https://stability.ai/blog/stable-diffusion-announcement](https://stability.ai/blog/stable-diff) A great article summarizing

1

pythoslabs t1_j00ltu7 wrote

hereby assigns to you all its right, title and interest in and to Output."* Reference link : [https://openai.com/api/policies/terms/](https://openai.com/api/policies/terms/) In other words .. the OP has the right to the content he has generated using

2

Ortus14 t1_j2luhse wrote

build a sufficiently aligned AI system that can help us solve all other alignment problems." [https://openai.com/blog/our-approach-to-alignment-research/](https://openai.com/blog/our-approach-to-alignment-research/) ChatGTP has some alignment in avoiding racist and sexist behavior, as well as many other human morals

2

SoylentRox t1_j3jtjb1 wrote

accuracy and efficiency. *Some of the solutions to RL environments are pretty creative, like box surfing.* [*https://openai.com/blog/emergent-tool-use/*](https://openai.com/blog/emergent-tool-use/) Answer The Ultimate Question of life the Universe and everything *humans can't* Solve Annoying Interview

15

gwern t1_j9r43jv wrote

Reply to comment by Hodoss in And Yet It Understands by calbhollo

yields 'hacks' of the classifier, and the more you optimize/sample, the more you exploit the classifier: https://openai.com/blog/measuring-goodharts-law/ My point is that this is more like a virus evolving to beat an immune system

9

Tea_Pearce t1_ja753ng wrote

models to get agents working well in sequential environments. Think [SayCan](https://say-can.github.io/assets/palm_saycan.pdf), [ChatGPT](https://openai.com/blog/chatgpt/), [Diffusion BC](https://openreview.net/forum?id=Pv1GPQzRrC8)...

2