qalis t1_j6mbu5s wrote on January 31, 2023 at 10:02 AM

Reply to [Discussion] Misinformation about ChatGPT and ML in media and where to find good sources of information by Silvestron

www.youtube.com/watch?v=5D315JD8kYg) and [GPT-3 paper](https://arxiv.org/pdf/2005.14165.pdf) to learn about GPT-3 \- [InstructGPT page](https://openai.com/blog/instruction-following/) and [InstructGPT paper](https://arxiv.org/pdf/2203.02155.pdf) to learn about InstructGPT, the sibling model of ChatGPT ... understand, this is the same as "GPT-3.5" \- [ChatGPT page](https://openai.com/blog/chatgpt/) to learn about differences between InstructGPT and ChatGPT, which are relatively small as far as I understand; it is also sometimes called ... reinforcement learning with human feedback (RLHF) \- RLHF is based on Proximal Policy Optimization algorithm \- [PPO page](https://openai.com/blog/openai-baselines-ppo/) and [PPO paper](https://arxiv.org/pdf/1707.06347.pdf)

Enough_Ad_6584 t1_ja9elj1 wrote on February 27, 2023 at 8:31 PM

Reply to comment by [deleted] in Is anyone actually using chatgpt to make gains or is it just more hype? by Popular-Sympathy-696

value in the .ai space would be company name specific. OpenAI primary site is reached via [openai.com](https://openai.com), although they own [open.ai](https://open.ai). That is very telling. Any investment in .ai that would

Tavrin t1_is2kbjf wrote on October 12, 2022 at 8:57 PM

Reply to comment by lifebeyondwalls in AIs are now expert-human-level in no-press Diplomacy and Hanabi by Ezekiel_W

already had some cooperation, but maybe not on the same level as this new paper https://openai.com/blog/openai-five-defeats-dota-2-world-champions/#cooperativemode

A_throwaway__acc t1_itm3nk3 wrote on October 24, 2022 at 5:30 PM

Reply to comment by paramach in A.I.-Generated Art Is Already Transforming Creative Work by Gari_305

something that could take a human months, in just seconds! Everyone knows of [dall-e 2](https://openai.com/dall-e-2/) but there are many popular ways to create art in seconds. In just a matter

Saytahri t1_iv0ikmm wrote on November 4, 2022 at 11:37 AM

Reply to comment by ComplexColor in [D] DALL·E to be made available as API, OpenAI to give users full ownership rights to generated images by TiredOldCrow

many times over in the dataset. They removed the duplications and then checked again, no matches. https://openai.com/blog/dall-e-2-pre-training-mitigations/

PlaysForDays t1_ivzwdc5 wrote on November 11, 2022 at 9:21 PM

Reply to comment by crumpletely in The CEO of OpenAI had dropped hints that GPT-4, due in a few months, is such an upgrade from GPT-3 that it may seem to have passed The Turing Test by lughnasadh

Open AI is not a non-profit company](https://openai.com/blog/openai-lp/), not to mention how their flagship products are _not_ open source

PlaysForDays t1_iw04wp9 wrote on November 11, 2022 at 10:23 PM

Reply to comment by zephyy in The CEO of OpenAI had dropped hints that GPT-4, due in a few months, is such an upgrade from GPT-3 that it may seem to have passed The Turing Test by lughnasadh

public benefit company while putting enough content behind walls that [Microsoft is willing to pay](https://openai.com/blog/openai-licenses-gpt-3-technology-to-microsoft/) to knock them down. Even stuff that's free-as-in-beer is not free

potatodemon t1_iwease5 wrote on November 15, 2022 at 12:19 AM

Reply to comment by badhandml in [D] ML/AI role as a disabled person by badhandml

shockingly good and open-source if you want to play around with speech detection via Python https://openai.com/blog/whisper/

SoylentRox t1_iyyvlhg wrote on December 5, 2022 at 5:04 AM

Reply to comment by Head_Ebb_5993 in bit of a call back ;) by GeneralZain

environments, some accurate enough to *immediately* use in the real world - see here for an example [https://openai.com/blog/solving-rubiks-cube/](https://openai.com/blog/solving-rubiks-cube/) \- to force an agent to develop intelligence. (2) neuroscientists have known for years that the brain

pyepyepie t1_iz0wj9m wrote on December 5, 2022 at 5:34 PM

Reply to comment by trnka in [D] What is the advantage of multi output regression over doing it individually for each target variable by triary95

money to know the systems we build :) BTW, what you talk about seems related to this https://openai.com/blog/deep-double-descent/ (deep double descent). That phenomenon is clearly magic :D I have heard some explanations about weight

VirtualHat t1_iz2qj72 wrote on December 6, 2022 at 12:59 AM

Reply to comment by Oceanboi in [D] Determining the right time to quit training (CNN) by thanderrine

much is good". If you are inserted in this subject, I'd highly recommend [https://openai.com/blog/deep-double-descent/](https://openai.com/blog/deep-double-descent/) (which is about overparameterization), as well as the paper mentioned above (which is about over-training). Again

the-sun-is-gone t1_izeztv6 wrote on December 8, 2022 at 4:59 PM

Reply to comment by [deleted] in What do you think of all the recent very vocal detractors of AI generated art? by razorbeamz

Open AI explaining how they invented their own legal structure because nothing else worked for them: [https://openai.com/blog/openai-lp/](https://openai.com/blog/openai-lp/) Stable Diffusion release info with no mention of “artists”: [https://stability.ai/blog/stable-diffusion-announcement](https://stability.ai/blog/stable-diff) A great article summarizing

maxToTheJ t1_izuwe6z wrote on December 12, 2022 at 12:50 AM

Reply to [D] - Has Open AI said what ChatGPT's architecture is? What technique is it using to "remember" previous prompts? by 029187

openai.com/blog/chatgpt/ https://huggingface.co/blog/rlhf EDIT: WTF is up with the downvotes. Many of the answers to OPs questions are in the f'ing blog? >Has Open AI said what ChatGPT's architecture

pythoslabs t1_j00ltu7 wrote on December 13, 2022 at 5:15 AM

Reply to comment by SgtSlice in I made a novel completely using gpt-3 by youneshlal77

hereby assigns to you all its right, title and interest in and to Output."* Reference link : [https://openai.com/api/policies/terms/](https://openai.com/api/policies/terms/) In other words .. the OP has the right to the content he has generated using

Nameless1995 t1_j0olnhi wrote on December 18, 2022 at 6:21 AM

Reply to comment by CalligrapherFine6407 in [D] ChatGPT, crowdsourcing and similar examples by mvujas

when more "uncertain" (probably based on perplexity or something IDK exactly how the enforce cautiousness)): See: https://openai.com/blog/chatgpt/ > ChatGPT sometimes writes plausible-sounding but incorrect or nonsensical answers. Fixing this issue is challenging

Think_Olive_1000 t1_j17ndks wrote on December 22, 2022 at 6:56 AM

Reply to comment by SendMePicsOfCat in Why do so many people assume that a sentient AI will have any goals, desires, or objectives outside of what it’s told to do? by SendMePicsOfCat

openai.com/blog/faulty-reward-functions/ First result I get when I google reinforcement learning short circuit. Pretty well known issue breh >The RL agent finds an isolated lagoon where it can turn in a large circle

lambolifeofficial OP t1_j22beim wrote on December 29, 2022 at 3:26 AM

Reply to comment by 4e_65_6f in ChatGPT Could End Open Research in Deep Learning, Says Ex-Google Employee by lambolifeofficial

mean this guy? https://openai.com/blog/authors/alec/

Ortus14 t1_j2luhse wrote on January 2, 2023 at 7:25 AM

Reply to comment by Nalmyth in Alignment, Anger, and Love: Preparing for the Emergence of Superintelligent AI by Nalmyth

build a sufficiently aligned AI system that can help us solve all other alignment problems." [https://openai.com/blog/our-approach-to-alignment-research/](https://openai.com/blog/our-approach-to-alignment-research/) ChatGTP has some alignment in avoiding racist and sexist behavior, as well as many other human morals

airduster_9000 t1_j2mrgtl wrote on January 2, 2023 at 2:11 PM

Reply to comment by Akimbo333 in Why can artificial intelligences currently only learn one type of thing? by ItsTimeToFinishThis

opposite of DALL-E: it creates a text-description for a given image. Read more here: [https://openai.com/blog/clip/](https://openai.com/blog/clip/)

alainreid t1_j32mbbv wrote on January 5, 2023 at 5:10 PM

Reply to comment by fangedrandy in Helium-3 by fangedrandy

should catch up on this: https://openai.com/blog/chatgpt/

ndemir t1_j3aek5j wrote on January 7, 2023 at 3:25 AM

Reply to comment by What_The_Hex in Is there an AI tool that can specifically isolate sentences or chunks of text, from larger bodies of text, that meet a certain narrow criteria -- then output those as the result? - [D] by What_The_Hex

ChatGPT is based on GPT. 2 links for a quick intro; https://en.m.wikipedia.org/wiki/GPT-3 https://openai.com/blog/chatgpt/

keepthepace t1_j3fzg3p wrote on January 8, 2023 at 7:44 AM

Reply to comment by LesleyFair in [N] 7 Predictions From The State of AI Report For 2023 ⭕ by LesleyFair

about Microsoft's [2019 1B investment in OpenAI then?](https://openai.com/blog/microsoft/)

SoylentRox t1_j3jtjb1 wrote on January 9, 2023 at 1:38 AM

Reply to 5 Dumbest thing Artificial Intelligence can not do by therealsam44

accuracy and efficiency. *Some of the solutions to RL environments are pretty creative, like box surfing.* [*https://openai.com/blog/emergent-tool-use/*](https://openai.com/blog/emergent-tool-use/) Answer The Ultimate Question of life the Universe and everything *humans can't* Solve Annoying Interview

--algo t1_j43rpre wrote on January 12, 2023 at 11:33 PM

Reply to comment by Hyper1on in [D] Microsoft ChatGPT investment isn't about Bing but about Cortana by fintechSGNYC

sure? This implies otherwise: https://openai.com/blog/instruction-following/ But maybe it's only for the non-codex models

becausecurious t1_j49ltox wrote on January 14, 2023 at 3:00 AM

Reply to [D] Is MusicGPT a viable possibility? by markhachman

There is https://openai.com/blog/jukebox/, which does generate music from prompt, but the prompt is not as detailed ("Hip Hop, in the style of Kanye West") and I don't think there is an easy

Self-Organizing-Dust t1_j4d9kty wrote on January 14, 2023 at 10:15 PM

Reply to [D] Is MusicGPT a viable possibility? by markhachman

uses GPT-2 to generate multi-part midi from text prompts. You can try it here: https://openai.com/blog/musenet/

gtancev t1_j4rrpm6 wrote on January 17, 2023 at 8:29 PM

Reply to [D] Are there any results on convergence guarantees when optimizing NNs? by Dartagnjan

Double descent](https://openai.com/blog/deep-double-descent/) may also be of interest

Poncho_au t1_j4uvrsl wrote on January 18, 2023 at 12:46 PM

Reply to comment by deeeznotes in ChatGPT won't kill Google, it will help it. Generative AI's biggest impact will be on office apps, not search engines. by cartoonzi

openai.com website. It’ll be probably your first link in google

superluminary t1_j5pl1fo wrote on January 24, 2023 at 6:03 PM

Reply to comment by LoquaciousAntipodean in The 'alignment problem' is fundamentally an issue of human nature, not AI engineering. by LoquaciousAntipodean

Really? Prove it. https://openai.com/blog/instruction-following/ The engineers collect large amounts of user input in an open public beta, happening right now. Sometimes (because it was trained on all the text on the internet

TheKing01 OP t1_j6715ow wrote on January 28, 2023 at 4:07 AM

Reply to comment by Cryptizard in Don't despair; there is decent likelihood that an extremely large amount of resources will flow from AGI to the common man (even without UBI) by TheKing01

profit, they are expected to follow their [charter](https://openai.com/charter) to some extent: > We commit to use any influence we obtain over AGI’s deployment to ensure it is used for the benefit

krand16 t1_j6whjv5 wrote on February 2, 2023 at 11:28 AM

Reply to [N] OpenAI starts selling subscriptions to its ChatGPT bot by bikeskata

Direct link to blog](https://openai.com/blog/chatgpt-plus/)

was_der_Fall_ist t1_j6xz6wj wrote on February 2, 2023 at 6:08 PM

Reply to comment by koolaidman123 in [D] Why do LLMs like InstructGPT and LLM use RL to instead of supervised learning to learn from the user-ranked examples? by alpha-meta

prompt and several outputs are generated. A labeler ranks the outputs from best to worst.” https://openai.com/blog/chatgpt/

thelibrarian101 t1_j7p27ns wrote on February 8, 2023 at 11:53 AM

Reply to comment by levand in Is there any AI-distinguishing models? by Such_Share8197

this, openai itself is pretty mediocre at detecting AI generated text: https://openai.com/blog/new-ai-classifier-for-indicating-ai-written-text/

boadie t1_j8hhtwi wrote on February 14, 2023 at 10:05 AM

Reply to [D] Have their been any attempts to create a programming language specifically for machine learning? by throwaway957280

view it as a small dsl, all the interesting bits are way below that level: https://openai.com/blog/triton/

TwitchTvOmo1 OP t1_j8taffb wrote on February 16, 2023 at 8:14 PM

Reply to comment by Cryptizard in What if Bing GPT, Eleven Labs and some other speech to text combined powers... by TwitchTvOmo1

that started talking like an anti-semitic 4chan user. [Here's another article by openAI](https://openai.com/blog/how-should-ai-systems-behave/) from just today, describing pretty much what I just said. >We believe that AI should

gurenkagurenda t1_j8v4i1n wrote on February 17, 2023 at 4:00 AM

Reply to comment by OccasionUnfair8094 in ChatGPT is a robot con artist, and we’re suckers for trusting it by altmorty

fact [not true](https://openai.com/blog/instruction-following/).

gurenkagurenda t1_j8v4fqg wrote on February 17, 2023 at 3:59 AM

Reply to comment by anti-torque in ChatGPT is a robot con artist, and we’re suckers for trusting it by altmorty

part_ of how ChatGPT and InstructGPT were trained, but ChatGPT and InstructGPT use [reinforcement learning](https://openai.com/blog/instruction-following/) to teach the models to do more complex tasks based on human preferences. Also, and this

anti-torque t1_j8vamcu wrote on February 17, 2023 at 4:56 AM

Reply to comment by gurenkagurenda in ChatGPT is a robot con artist, and we’re suckers for trusting it by altmorty

openai.com/blog/deep-reinforcement-learning-from-human-preferences/)

levand t1_j9qe7ev wrote on February 23, 2023 at 8:45 PM

Reply to comment by suflaj in Why bigger transformer models are better learners? by begooboi

enough that they become *less* prone to overfitting (and it's not clear why): https://openai.com/blog/deep-double-descent/

gwern t1_j9r43jv wrote on February 23, 2023 at 11:29 PM

Reply to comment by Hodoss in And Yet It Understands by calbhollo

yields 'hacks' of the classifier, and the more you optimize/sample, the more you exploit the classifier: https://openai.com/blog/measuring-goodharts-law/ My point is that this is more like a virus evolving to beat an immune system

coconautico OP t1_ja3ujgs wrote on February 26, 2023 at 5:24 PM

Reply to comment by LetterRip in [P] [N] Democratizing the chatGPT technology through a Q&A game by coconautico

they can't do anything to prevent me from using my data in other systems. Link: [https://openai.com/terms/](https://openai.com/terms/)

Tea_Pearce t1_ja753ng wrote on February 27, 2023 at 9:53 AM

Reply to [D] Is RL dead/worth researching these days? by [deleted]

models to get agents working well in sequential environments. Think [SayCan](https://say-can.github.io/assets/palm_saycan.pdf), [ChatGPT](https://openai.com/blog/chatgpt/), [Diffusion BC](https://openreview.net/forum?id=Pv1GPQzRrC8)...

ry007opyt OP t1_jahn2up wrote on March 1, 2023 at 2:40 PM

Reply to comment by PerfectMoobs in I tried 2,000 AI tools so you don’t have to. Ask me anything about how to supercharge your life with AI! by ry007opyt

better than the average human and one of them is audio transcription. OpenAI [Whisper](https://openai.com/research/whisper) is incredibly good at transcribing audio in multiple languages, even when the sound is pretty

Search

50 results for openai.com: