farmingvillein t1_j8frv87 wrote on February 13, 2023 at 11:52 PM

Reply to comment by pyepyepie in [R] [N] Toolformer: Language Models Can Teach Themselves to Use Tools - paper by Meta AI Research by radi-cho

> not to use language models to interact with the world (which seems trivial to me, sorry),

The best argument here is that "true" intelligence requires "embedded" agents, i.e., agents that can interact with our (or, at least, "a") world (to learn).

Obviously, no one actually knows what will make AGI work, if anything...but it isn't a unique/fringe view OP is suggesting.

farmingvillein t1_j86xbpu wrote on February 12, 2023 at 2:22 AM

Reply to comment by impossiblefork in [D] Can Google sue OpenAI for using the Transformer in their products? by t0t0t4t4

Additionally, Google has released many open source repositories with transformers and appropriate licensing.

farmingvillein t1_j7n41tv wrote on February 8, 2023 at 12:07 AM

Reply to comment by Zetus in [Discussion] Is ChatGPT and/or OpenAI really the leader in the space? by wonderingandthinking

There seems to be basically zero info about wu dao 2, which makes it hard to take seriously as SOTA.

farmingvillein t1_j7jboe1 wrote on February 7, 2023 at 5:00 AM

Reply to comment by visarga in [D] List of Large Language Models to play with. by sinavski

bloom is pretty terrible, unfortunately

farmingvillein t1_j7ibgcn wrote on February 7, 2023 at 12:12 AM

Reply to comment by starstruckmon in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada

> wrong information from these models is pretty rare

This is not borne out at all by the literature. What are you basing this on?

There are still significant problems--everything from source material being ambiguous ("President Obama today said", "President Trump today said"--who is the U.S. President?) to problems that require chains of logic happily hallucinating due to one part of the logic chain breaking down.

Retrieval models are conceptually very cool, and seem very promising, but statements like "pretty rare" and "don't have that issue" are nonsense--at least on the basis of published SOTA.

Statements like

> I don't think it needs to be 100% resolved for it to be a viable replacement for a search engine.

are fine--but this is a qualitative value judgment, not something grounded in current published SOTA.

Obviously, if you are sitting at Google Brain and privy to next-gen unpublished solutions, of course my hat is off to you.

farmingvillein t1_j7i567e wrote on February 6, 2023 at 11:27 PM

Reply to comment by visarga in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada

This is an interesting choice--on the one hand, understandable, on the other, if it looks worse than chatgpt, they are going to get pretty slammed in the press.

Maaaybe they don't immediately care, in that what they are trying to do is head off Microsoft offering something really slick/compelling in Bing. Presumably, then, this is a gamble that Microsoft won't invest in incorporating a "full" chatgpt in their search.

farmingvillein t1_j7i4iiu wrote on February 6, 2023 at 11:23 PM

Reply to comment by starstruckmon in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada

> Retrieval augmented models ( whether via architecture or prompt ) don't have that issue.

Err. Yes they do.

They are generally better, but this is far from a solved problem.

farmingvillein t1_j7i4ed7 wrote on February 6, 2023 at 11:22 PM

Reply to comment by mettle in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada

> how would you even do that?

r/yeluapyeroc just reviews each post, np

farmingvillein t1_j7i2r6c wrote on February 6, 2023 at 11:11 PM

Reply to comment by VelveteenAmbush in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada

Of course--but it isn't openai, per se, that they are scared of, it is the bing distribution platform.

farmingvillein t1_j7hgqs3 wrote on February 6, 2023 at 8:48 PM

Reply to comment by mugbrushteeth in [N] Google: An Important Next Step On Our AI Journey by EducationalCicada

Really more about bing...which is a statement which seems kinda crazy to write...

farmingvillein t1_j6nxa0i wrote on January 31, 2023 at 5:49 PM

Reply to comment by ezelikman in [R] Parsel: A (De-)compositional Framework for Algorithmic Reasoning with Language Models - Stanford University Eric Zelikman et al - Beats prior code generation sota by over 75%! by Singularian2501

> If you generate 10 complete implementations, you have 10 programs. If you generate 10 implementations of four subfunctions, you have 10,000 programs. By decomposing problems combinatorially, you call the language model less

Yup, agreed--this was my positive reference to "the big idea". Decomposition is almost certainly very key to any path forward in scaling up automated program generation in complexity, and the paper is a good example of that.
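
A quick sketch of the arithmetic in that quote, just to make the combinatorics concrete. The names are placeholders, and counting one model call per sampled implementation is an assumption on my part, not something taken from the paper:

```python
from itertools import product

# Hypothetical setup: 10 sampled implementations for each of 4 subfunctions.
NUM_SUBFUNCTIONS = 4
SAMPLES_PER_SUBFUNCTION = 10

candidates = {
    f"subfunction_{i}": [f"impl_{i}_{j}" for j in range(SAMPLES_PER_SUBFUNCTION)]
    for i in range(NUM_SUBFUNCTIONS)
}

# Every way of picking one implementation per subfunction is a distinct program.
programs = list(product(*candidates.values()))

# Assumption: one model call per sampled implementation.
model_calls = NUM_SUBFUNCTIONS * SAMPLES_PER_SUBFUNCTION

print(len(programs), model_calls)  # 10000 40
```

So, under that assumption, 40 generations cover 10^4 = 10,000 candidate programs, versus 10 programs from 10 whole-program generations.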

> Parsel is intentionally basically indented natural language w/ unit tests. There's minimal extra syntax for efficiency and generality.

I question whether the extra formal syntax is needed at all. My guess is that, were this properly ablated, it probably would not be. LLMs are--in my personal experience, and this is obviously borne out thematically--quite flexible to different ways of representing, say, unit inputs and outputs. Permitting users to specify in a more arbitrary manner--whether in natural language, pseudocode, or extant programming languages--seems highly likely to work equally well, with some light coercion (i.e., training/prompting). Further, natural language allows test cases to be specified in a more general way ("unit tests: each day returns the next day in the week, Sunday=>Monday, ..., Saturday=>Sunday") that LLMs are well-suited to work with. Given LLMs' ability to pick up on context and apply it, as well, there is a good chance that freer-form descriptions of test cases would drive improved performance.

If you want to call that further research--"it was easier to demonstrate the value of hierarchical decomposition with a DSL"--that's fine and understood, but I would call it out as a(n understandable) limitation of the paper and an opportunity for future research.

farmingvillein t1_j6n4hqy wrote on January 31, 2023 at 2:47 PM

Reply to comment by abcdchop in [R] Parsel: A (De-)compositional Framework for Algorithmic Reasoning with Language Models - Stanford University Eric Zelikman et al - Beats prior code generation sota by over 75%! by Singularian2501

> wait bro the key benefit is the the hierarchical description

agreed

> I think that the improvements your suggesting pretty much describe the paper itself

Allow users to work in actual unstructured language, or an extant programming language, and I'd agree.

farmingvillein t1_j6jgv48 wrote on January 30, 2023 at 7:41 PM

Reply to comment by theunixman in [R] Parsel: A (De-)compositional Framework for Algorithmic Reasoning with Language Models - Stanford University Eric Zelikman et al - Beats prior code generation sota by over 75%! by Singularian2501

And this isn't a good thing, it is a necessary thing--we do it because someone bundled some logic together and you need to interact with it.

None of this addresses whether or why something like Parsel is necessary as an intermediate step. The authors do very little to justify the necessity of an intermediate representation; there is no meaningful analysis of why it apparently performs better, nor an ablation analysis to try to close the gaps.

The key benefits--like enforced test cases--could, hypothetically, very easily be enforced in something like Python, or many other languages.
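
To make that concrete, here is a minimal sketch of what "enforced test cases" could look like in plain Python. The `with_tests` decorator is hypothetical (it is not from the paper or from any library), and `next_day` is just a stand-in for a generated candidate implementation:

```python
from typing import Any, Callable, Sequence, Tuple

Case = Tuple[Tuple[Any, ...], Any]  # (positional args, expected output)


def with_tests(cases: Sequence[Case]) -> Callable[[Callable], Callable]:
    """Hypothetical decorator: reject a function unless it passes every attached case."""
    if not cases:
        raise ValueError("at least one test case is required")

    def decorator(fn: Callable) -> Callable:
        # Run the cases at definition time, so an untested or failing
        # implementation never makes it into the program.
        for args, expected in cases:
            actual = fn(*args)
            if actual != expected:
                raise AssertionError(
                    f"{fn.__name__}{args!r} returned {actual!r}, expected {expected!r}"
                )
        return fn

    return decorator


DAYS = ["Sunday", "Monday", "Tuesday", "Wednesday", "Thursday", "Friday", "Saturday"]


@with_tests([(("Sunday",), "Monday"), (("Saturday",), "Sunday")])
def next_day(day: str) -> str:
    # Candidate implementation (e.g., one LLM sample); kept only if the cases pass.
    return DAYS[(DAYS.index(day) + 1) % len(DAYS)]
```

This only illustrates that the convention itself is easy to express in an existing language; whether it would recover Parsel's reported gains is exactly the ablation I'd want to see.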

And given the massive volumes of training data we have for these other languages, there are a lot of good reasons to think that we should be able to see equal or better behavior than with a wholly manufactured pseudocode (effectively) language.

The paper would have been much more convincing and interesting if, e.g., they started with something like python and progressively added the restrictions that apparently helped Parsel provide higher quality results.

farmingvillein t1_j6jdazy wrote on January 30, 2023 at 7:19 PM

Reply to comment by [deleted] in [R] Parsel: A (De-)compositional Framework for Algorithmic Reasoning with Language Models - Stanford University Eric Zelikman et al - Beats prior code generation sota by over 75%! by Singularian2501

This is, at best, a distinction without a difference.

The authors literally describe it as "language".

It gets "compiled".

It generates a "Parsel program".

It holds a distinct learning curve such that a user can be an "expert".

The point here is that it is a unique specification that needs to be separately learned--it asks the user to learn, in essence, a domain-specific language. Or, if you prefer, a domain-specific specification; the point stands either way.

farmingvillein t1_j6iwb5v wrote on January 30, 2023 at 5:34 PM

Reply to [R] Parsel: A (De-)compositional Framework for Algorithmic Reasoning with Language Models - Stanford University Eric Zelikman et al - Beats prior code generation sota by over 75%! by Singularian2501

I like the big idea, and it is almost certainly indicative of one of the key tools to improve automated programming.

That said, I wish they had avoided the urge to build an intermediate programming language. This is likely unnecessary and is the type of semi-convoluted solution that you only come up with in an academic research lab (or out of true, deep product need--but I think that is highly unlikely to be the case).

My guess is that the same basic result in the paper could have been shown by using Python or Rust or similar as the root language, with a little work (time that you could have obtained by swapping out effort spent on the harry potter language development).

They do note:

> We generate 16 Python implementations per high-level plan on 100 randomly sampled problems and find that the performance drops to 6%.

But it isn't well-discussed (unless I skimmed too quickly) as to why a separate language is truly needed. They discuss advantages of Parsel, but there doesn't appear to be a deep ablation on why it is really necessary or where its supposed performance benefits come from, or how those could be enforced in other languages.

There is a bunch of discussion in the appendix, but IMO none of it is very convincing. E.g., Parsel enforces certain conventions around testing and validation...great, let's do that in Python or Rust or similar. Or--leveraging the value of LLMs--through a more natural language interface.

Yes, there is benefit to bridging these gaps in a "universal" manner...but, as per https://xkcd.com/927/, a new programming language is rarely the right solution.

farmingvillein t1_j5utusn wrote on January 25, 2023 at 6:49 PM

Reply to comment by MysteryInc152 in [D]Are there any known AI systems today that are significantly more advanced than chatGPT ? by Xeiristotle

You're probably right, but has anyone built an updated set of benchmarks to compare chatgpt with Google's publicly released numbers? (Maybe yes? Maybe I'm out of the loop?) Chatgpt is sufficiently different than gpt3.5 that I think we'd need to rerun benchmarks to compare.

(And, of course, even if we did, there are open questions of potential data leakage--always a concern, but maybe an extra concern here, since it is unclear whether OpenAI would have prioritized that issue in chatgpt build out. Certainly would have been low on my list, personally.)

farmingvillein t1_j3xui1m wrote on January 11, 2023 at 8:30 PM

Reply to comment by Gmroo in [News] "Once $92 billion in profit plus $13 billion in initial investment are repaid (to Microsoft) and once the other venture investors earn $150 billion, all of the equity reverts back to OpenAI." by Gmroo

No, you can edit your original post and place it in there:

> OpenAI must be super confident about the generality of their AI and Microsoft product integration.

<-- add your link here.

> During weekdays, if you'd like to share a link, place it in a self-post and provide some context.

farmingvillein t1_j3xoqr8 wrote on January 11, 2023 at 7:54 PM

Reply to comment by Gmroo in [News] "Once $92 billion in profit plus $13 billion in initial investment are repaid (to Microsoft) and once the other venture investors earn $150 billion, all of the equity reverts back to OpenAI." by Gmroo

> It didn't allow me to post the link

You are allowed to place it in the self-post, unless I misunderstand your comment.

farmingvillein t1_j3xmibw wrote on January 11, 2023 at 7:40 PM

Reply to [News] "Once $92 billion in profit plus $13 billion in initial investment are repaid (to Microsoft) and once the other venture investors earn $150 billion, all of the equity reverts back to OpenAI." by Gmroo

Can we not post quotes without links to the source?

farmingvillein t1_j2lnxdd wrote on January 2, 2023 at 6:09 AM

Reply to comment by amnezzia in [D] What are good ways of incorporating non-sequential context into a transformer model? by abc220022

Or, for vectors, just slam it into the start of the sequence directly (use a normalization technique if you need to align dimensionality).

If you feel the need, place some sort of separator token ('###') between the "context features" and the input data.
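
A rough sketch of the idea, in dependency-free Python so it runs anywhere. The linear projection used to align dimensionality is an assumption (one possible way to get the context vector to the model width), and the random weights and zero separator are stand-ins for what would be learned parameters in a real model:

```python
import random
from typing import List, Optional

Vector = List[float]


def project(vec: Vector, weights: List[Vector]) -> Vector:
    """Map vec (length d_ctx) to d_model using a d_model x d_ctx weight matrix."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


def prepend_context(
    context: Vector,
    tokens: List[Vector],
    weights: List[Vector],
    separator: Optional[Vector] = None,
) -> List[Vector]:
    """Place the projected context (and optional separator) before the token embeddings."""
    prefix = [project(context, weights)]
    if separator is not None:
        prefix.append(separator)
    return prefix + tokens


random.seed(0)
d_ctx, d_model, seq_len = 3, 4, 5

# Stand-ins for learned parameters (assumptions for illustration only).
weights = [[random.gauss(0.0, 0.1) for _ in range(d_ctx)] for _ in range(d_model)]
separator = [0.0] * d_model  # stand-in for the embedding of a '###' token

tokens = [[random.random() for _ in range(d_model)] for _ in range(seq_len)]
context = [1.0, 0.5, -2.0]  # the non-sequential "context features"

sequence = prepend_context(context, tokens, weights, separator)
print(len(sequence), len(sequence[0]))  # 7 4
```

The transformer then just sees a slightly longer sequence and can attend to the context slot like any other position.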

farmingvillein t1_j2awxls wrote on December 30, 2022 at 10:24 PM

Reply to comment by IdentifiableParam in [R] LAMBADA: Backward Chaining for Automated Reasoning in Natural Language - Google Research 2022 - Significantly outperforms Chain of Thought and Select Inference in terms of prediction accuracy and proof accuracy. by Singularian2501

Yes, and the old one was named relatively sanely:

> LAnguage Modeling Broadened to Account for Discourse Aspects

Whereas the new Google paper is a horror show in naming:

> We develop a hybrid LAnguage Model augmented BAckwarD chAining technique, dubbed LAMBADA

farmingvillein t1_j17n5n7 wrote on December 22, 2022 at 6:53 AM

Reply to [D] Hype around LLMs by Ayicikio

Was this written by an LLM?

farmingvillein t1_j12s4mn wrote on December 21, 2022 at 6:22 AM

Reply to comment by Dankmemexplorer in [R] Nonparametric Masked Language Modeling - MetaAi 2022 - NPM - 500x fewer parameters than GPT-3 while outperforming it on zero-shot tasks by Singularian2501

unfortunately still really slow (for now) to run, however:

> the speed of NPM is still on par with the speed of significantly larger parametric models that NPM outperforms

farmingvillein t1_j0ifmkt wrote on December 16, 2022 at 9:46 PM

Reply to comment by Own-Plantain8065 in [P] Medical question-answering without hallucinating by tmblweeds

This also would probably be a good way to gather data on where the model may not be working.

If a relatively recent systematic review is giving a different result than a contemporaneous and/or older set of papers, it is probably (would need to verify this empirically) more likely that something is being processed incorrectly.

(Reviews obviously also aren't perfect--but my guess is that you'd find that they are pretty robust indicators of something being off.)

farmingvillein t1_j0fh5lg wrote on December 16, 2022 at 6:36 AM

Reply to comment by breezedeus in [D] Is "natural" text always maximally likely according to language models ? by Emergency_Apricot_77

> If our words came out that way, people would know what you were going to say without even having to say it.

Even if this were true, this would not be correct in any sort of general sense, since every person/agent has its own unique set of (incompletely observable) context that seeds any output.