Very interesting thread about reversal knowledge(twitter.com)

posted 1 year ago

noneabove1182@sh.itjust.worksM

localllama@sh.itjust.works

Reversal knowledge in this case being, if the LLM knows that A is B, does it also know that B is A, and apparently the answer is pretty resoundingly no! I’d be curious to see if some CoT affected the results at all

Sort:

Hot Top Controversial New Old

You are viewing a single thread.

View all comments

[ - ]

rufus@discuss.tchncs.de

8 points

1 year ago

Meh. Either I’m doing something wrong. Or we should stop linking (only) twitter posts. I can only see the original 42 words and a picture. No mentioned paper or thread that clarifies what this means.

For other people with the same problem, here’s the website of the person: https://owainevans.github.io/

And here’s the mentioned paper: https://owainevans.github.io/reversal_curse.pdf

permalink

report

[ - ]

Discover5164@lemm.ee

3 points

1 year ago

thank you

permalink

report

parent