I have a theory that it should have a very different “personality” (probably more like writing style) depending on language because it’s an entirely different set of training data

In English chatGPT is rather academic and has a recognisable style of writing, if you’ve used it a bit you can usually get hints something was written by it just by reading it.

Does it speak in a similar tone, with similar mannerisms in other languages? (where possible, obviously some things don’t translate)

I don’t know a second language well enough to have natural conversation so I’m unable to test this myself, and may have worded things awkwardly from a lack of understanding

46 points

In two languages I’m learning, German and Chinese, I’ve found it to suffer from “translationese”. It’s grammatically correct, but the sentence structure and word choice feel like the answer was first written in English then translated.

No single sentence is wrong, but overall it sounds unnatural and has none of the “flavor” of the language. That also makes it bad for learning - it avoids a lot of sentence patterns you’ll see/hear in day to day life.

permalink
report
reply
9 points

Curious, maybe it was trained using existing translation tech rather than being trained on actual examples of the language like it was for English?

permalink
report
parent
reply
24 points

As a native German speaker I agree that ChatGPT is very English-flavored. I think it’s just because the sheer amount of English training data is so much larger that the patterns it learned from that bleed over into other languages. Traditional machine translations are also often pretty obvious in German, but they are more fundamentally wrong in a way that ChatGPT isn’t.

It’s also somewhat cultural. The output you get from ChatGPT often sounds overly verbose and downright ass-kissing in German, even though I know I wouldn’t get that impression from the same output in English, simply because the way you communicate in professional environments is vastly different. (There is no German equivalent to “I hope this email finds you well”, for example.)

permalink
report
parent
reply
1 point

“Ich hoffe, diese Nachricht erreicht Sie.” Would work, but I haven’t seen it used too. I also haven’t seen the English version, but that makes sense, as I work for a German company.

permalink
report
parent
reply
6 points

Doubt it. It was probably trained the most on English, and as a result, it applies English characteristics to other languages

permalink
report
parent
reply
2 points

There’s a lot more English-language data to start with, so it’s inevitable they did this or else just trained it primarily in English.

permalink
report
parent
reply
5 points

No single sentence is wrong, but overall it sounds unnatural and has none of the “flavor” of the language.

I’ve also found that it’s often contextually wrong. Like it doesn’t know what’s going on around it or how to interpret the previous paragraph or even the previous sentence, let alone the sentence two pages back that was actually relevant to the sentence it’s now working on.

permalink
report
parent
reply
1 point

Well probably because it does not know what’s going on around it. It only knows the words. It can’t interpret the words, only guess what is the most likely answer word by word.

permalink
report
parent
reply
44 points

In English chatGPT is rather academic

If by “academic” you mean it sounds like an undergraduate desperately trying to take up a lot of pages. It tends to waffle like crazy.

permalink
report
reply
18 points

Which is kind of what all LLMs are doing. Guess the next word, and don’t be wrong most of the time; repeat.

permalink
report
parent
reply
20 points
*
Deleted by creator
permalink
report
reply
5 points

How can a language work, if it’s signs are not in Unicode? That sounds incredibly tedious.

permalink
report
parent
reply
8 points
*
Deleted by creator
permalink
report
parent
reply
2 points

Oh my god y’all are Canadian?? (/s)

permalink
report
parent
reply
4 points

Not all languages have scripts, and not all scripts are on Unicode (yet). So in this example Tulu speakers would use either the Kannada or the Latin script, both of which are on Unicode.

permalink
report
parent
reply
13 points

If you ask ChatGPT to communicate to you in a different writing style it can do a decent job of doing so. It will also respect requests to decrease verbosity and formality. The default writing style is some kind of specific configuration they have made for it, it’s not a fundamental characteristic of it.

permalink
report
reply
2 points

This makes me wonder if they’ve written that configuration for every language though, or if the English instructions work on other languages

I wonder if you could tell it to write like Shakespeare or something in English, then have a chat with it in Spanish and have that persist

My guess would be that it wouldn’t transfer, otherwise it’d need to have some understanding of the words beyond just language

permalink
report
parent
reply
4 points
*

I think the misunderstanding here is in thinking ChatGPT has “languages”. It doesn’t choose a language. It is always drawing from everything it knows. The ‘configuration’ hence is the same for all languages, it’s just basically an invisible prompt telling it, in plain text, how to communicate.

When you change/add your personalized “Custom Instructions”, this is basically the same thing.

I would assume that this invisible context is in English, no matter what. It should make no difference.

permalink
report
parent
reply
1 point

I struggle to grasp how that could work though

It’s basically just predicting what word should come next, based on many many many examples, but in very few of these examples would a conversation be across multiple languages

Sure it’s drawing from all of its training at all times, but that training would inherently be separated

The general explanation at least afaik is that preprompts work because it can predict what instructions would normally prompt people to respond with but there would be few or no examples to draw on of a message being sent in one language and acted on in another

permalink
report
parent
reply
1 point

Yeah iirc it’s been confirmed that the brainwashing/muzzling don’t extend as much to other languages. It’s a bit easier to get it to talk about spicy topics in Russian in my experience

permalink
report
parent
reply
12 points

It doesn’t in french but it struggles with the use of the polite version of “you” against the friendly “you”

permalink
report
reply

Asklemmy

!asklemmy@lemmy.ml

Create post

A loosely moderated place to ask open-ended questions

Search asklemmy 🔍

If your post meets the following criteria, it’s welcome here!

  1. Open-ended question
  2. Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
  3. Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
  4. Not ad nauseam inducing: please make sure it is a question that would be new to most members
  5. An actual topic of discussion

Looking for support?

Looking for a community?

Icon by @Double_A@discuss.tchncs.de

Community stats

  • 10K

    Monthly active users

  • 5.9K

    Posts

  • 319K

    Comments