Investigating Cultural Value Selection Biases in LLMs

General thought:

Interesting idea; for most part, I thought it reads quite well. The methods are interesting.
I will be speaking from a beginner’s perspective — so I will focusing more on what I don’t understand; and therefore needs more explanation / clarity.
Some assumptions that I don’t understand
- What does LLM values mean? Maybe some examples?
- What are the issues if LLM fail to be “multi-cultural”? Any real-world implications?
- In modern LLMs, I don’t think the system prompt is static? While for LLMs tested and other older LLMs, there is no memory — so the system prompt also does not contain the user’s country. Why do you use system vs. user prompt and not just conflicting countries in the system prompt?
I don’t get what does Composite Identity, User and System baselines from Section 5. What country-pair are we talking about? Is it different from the methodology?
Idk what’s going on with Figure 3 and 4.
- How does the Shifted to Sys and Shifted to User points calculated?
- Where are the global south (Myamar and friends)? Why are they not in the Figure?
- I can’t see the “clear upward left movement”, and the “shift in clusters to the lower-right part of the plot”
- I can’t clearly see the Western countries in Figure 4.
- Why are they multiple data points for each country?

Explorer