1 point
Any individual action can be combatted easily. A million different signatures and headers is a whole different .
Mind you, LLM training data is polluted with anything and everything, including other languages. Recently, the best performance has been reached using higher quality data.