> This behavior can be customized, but it is complicated by ambiguities in recognizing numbers within strings (because they may be formatted according to different language conventions). Once each number is recognized, it can be preprocessed to convert it into a format that allows for correct numeric sorting, such as a textual version of the IEEE numeric format.
Yeah, figures.
http://www.unicode.org/reports/tr10/
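A rough sketch of the "preprocess each number" idea from the quote above, in Python. It swaps in simple zero-padding instead of the textual IEEE format the report mentions, and it assumes numbers are plain runs of decimal digits (no signs, decimal points, or grouping separators, which is exactly where the ambiguity bites). `natural_key` and `width` are names made up for this note.

```python
import re

# In str patterns, \d matches any Unicode decimal digit, not only 0-9.
_DIGITS = re.compile(r"(\d+)")

def natural_key(text, width=20):
    """Zero-pad every digit run so plain string comparison orders numbers numerically."""
    parts = _DIGITS.split(text)
    # Odd indices hold the captured digit runs; int() accepts Unicode decimal digits.
    return "".join(
        str(int(p)).zfill(width) if i % 2 else p
        for i, p in enumerate(parts)
    )

names = ["file10.txt", "file2.txt", "file1.txt"]
sorted_names = sorted(names, key=natural_key)
```

Numbers longer than `width` digits would still sort incorrectly, so the padding width is a limit you have to pick up front.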
> Phonetic sorting of Han characters requires use of either a lookup dictionary of words or, more typically, special construction of programs or databases to maintain an associated phonetic spelling for the words in the text.
A thorny path…
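To make the "lookup dictionary" route concrete, a minimal sketch in Python. The `READINGS` table is a hypothetical three-entry stand-in for a real dictionary or a stored reading column, and comparing kana by code point is only an approximation of proper collation.

```python
# Hypothetical reading table: word -> kana reading. A real system would need
# a full dictionary or a reading field stored alongside each record.
READINGS = {
    "東京": "とうきょう",
    "大阪": "おおさか",
    "京都": "きょうと",
}

def phonetic_key(word):
    # Fall back to the word itself when no reading is known.
    return READINGS.get(word, word)

cities = ["東京", "大阪", "京都"]
by_reading = sorted(cities, key=phonetic_key)
```

Sorting the same list without the key goes by kanji code point instead, which gives a different order. Words missing from the table silently fall back to that order too, which is part of why this path is thorny.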
Numerals - Thai script - Wikipedia https://ja.m.wikipedia.org/wiki/%E3%82%BF%E3%82%A4%E6%96%87%E5%AD%97#%E6%95%B0%E5%AD%97
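Thai digits are ordinary Unicode decimal digits, so recognizing them is at least the easy part. A small sketch, assuming only per-character digit mapping is needed; `to_ascii_digits` is a name invented for this note.

```python
import unicodedata

def to_ascii_digits(text):
    """Replace every Unicode decimal digit with its ASCII equivalent."""
    return "".join(
        str(unicodedata.decimal(ch)) if ch.isdecimal() else ch
        for ch in text
    )

thai = "๑๒๓"
ascii_form = to_ascii_digits(thai)
value = int(thai)  # int() also accepts Unicode decimal digits directly
```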