[Chinese Sorting] Sort by name is messed up

775405984
Posts: 40
Joined: Tue May 30, 2017 5:32 pm

[Chinese Sorting] Sort by name is messed up

Post by 775405984 »

Hi, my name is Matt and I'm Chinese.

I find this software very useful. But when I type a random word into the search bar, the sorting method is quite messed up.

The files and folders with Chinese names are not sorted alphabetically (by Pinyin); instead, they are sorted with an outdated method called stroke order.

No one uses this kind of ancient technique, except someone with zero knowledge of Pinyin.

Some might say you can change the sorting method in the Control Panel on Windows. I did, and nothing happened.

So I was wondering if you guys could fix it, please!!! Thank you!!!

https://en.wikipedia.org/wiki/Stroke_order
https://en.wikipedia.org/wiki/Pinyin
775405984
Posts: 40
Joined: Tue May 30, 2017 5:32 pm

Re: [Chinese Sorting] Sort by name is messed up

Post by 775405984 »

were you ever gonna fix this ever? Come on, please.
NotNull
Posts: 5461
Joined: Wed May 24, 2017 9:22 pm

Re: [Chinese Sorting] Sort by name is messed up

Post by NotNull »

775405984 wrote: were you ever gonna fix this ever? Come on, please.
It is number 227 on the to-do list.

Cut @void some slack ....
Debugger
Posts: 630
Joined: Thu Jan 26, 2017 11:56 am

Re: [Chinese Sorting] Sort by name is messed up

Post by Debugger »

NotNull, such a long list will only get finished in the next century :lol:
Debugger
Posts: 630
Joined: Thu Jan 26, 2017 11:56 am

Re: [Chinese Sorting] Sort by name is messed up

Post by Debugger »

I am most annoyed by the need to type the preceding expressions; it is a waste of time for me, and not everyone remembers them all.
void
Developer
Posts: 16745
Joined: Fri Oct 16, 2009 11:31 pm

Re: [Chinese Sorting] Sort by name is messed up

Post by void »

Currently, Everything sorts filenames by Unicode code points, which is completely wrong, but fast!
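
To illustrate, here is a minimal Python sketch of what code point ordering does to a few Chinese names. Python's default string comparison also goes code point by code point, so it stands in for the behavior described above. The sample names are made up for the example, and this is not Everything's actual code.

```python
# Hypothetical sample names, listed here in pinyin order:
# a, bai, cheng, li, wang, zhang.
names = ["阿", "白", "成", "李", "王", "张"]

# Python compares str values code point by code point, so a plain
# sorted() call reproduces a "sort by Unicode code points" ordering.
by_code_point = sorted(names)

for name in by_code_point:
    # Print each name next to the code point of its first character.
    print(name, f"U+{ord(name[0]):04X}")
```

The result no longer follows the pinyin order of the input list, because code point values of Han characters have no relation to pronunciation.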

I have added support for the Unicode Collation Algorithm (UCA) to my TODO list. Hopefully this will be available in the next release of Everything.

While this is not pinyin, it might be 'good enough'.
Implementing Pinyin at this stage would be unfeasible. The sorting rules are too complex, and I can't use third-party sorting or the Windows API to sort, as these could change at any time and it is critical that the Everything database is sorted in a specific way.

I also have concerns about UCA, as storing the collation lookup tables will require quite a bit of data (128K+).
There is also a small performance hit from adding collation lookup tables.

https://unicode.org/faq/collation.html
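
As a rough illustration of what a collation lookup table means, here is a toy Python sketch. The weight table, the fallback rule, and the name toy_sort_key are all invented for this example. Real UCA data is far larger and uses several weight levels, and this is not how Everything implements it.

```python
# Toy weight table: maps a character to a single primary sort weight.
# These entries are invented for the example; they are NOT real UCA data.
TOY_WEIGHTS = {"a": 1, "A": 1, "b": 2, "B": 2, "c": 3, "C": 3}

# Characters missing from the table sort after all listed ones,
# ordered among themselves by code point.
FALLBACK_BASE = 1_000_000

def toy_sort_key(name):
    """Build a sort key by looking up each character's weight."""
    return [TOY_WEIGHTS.get(ch, FALLBACK_BASE + ord(ch)) for ch in name]

names = ["b", "C", "a", "B"]
print(sorted(names))                    # plain code point order
print(sorted(names, key=toy_sort_key))  # order driven by the weight table
```

Every comparison now goes through a table lookup per character, which hints at both costs mentioned above: the table has to be stored somewhere, and building keys is slower than comparing raw code points.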
void
Developer
Posts: 16745
Joined: Fri Oct 16, 2009 11:31 pm

Re: [Chinese Sorting] Sort by name is messed up

Post by void »

775405984
Posts: 40
Joined: Tue May 30, 2017 5:32 pm

Re: [Chinese Sorting] Sort by name is messed up

Post by 775405984 »

void wrote: Sun Jul 11, 2021 11:49 pm The Everything 1.5 alpha adds support for sorting by Unicode weights.
I updated to 1269a, and it changed nothing.

Thanks for trying, but I'm not seeing the improvements.
therube
Posts: 4977
Joined: Thu Sep 03, 2009 6:48 pm

Re: [Chinese Sorting] Sort by name is messed up

Post by therube »

Maybe some hints as to what it is not doing correctly?

And is it (at least) "correct" per the Unicode Collation Algorithm?
775405984
Posts: 40
Joined: Tue May 30, 2017 5:32 pm

Re: [Chinese Sorting] Sort by name is messed up

Post by 775405984 »

therube wrote: Mon Jul 26, 2021 5:21 pm Maybe some hints as to what it is not doing correctly?

And is it (at least) "correct" per the Unicode Collation Algorithm?
[Attachment: Screenshot_2.png (97.42 KiB)]
This is an example of how messed up sorting by name is.

I didn't find anything wrong with UCA though. What I did find is that Unicode doesn't support pinyin.

You could map UCA characters to GBK; GBK supports pinyin.

I'm not a developer, so I don't know how to do it. I'll post this to a Chinese website to see if anyone has any ideas.
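
A minimal sketch of the GBK idea, assuming Python's built-in gbk codec. The most common characters (the GB2312 level-1 set) are laid out in GBK in pinyin order, so comparing GBK-encoded bytes gives a rough pinyin sort for them. Rarer characters are not laid out by pinyin, and a character with several readings gets only one position, so this is an approximation and not true pinyin collation. The sample names and the helper name gbk_sort_key are made up for the example.

```python
def gbk_sort_key(name):
    """Return the GBK-encoded bytes of a name for use as a sort key."""
    # errors="replace" keeps sorting from failing on characters
    # that GBK cannot encode; they become "?" in the key.
    return name.encode("gbk", errors="replace")

# Hypothetical sample names; pinyin: zhang, wang, li, cheng, bai, a.
names = ["张", "王", "李", "成", "白", "阿"]

by_gbk = sorted(names, key=gbk_sort_key)
print(by_gbk)
```

For these six common characters the output comes out in pinyin order (a, bai, cheng, li, wang, zhang).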

Thank you!