微信公众号搜"智元新知"关注
微信扫一扫可直接关注哦!

c# – 按文化排序不按预期工作

为什么“Ū”首先代替“U”?

CultureInfo ci = CultureInfo.GetCultureInfo("lt-LT");
    bool ignoreCase = true; //whether comparison should be case-sensitive
    StringComparer comp = StringComparer.Create(ci,ignoreCase);
    string[] unordered = { "Za","Žb","Ūa","Ub" };
    var ordered = unordered.OrderBy(s => s,comp);

订购结果:
UA
UB
ZA
ZB

应该是:UbŪaZaŽb

这是立陶宛字母的顺序. https://www.assorti.lt/userfiles/uploader/no/norvegiska-lietuviska-delione-abecele-maxi-3-7-m-vaikams-larsen.jpg

解决方法

我刚刚为你的问题做了一个(有限的)解决方案.
这不是优化的,但它可以给出如何解决它的想法.
我创建一个LithuanianString类,它只用于封装你的字符串.
此类实现IComparable以便能够对LithuanianString列表进行排序.

这可能是班级:

public class LithuanianString : IComparable<LithuanianString>
{

    const string UpperAlphabet = "AĄBCČDEĘĖFGHIĮYJKLMnopRSŠTUŲŪVZŽ";
    const string LowerAlphabet = "aąbcčdeęėfghiįyjklmnoprsštuųūvzž";
    public string String;

    public LithuanianString(string inputString)
    {
        this.String = inputString;
    }

    public int Compareto(LithuanianString other)
    {
        var maxIndex = this.String.Length <= other.String.Length ? this.String.Length : other.String.Length;
        for (var i = 0; i < maxIndex; i++)
        {
            //We make the method non case sensitive
            var indexOfThis = LowerAlphabet.Contains(this.String[i])
                ? LowerAlphabet.IndexOf(this.String[i])
                : UpperAlphabet.IndexOf(this.String[i]);

            var indexOfOther = LowerAlphabet.Contains(other.String[i])
                ? LowerAlphabet.IndexOf(other.String[i])
                : UpperAlphabet.IndexOf(other.String[i]);

            if (indexOfOther != indexOfThis)
                return indexOfThis - indexOfOther;
        }
        return this.String.Length - other.String.Length;
    }
}

以下是我尝试的示例:

static void Main(string[] args)
    {
        string[] unordered = { "Za","Ub" };

        //Create a list of lithuanian string from your array
        var lithuanianStringList = (from unorderedString in unordered
            select new LithuanianString(unorderedString)).ToList();
        //Sort it 
        lithuanianStringList.sort();

        //display it
        Console.WriteLine(Environment.NewLine + "My Comparison");
        lithuanianStringList.ForEach(c => Console.WriteLine(c.String));
    }

输出是预期的:

Ub Ūa Za Žb

此类允许仅通过替换开头定义的两个常量中的字符来创建自定义字母表.

版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。

相关推荐