微信公众号搜"智元新知"关注
微信扫一扫可直接关注哦!

c# – 解析邮件地址的正则表达式

我有一个地址类,使用正则表达式从地址的第一行解析房屋号码,街道名称和街道类型.这段代码通常运行良好,但我发布在这里与社区分享,看看有没有人有改进的建议.

注意:STREETTYPES和QUADRANT常量分别包含所有相关的街道类型和象限.

在这里添加一个子集:

private const string STREETTYPES = @"ALLEY|ALY|ANNEX|AX|ARCADE|ARC|AVENUE|AV|AVE|BAYOU|BYU|BEACH|...";

private const string QUADRANTS = "N|norTH|S|SOUTH|E|EAST|W|WEST|NE|norTHEAST|NW|norTHWEST|SE|SOUTHEAST|SW|SOUTHWEST";

HouseNumber,Quadrant,StreetName和StreetType都是该类的属性.

private void Parse(string line1)
    {
        HouseNumber = string.Empty;
        Quadrant = string.Empty;
        StreetName = string.Empty;
        StreetType = string.Empty;

        if (!String.IsNullOrEmpty(line1))
        {
            string noperiodsLine1 = String.copy(line1);
            noperiodsLine1 = noperiodsLine1.Replace(".","");

            string addressparseRegEx =
                @"(?ix)
            ^
            \s*
            (?:
               (?<housenumber>\d+)
               (?:(?:\s+|-)(?<quadrant>" +
                QUADRANTS +
                @"))?
               (?:(?:\s+|-)(?<streetname>\S+(?:\s+\S+)*?))??
               (?:(?:\s+|-)(?<quadrant>" +
                QUADRANTS + @"))?
               (?:(?:\s+|-)(?<streettype>" + STREETTYPES +
                @"))?
               (?:(?:\s+|-)(?<streettypequalifier>(?!(?:" +
                QUADRANTS +
                @"))(?:\d+|\S+)))?
               (?:(?:\s+|-)(?<streettypequadrant>(" +
                QUADRANTS + @")))??
               (?:(?:\s+|-)(?<suffix>(?:ste|suite|po\sBox|apt)\s*\S*))?
            |
               (?:(?:po|postoffice|post\s+office)\s+Box\s+(?<postofficeBox>\S+))
            )
            \s*
            $
            ";
            Match match = Regex.Match(noperiodsLine1,addressparseRegEx);
            if (match.Success)
            {
                HouseNumber = match.Groups["housenumber"].Value;
                Quadrant = (string.IsNullOrEmpty(match.Groups["quadrant"].Value)) ? match.Groups["streettypequadrant"].Value : match.Groups["quadrant"].Value;
                if (match.Groups["streetname"].Captures.Count > 1)
                {
                    foreach (Capture capture in match.Groups["streetname"].Captures)
                    {
                        StreetName += capture.Value + " ";
                    }
                    StreetName = StreetName.Trim();
                }
                else
                {
                    StreetName = (string.IsNullOrEmpty(match.Groups["streetname"].Value)) ? match.Groups["streettypequalifier"].Value : match.Groups["streetname"].Value;
                }
                StreetType = match.Groups["streettype"].Value;

                //if the matched street type is found
                //use the abbreviated version...especially for credit bureau calls
                string streetTypeAbbreviation;
                if (StreetTypes.TryGetValue(StreetType.toupper(),out streetTypeAbbreviation))
                {
                    StreetType = streetTypeAbbreviation;
                }
            }
        }

    }

解决方法

我不知道你在哪个国家,但是如果你在美国,想要花费一些钱来进行地址验证,你可以购买相关的usps产品 here.而 here是找到免费的单词列表的好地方usps用于预期的单词和缩写.我相信其他国家可以使用类似的网页.

原文地址:https://www.jb51.cc/csharp/93677.html

版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。

相关推荐