Introduction to Word2Vec.Net
Word2Vec.Net is a .NET port of Word2Vec, a tool that converts words into vector form.
Example usage:
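// ArgPos is assumed to return the index of a flag in args, or -1 if the flag is absent.
int i;
var builder = Word2VecBuilder.Create();
if ((i = ArgPos("-train", args)) > -1)
    builder.WithTrainFile(args[i + 1]);
if ((i = ArgPos("-output", args)) > -1)
    builder.WithOutputFile(args[i + 1]);
// All other parameters are set to their default values.
var word2Vec = builder.Build();
word2Vec.TrainModel();

// i still holds the position of "-output", so args[i + 1] is the output file.
// This is only valid when "-output" was actually passed.
var distance = new Distance(args[i + 1]);
BestWord[] bestwords = distance.Search("some_word");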
Or:
// More explicit option
string trainfile = "C:/data.txt";
string outputFileName = "C:/output.bin";
var word2Vec = Word2VecBuilder.Create()
    .WithTrainFile(trainfile)        // Use text data from this file to train the model
    .WithOutputFile(outputFileName)  // File in which to save the resulting word vectors / word clusters
    .WithSize(200)                   // Set size of word vectors; default is 100
    .WithSaveVocubFile()             // The vocabulary will be saved to <file>
    .WithDebug(2)                    // Set the debug mode (default = 2 = more info during training)
    .WithBinary(1)                   // Save the resulting vectors in binary mode; default is 0 (off)
    .WithCBow(1)                     // Use the continuous bag of words model; default is 1 (use 0 for skip-gram model)
    .WithAlpha(0.05)                 // Set the starting learning rate; default is 0.025 for skip-gram and 0.05 for CBOW
    .WithWindow(7)                   // Set max skip length between words; default is 5
    .WithSample((float) 1e-3)        // Set threshold for occurrence of words. Those that appear with higher frequency in the training data will be randomly down-sampled; default is 1e-3, useful range is (0, 1e-5)
    .WithHs(0)                       // Use hierarchical softmax; default is 0 (not used)
    .WithNegative(5)                 // Number of negative examples; default is 5, common values are 3 - 10 (0 = not used)
    .WithThreads(5)                  // Use <int> threads (default 12)
    .WithIter(5)                     // Run more training iterations (default 5)
    .WithMinCount(5)                 // Discard words that appear fewer than <int> times; default is 5
    .WithClasses(0)                  // Output word classes rather than word vectors; default number of classes is 0 (vectors are written)
    .Build();
word2Vec.TrainModel();

// Load the trained vectors from the same file they were written to.
var distance = new Distance(outputFileName);
BestWord[] bestwords = distance.Search("some_word");
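The first example calls an `ArgPos` helper that the snippet does not define. The sketch below shows one way such a helper could look, modeled loosely on the function of the same name in the original C word2vec tool. The `CommandLine` class name, the exception type, and the sample file names are assumptions made for illustration; this is not taken from the Word2Vec.Net source.

```csharp
using System;

public static class CommandLine
{
    // Hypothetical helper, not part of the Word2Vec.Net API.
    // Returns the index of `flag` in `args`, or -1 when the flag is absent.
    public static int ArgPos(string flag, string[] args)
    {
        for (int a = 0; a < args.Length; a++)
        {
            if (!string.Equals(flag, args[a], StringComparison.Ordinal))
                continue;

            // A flag in the last position has no value following it.
            if (a == args.Length - 1)
                throw new ArgumentException($"Argument missing for {flag}");

            return a;
        }
        return -1;
    }
}

public static class Program
{
    public static void Main()
    {
        // Sample arguments, chosen only for this illustration.
        var args = new[] { "-train", "corpus.txt", "-output", "vectors.bin" };

        int i = CommandLine.ArgPos("-output", args);
        Console.WriteLine(i > -1 ? args[i + 1] : "(no -output flag)");
    }
}
```

Because the helper returns -1 for a missing flag, callers should check the result before reading `args[i + 1]`, as the `if ((i = ArgPos(...)) > -1)` pattern in the first example does.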
Word2Vec.Net official site
https://github.com/eabdullin/Word2Vec.Net#word2vecnet