微信公众号搜"智元新知"关注
微信扫一扫可直接关注哦!

链接和嵌套库的文本匹配大于与其匹配的库

如何解决链接和嵌套库的文本匹配大于与其匹配的库

我想链接和嵌套文本。我可以使用以下R代码向后进行此操作。

 BV.MC.Entry.data.AB = data.frame(unique_female_id=c("abc123","ade456","dde345","abc1234","abc12"))
BV.MC.Entry = data.frame(unique_id=c("abc123","dde345"))

to_search_in.female <- data.table(BV.MC.Entry.data.AB[!duplicated(BV.MC.Entry.data.AB$unique_female_id),c(1)])
colnames(to_search_in.female)=c("unique_female")

to_search_with.female <- tibble(BV.MC.Entry[!duplicated(BV.MC.Entry$unique_id),c(1)])
colnames(to_search_with.female) = "unique_female_id"

dim(to_search_in.female);dim(to_search_in.male);dim(to_search_with.male);dim(to_search_with.female)

linked.female.peds = to_search_with.female %>%
  mutate(data = list(to_search_in.female)) %>%
  unnest(data) %>%
  filter(str_detect(unique_female,fixed(unique_female_id))) %>%
  #select(unique_female,female.pedigree,unique_female_id) %>%
  group_by(unique_female_id) %>%
  summarise(strings = str_c(unique_female,collapse = ",")) 

linked.female.peds = linked.female.peds %>% tidyr::separate("strings",sep="[,][ ]",c("match1","match2","match3","match4","match5","match6","match7","match8","match9","match10","match11","match12","match13","match14","match15","match16"),extra="merge",remove=F)
linked.female.peds

# A tibble: 3 x 18
  unique_female_id strings match1 match2 match3 match4
  <fct>            <chr>   <chr>  <chr>  <chr>  <chr> 
1 abc123           abc123~ abc123 abc12~ NA     NA    
2 ade456           ade456  ade456 NA     NA     NA    
3 dde345           dde345  dde345 NA     NA     NA    

我希望结果返回此数据帧:

所以它将代替abc12前进,并且match2列将等于abc1234,NA,NA。

版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。