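Error when downloading multiple PDF files from a list of URLs in R

How to fix the error when downloading multiple PDF files from a list of URLs in R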

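I have a list of URLs and am trying to download the PDFs they link to with lapply. A download progress bar pops up, but I get the following message and the files are not downloaded: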

only first element of 'destfile' argument used
trying URL 'https://reliefweb.int/sites/reliefweb.int/files/resources/hno_car_2021_final_fr.pdf'
Content type 'application/pdf' length 22087482 bytes (21.1 MB)
downloaded 21.1 MB

names<- lapply(pdf,basename) # get names
destination<- paste0 ("~/",names)
lapply(pdf,download.file,destfile=destination)
pdf
[[1]]
[1] "https://reliefweb.int/sites/reliefweb.int/files/resources/hno_car_2021_final_fr.pdf"

[[2]]
[1] "https://reliefweb.int/sites/reliefweb.int/files/resources/rnro_centralsahel_oct_2020_fr_web.pdf"

[[3]]
[1] "https://reliefweb.int/sites/reliefweb.int/files/resources/rnro_centralsahel_oct_2020_en_web.pdf"

[[4]]
[1] "https://reliefweb.int/sites/reliefweb.int/files/resources/hno_2020-final.pdf"

[[5]]
[1] "https://reliefweb.int/sites/reliefweb.int/files/resources/hno_light_2020-en_final_0.pdf"

[[6]]
[1] "https://reliefweb.int/sites/reliefweb.int/files/resources/20200701_HNO_CENTROAMERICA%20ADDENDUM%20ING.pdf"

[[7]]
[1] "https://reliefweb.int/sites/reliefweb.int/files/resources/20200706%20ONEPAGER%20HNO%20Centroame%CC%81rica%20ING.pdf"

[[8]]
[1] "https://reliefweb.int/sites/reliefweb.int/files/resources/cmr_hno_2020-revised_print.pdf"

[[9]]
[1] "https://reliefweb.int/sites/reliefweb.int/files/resources/20200616_HNO_CENTROAMERICA%20ADDENDUM.pdf"

Solution

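I would handle everything in one pass, so that each URL is paired with its own destination path. In the question's code, lapply passes the entire destination vector as destfile on every call, and download.file uses only its first element. See the code example for the first two files.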
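A minimal sketch of that approach, assuming base R only and using the first two URLs from the question. The helper name `download_all` is hypothetical and not part of the original answer; `mode = "wb"` is added on the assumption that the files should be written in binary mode, which matters for PDFs on Windows.

```r
# First two URLs from the question's list
pdf <- list(
  "https://reliefweb.int/sites/reliefweb.int/files/resources/hno_car_2021_final_fr.pdf",
  "https://reliefweb.int/sites/reliefweb.int/files/resources/rnro_centralsahel_oct_2020_fr_web.pdf"
)

# Flatten the list to a character vector and build one destination path per URL
urls <- unlist(pdf)
destination <- file.path(path.expand("~"), basename(urls))

# Hypothetical helper: Map() walks both vectors in parallel, so every call to
# download.file() receives exactly one URL and exactly one destfile
download_all <- function(urls, destination) {
  Map(
    function(u, d) download.file(u, destfile = d, mode = "wb"),
    urls,
    destination
  )
}

# To run the downloads (requires network access):
# status <- download_all(urls, destination)
```

download.file returns 0 on success and a non-zero code on failure, so the list returned by `download_all` can be checked afterwards to see which files did not arrive.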
