如何解决删除停用词
我正在尝试从数据集(数据)中删除停用词。这是我到目前为止的代码(只是尝试从一行开始)。
+-----------------------------+-----------+------------+-----------------------------------------------------------------+-------------+-----------------------+------------------+
| column_name | type_name | type_align | alignment_description | type_length | suggestioned_position | current_position |
+-----------------------------+-----------+------------+-----------------------------------------------------------------+-------------+-----------------------+------------------+
| from_assembly_id | uuid | c | char alignment,no alignment needed | 16 | 1 | 7 |
| from_associated_to_id | uuid | c | char alignment,no alignment needed | 16 | 2 | 4 |
| from_clinic_id | uuid | c | char alignment,no alignment needed | 16 | 3 | 14 |
| from_facility_department_id | uuid | c | char alignment,no alignment needed | 16 | 4 | 15 |
| from_facility_id | uuid | c | char alignment,no alignment needed | 16 | 5 | 11 |
| from_facility_location_id | uuid | c | char alignment,no alignment needed | 16 | 6 | 10 |
| from_scan_at_facility_id | uuid | c | char alignment,no alignment needed | 16 | 7 | 3 |
| from_scan_id | uuid | c | char alignment,no alignment needed | 16 | 8 | 8 |
| from_scase_id | uuid | c | char alignment,no alignment needed | 16 | 9 | 13 |
| from_sterilizer_load_id | uuid | c | char alignment,no alignment needed | 16 | 10 | 9 |
| from_washer_load_id | uuid | c | char alignment,no alignment needed | 16 | 11 | 12 |
| from_web_user_id | uuid | c | char alignment,no alignment needed | 16 | 12 | 6 |
| hsys_id | uuid | c | char alignment,no alignment needed | 16 | 13 | 16 |
| id | uuid | c | char alignment,no alignment needed | 16 | 14 | 1 |
| inv_id | uuid | c | char alignment,no alignment needed | 16 | 15 | 2 |
| to_assembly_id | uuid | c | char alignment,no alignment needed | 16 | 16 | 27 |
| to_associated_to_id | uuid | c | char alignment,no alignment needed | 16 | 17 | 5 |
| to_clinic_id | uuid | c | char alignment,no alignment needed | 16 | 18 | 25 |
| to_facility_department_id | uuid | c | char alignment,no alignment needed | 16 | 19 | 26 |
| to_facility_id | uuid | c | char alignment,no alignment needed | 16 | 20 | 23 |
| to_facility_location_id | uuid | c | char alignment,no alignment needed | 16 | 21 | 20 |
| to_scan_at_facility_id | uuid | c | char alignment,no alignment needed | 16 | 22 | 18 |
| to_scan_id | uuid | c | char alignment,no alignment needed | 16 | 23 | 17 |
| to_scase_id | uuid | c | char alignment,no alignment needed | 16 | 24 | 24 |
| to_sterilizer_load_id | uuid | c | char alignment,no alignment needed | 16 | 25 | 21 |
| to_washer_load_id | uuid | c | char alignment,no alignment needed | 16 | 26 | 22 |
| to_web_user_id | uuid | c | char alignment,no alignment needed | 16 | 27 | 19 |
| created_dts | timestamp | d | double alignment,8 bytes on many machines,but by no means all | 8 | 28 | 32 |
| from_node_dts | timestamp | d | double alignment,but by no means all | 8 | 29 | 30 |
| to_node_dts | timestamp | d | double alignment,but by no means all | 8 | 30 | 31 |
| updated_dts | timestamp | d | double alignment,but by no means all | 8 | 31 | 33 |
| num_inst | int4 | i | int alignment,4 bytes on most machines | 4 | 32 | 28 |
| seconds | int4 | i | int alignment,4 bytes on most machines | 4 | 33 | 29 |
| from_associated_to | citext | i | int alignment,4 bytes on most machines | -1 | 34 | 39 |
| from_node | citext | i | int alignment,4 bytes on most machines | -1 | 35 | 36 |
| from_to_node | citext | i | int alignment,4 bytes on most machines | -1 | 36 | 38 |
| from_to_range | tsrange | d | double alignment,but by no means all | -1 | 37 | 34 |
| from_user_name | citext | i | int alignment,4 bytes on most machines | -1 | 38 | 41 |
| is_fake | citext | i | int alignment,4 bytes on most machines | -1 | 39 | 43 |
| sequence_ | citext | i | int alignment,4 bytes on most machines | -1 | 40 | 35 |
| source_ | citext | i | int alignment,4 bytes on most machines | -1 | 41 | 42 |
| to_associated_to | citext | i | int alignment,4 bytes on most machines | -1 | 42 | 40 |
| to_node | citext | i | int alignment,4 bytes on most machines | -1 | 43 | 37 |
| to_user_name | citext | i | int alignment,4 bytes on most machines | -1 | 44 | 44 |
+-----------------------------+-----------+------------+-----------------------------------------------------------------+-------------+-----------------------+------------------+
我一直收到这个错误,我已经删除并重新安装了 Java(32 位)。我也在运行 32 位 R,但我不断收到此错误:
Sys.setenv(JAVA_HOME = "C:/Program Files (x86)/Java/jre1.8.0_291")
qdap::rm_stopwords(data[1,2],tm::stopwords("SMART"),separate = T,strip = TRUE)
我以前没有任何这样做的经验,因此希望得到一些指导。
版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。