In R, what is an efficient way to find the start and end indices of sequences of one or more numbers that increase by 1?


I have a numeric vector:
SampleVector <- c(2,4,7,8,9,12,14,16,17,19,23,24,25,26,27,29)
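I want to find the indices of the elements at the start and end of sequences that increase by 1, but I also want the indices of elements that are not part of any sequence.
Put another way: I want the indices of all elements that are not in the interior of a unit-step sequence.
For SampleVector, the indices I want are:
Desiredindices <- c(1,2,3,5,6,7,8,9,10,11,15,16)
That is, everything except the number 8 (in the 7:9 sequence) and the numbers 24, 25 and 26 (in the 23:27 sequence).
My best attempt so far is: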

SequenceStartAndEndindices <- function(vector){
  DifferenceVector          <- diff(vector)
  DiffRunLength             <- rle(DifferenceVector)
  IndicesOfSingleElements   <- which(DifferenceVector > 1) + 1
  IndicesOfEndOfSequences   <- cumsum(DiffRunLength$lengths)[which((DiffRunLength$lengths * DiffRunLength$values) == DiffRunLength$lengths)] + 1
  IndicesOfStartsOfSequences<- c(1,head(IndicesOfEndOfSequences+1,-1))
  UniqueIndices             <- unique(c(IndicesOfStartsOfSequences,IndicesOfEndOfSequences,IndicesOfSingleElements))
  Sortedindices             <- UniqueIndices[order(UniqueIndices)]
  return(Sortedindices)
}

This function gives me the correct answer:

> SequenceStartAndEndindices(vector = SampleVector)
 [1]  1  2  3  5  6  7  8  9 10 11 15 16

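...but it is almost impossible to follow, and it is not obvious that it applies in general. Is there a better way, or an existing function in some package?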

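For background, the purpose is to help turn long vectors of distance markers into something human-readable. For example, instead of "at kilometers: 1, 8, 9, 10, 11, 13", I would be able to provide "at kilometers: 1, 8 to 11 and 13".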

Solution

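A rule that should work here: exclude the index of a value only when both of the following hold: 1) the value is 1 greater than the previous value; and 2) it is 1 less than the next value. Every other index is kept.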
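
The rule above can be sketched in base R as follows. This is a minimal illustration and not necessarily the answerer's original code; the names follows_prev, precedes_next and KeptIndices are chosen here for illustration, and the vector is assumed to be sorted in ascending order.

```r
SampleVector <- c(2,4,7,8,9,12,14,16,17,19,23,24,25,26,27,29)

# TRUE where a value is exactly 1 greater than the previous value
# (the first element has no predecessor, so it is FALSE).
follows_prev <- c(FALSE, diff(SampleVector) == 1)

# TRUE where a value is exactly 1 less than the next value
# (the last element has no successor, so it is FALSE).
precedes_next <- c(diff(SampleVector) == 1, FALSE)

# Keep every index that is not strictly inside a run of unit steps.
KeptIndices <- which(!(follows_prev & precedes_next))
KeptIndices
# [1]  1  2  3  5  6  7  8  9 10 11 15 16
```

Because both conditions are vectorised comparisons on diff(), the whole computation is a single pass over the vector with no explicit loop.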


You can try using tapply in base R to create groups of consecutive numbers.

SampleVector <- c(2,4,7,8,9,12,14,16,17,19,23,24,25,26,27,29)

toString(tapply(SampleVector,cumsum(c(TRUE,diff(SampleVector) > 1)),function(x) {
          if(length(x) == 1) x else paste(x[1],x[length(x)],sep = ' to ')
}))

#[1] "2, 4, 7 to 9, 12, 14, 16 to 17, 19, 23 to 27, 29"
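
To get the exact wording from the question ("1, 8 to 11 and 13"), the same grouping idea can be wrapped in a small helper. The sketch below is only an illustration: FormatRuns is a hypothetical name, not an existing function from any package, and it assumes the input is sorted in ascending order with no duplicates.

```r
# Hypothetical helper (name chosen for illustration only).
# Assumes x is sorted ascending and contains no duplicates.
FormatRuns <- function(x) {
  if (length(x) == 0) return("")
  # Start a new group wherever the step from the previous value is not exactly 1.
  groups <- cumsum(c(TRUE, diff(x) != 1))
  # Describe each group as a single value or as "first to last".
  parts <- unname(tapply(x, groups, function(g) {
    if (length(g) == 1) as.character(g) else paste(g[1], g[length(g)], sep = " to ")
  }))
  n <- length(parts)
  if (n == 1) return(as.character(parts))
  # Join all but the last part with commas, then attach the last part with "and".
  paste(paste(parts[-n], collapse = ", "), parts[n], sep = " and ")
}

FormatRuns(c(1, 8, 9, 10, 11, 13))
# [1] "1, 8 to 11 and 13"
```

Returning character values from every branch keeps the result of tapply a plain character vector, so the final paste step does not depend on implicit type coercion.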
