微信公众号搜"智元新知"关注
微信扫一扫可直接关注哦!

ruby-on-rails – Rails每天只保留许多记录中的1个.保持最后,删除休息

Rails 3.1,Ruby 1.9.2,AR / MySQL.

如果相同类型的结果在此期间有很多结果,我正在寻找关于如何每个时间段(日)仅保留1个结果的建议.一个例子可能是跟踪股票价格.最初,我们会每15分钟节省一次价格,但只需要存储每个价格点一周.在第一周之后,我们每天只需要1个价格(最后记录,收盘价).

这是一个简单的第一次尝试,但是非常低效:

# stock has many prices,price has one stock
# get all prices for single stock older than 1 week
prices = stock.prices.where("created_at < ? ",Time.Now-1.week)  
prices.group_by{ |price| price.created_at.to_date }.each do |k,v| # group by day
  if v.count > 1  # if many price points that day
    (v[0]..v[v.size-2]).each {|r| r.delete} # delete all but last record in day
  end
end

在此先感谢您的任何帮助/建议.我将尝试更新,因为我通过它,希望它将帮助有人下线.

解决方法

您可以通过在sql中完成所有操作,并通过将范围限制为上次运行它来提高效率.此外,如果添加一列以将较旧的日终条目标记为“已存档”,则会使查询更加简单.存档价格是您在一周后不会删除的价格.

rails generate migration add_archived_to_prices archived:boolean

迁移之前,请在created_at列上修改迁移到索引.

class AddArchivedToPrices < ActiveRecord::Migration
  def self.up
    add_column :prices,:archived,:boolean
    add_index :prices,:created_at
  end

  def self.down
    remove_index :prices,:created_at
    remove_column :prices,:archived
  end
end

工作流程将是这样的:

# Find the last entry for each day for each stock using sql (more efficient than finding these in Ruby)
keepers =
  Price.group('stock_id,DATE(created_at)').
        having('created_at = MAX(created_at)').
        select(:id).
        where('created_at > ?',last_run) # Keep track of the last run time to speed up subsequent runs

# Mark them as archived
Price.where('id IN (?)',keepers.map(&:id)).update_all(:archived => true)

# Delete everything but archived prices that are older than a week
Price.where('archived != ?',true).
      where('created_at < ?",Time.Now - 1.week).
      where('created_at > ?',last_run). # Keep track of the last run time to speed up subsequent runs
      delete_all

最后要注意的是,请确保不要将group()和update_all()组合在一起.使用update_all()忽略group().

版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。

相关推荐