Rails 3.1,Ruby 1.9.2,AR /
MySQL.
如果相同类型的结果在此期间有很多结果,我正在寻找关于如何每个时间段(日)仅保留1个结果的建议.一个例子可能是跟踪股票价格.最初,我们会每15分钟节省一次价格,但只需要存储每个价格点一周.在第一周之后,我们每天只需要1个价格(最后记录,收盘价).
这是一个简单的第一次尝试,但是非常低效:
# stock has many prices,price has one stock # get all prices for single stock older than 1 week prices = stock.prices.where("created_at < ? ",Time.Now-1.week) prices.group_by{ |price| price.created_at.to_date }.each do |k,v| # group by day if v.count > 1 # if many price points that day (v[0]..v[v.size-2]).each {|r| r.delete} # delete all but last record in day end end
在此先感谢您的任何帮助/建议.我将尝试更新,因为我通过它,希望它将帮助有人下线.
解决方法
您可以通过在sql中完成所有操作,并通过将范围限制为上次运行它来提高效率.此外,如果添加一列以将较旧的日终条目标记为“已存档”,则会使查询更加简单.存档价格是您在一周后不会删除的价格.
rails generate migration add_archived_to_prices archived:boolean
迁移之前,请在created_at列上修改迁移到索引.
class AddArchivedToPrices < ActiveRecord::Migration def self.up add_column :prices,:archived,:boolean add_index :prices,:created_at end def self.down remove_index :prices,:created_at remove_column :prices,:archived end end
工作流程将是这样的:
# Find the last entry for each day for each stock using sql (more efficient than finding these in Ruby) keepers = Price.group('stock_id,DATE(created_at)'). having('created_at = MAX(created_at)'). select(:id). where('created_at > ?',last_run) # Keep track of the last run time to speed up subsequent runs # Mark them as archived Price.where('id IN (?)',keepers.map(&:id)).update_all(:archived => true) # Delete everything but archived prices that are older than a week Price.where('archived != ?',true). where('created_at < ?",Time.Now - 1.week). where('created_at > ?',last_run). # Keep track of the last run time to speed up subsequent runs delete_all
最后要注意的是,请确保不要将group()和update_all()组合在一起.使用update_all()忽略group().
版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。