I want to process 4714 Message-IDs to collect some patch stats for the last year. I could collect them with #b4, which uses the #lore archive, but I wonder: does it cache, or will I end up with lots of redundant mbx files? Worse still, would I just be hammering an overloaded server? I guess I should check out the #PublicInbox docs.
@stsquad Why don't you just download the 12 monthly mboxes? (Well, qemu has them; not sure about everything else.)
@monsieuricon For every Message-Id in the commits from the last year, I want to fetch the thread it came from and analyse the patch revision count and the metadata in the commit message.
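For what it's worth, a rough sketch of the first half of that (pulling the Message-Id trailers out of a year of commit messages). It assumes the commits carry a `Message-Id: <...>` trailer line; the function names and the NUL-separated `git log` format are my own choices for illustration, not anything b4 or public-inbox provides.

```python
import re
import subprocess

# Matches a "Message-Id: <id>" trailer on its own line. The spelling
# (Message-Id / Message-ID) and the angle brackets vary between commits,
# so the match is case-insensitive and the brackets are optional.
MSGID_RE = re.compile(
    r"^\s*Message-Id:\s*<?([^<>\s]+)>?\s*$",
    re.IGNORECASE | re.MULTILINE,
)


def extract_message_ids(log_text):
    """Return the unique Message-Ids found in log_text, in first-seen order."""
    return list(dict.fromkeys(MSGID_RE.findall(log_text)))


def message_ids_for_last_year(repo="."):
    """Collect Message-Ids from the commit bodies of the last year.

    Assumes `git` is on PATH and `repo` is a checkout of the project.
    """
    out = subprocess.run(
        ["git", "-C", repo, "log", "--since=1 year ago", "--format=%B%x00"],
        check=True,
        capture_output=True,
        text=True,
    ).stdout
    return extract_message_ids(out)
```

Calling `message_ids_for_last_year("/path/to/checkout")` should give the list of IDs to feed into whatever fetches the threads.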
@monsieuricon it sounds like I should just use public-inbox and #lei to batch the queries and sync into a de-duped Maildir archive. Isn't it better suited to this sort of archive delving?
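Something like this is what I have in mind for the batching: a sketch that only builds the `lei q` command lines, so they can be eyeballed before anything touches the server. The remote URL, the batch size of 50, and the helper name are assumptions for illustration, and I have not verified the exact flag combination against the lei docs yet. Each `m:` term and each `OR` is passed as its own argument, on the assumption that lei treats an argument containing spaces as a phrase.

```python
def build_lei_commands(message_ids, maildir,
                       remote="https://lore.kernel.org/qemu-devel/",
                       batch_size=50):
    """Build one `lei q` argv list per batch of Message-Ids.

    Assumed flags (check lei-q(1) before relying on them):
      -I <url>       search the given external public-inbox
      -o maildir:..  write results to a Maildir
      --threads      include the whole thread for each match
      --augment      add to the existing Maildir instead of replacing it
    """
    commands = []
    for start in range(0, len(message_ids), batch_size):
        batch = message_ids[start:start + batch_size]
        # Interleave the m:<id> terms with OR: m:a OR m:b OR m:c
        terms = []
        for mid in batch:
            if terms:
                terms.append("OR")
            terms.append(f"m:{mid}")
        commands.append(
            ["lei", "q", "-I", remote, "-o", f"maildir:{maildir}",
             "--threads", "--augment", *terms]
        )
    return commands
```

Each returned list can be handed to `subprocess.run(cmd, check=True)`, ideally with a pause between batches to go easy on the server.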