social.kernel.org

Jarkko Sakkinen

Now that I’m using maildirs again, I could fill in a feature that #aerc lacks (compared to #mutt) by using #notmuch. This script de-duplicates my emails:

#!/usr/bin/env bash
#
# Copyright (c) Jarkko Sakkinen 2024
# JSON queries ripped from https://github.com/esovetkin/notmuch-deduplicate

set -e

QUERY='*'

notmuch show \
  --format=json \
  --entire-thread=false \
  --body=false \
  "${QUERY}" | \
  jq \
    -n \
    --stream 'fromstream(1 | truncate_stream(inputs))' | \
  jq -r '.. | .filename? // empty | @tsv' | \
  grep "$(printf '\t')" | \
  awk -F'\t' '{for (i=2; i<=NF; i++) print $i}' | \
  xargs -I{} rm -v "{}"
notmuch new

# vim: filetype=sh ts=2 sw=2 et

#email
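
Since the pipeline above deletes files straight away, a dry-run pass may be worth having first. The sketch below isolates the step that picks the extra copies. `list_extra_copies` is a hypothetical helper name, and it assumes the same input as the `grep`/`awk` stage of the script: one message per line, with its filenames separated by tabs.

```shell
# Hypothetical helper, not part of the original script.
# Reads one message per line (filenames separated by tabs) and prints every
# filename except the first, i.e. the copies the script would remove.
list_extra_copies() {
  awk -F'\t' 'NF > 1 { for (i = 2; i <= NF; i++) print $i }'
}

# Example with made-up paths: the first message has three copies, the
# second has only one and is therefore left alone.
printf 'a/cur/1\tb/cur/1\tc/cur/1\nd/cur/2\n' | list_extra_copies
```

Swapping the final `xargs ... rm` stage for this function would print the candidates for review instead of removing them.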

Jarkko Sakkinen

jarkko

1 year ago

Reply to

@jani How? I just started to use it.

Jarkko Sakkinen

jarkko

1 year ago

Reply to @jarkko

@jani And it does work ;-) That is always a good state to begin with. I actually wanted to learn notmuch only because aerc does not have mutt's ~= pattern.

Jarkko Sakkinen

jarkko

1 year ago

Reply to @jarkko

@jani The ideal would be something that is just a notmuch search query, because the :query command in aerc can create a virtual folder from the matching messages. Then I could just add a binding to aerc.conf and would not need a script in the first place.

David Bremner

bremner@mathstodon.xyz

1 year ago

Reply to

@jani @jarkko not really, other than that I don't have any reliable way (not involving me reading both messages) to tell if two messages are really the same, or just have the same message-id because some "enterprise" system re-uses the same message-id for every message.

Jarkko Sakkinen

jarkko

1 year ago

Reply to @bremner@mathstodon.xyz

@bremner @jani I’m wondering here: did ~= in mutt do a full body compare…

I’m also thinking that as aerc has this:

:query [-a <account>] [-n name] [-f] <notmuch query>
Create a virtual folder using the specified top-level notmuch query.
This command is exclusive to the notmuch backend.

[man aerc]

Maybe one feasible possibility would be to implement a :duplicate command directly in aerc, which would similarly create a virtual folder for duplicates and then let you decide their fate interactively (as with filter and query). This could potentially also do a full body compare, since it would be implemented in Go inside aerc.

Jarkko Sakkinen

jarkko

1 year ago

Reply to @jarkko

@bremner @jani mutt just has a hash table of message IDs that it uses (pattern.c, thread.c and hash.c in its source tree).
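
The idea of a hash table keyed by message ID can be illustrated with an awk associative array. This is only a sketch of the technique, not mutt's code. The function name `flag_duplicate_ids` and the input format (one "message-id, tab, path" pair per line) are assumptions made for the example.

```shell
# Illustration only; this is not how mutt is implemented internally.
# Input:  one "message-id<TAB>path" pair per line.
# Output: the path of every message whose ID was already seen earlier.
flag_duplicate_ids() {
  # seen[] is awk's hash table; the post-increment is zero (false) the
  # first time an ID appears and non-zero on every repeat.
  awk -F'\t' 'seen[$1]++ { print $2 }'
}

# Made-up sample: <a@x> appears twice, so only its second path is flagged.
printf '<a@x>\tinbox/1\n<b@x>\tinbox/2\n<a@x>\tlists/3\n' | flag_duplicate_ids
```

Like the ~= pattern discussed here, this compares message IDs only and says nothing about whether the message contents match.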

David Bremner

bremner@mathstodon.xyz

1 year ago

Reply to @jarkko

@jarkko @jani sure. De-duplicating message-ids is easy. De-duplicating messages is hard, if you are worried about false positives.
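
One rough way to narrow down false positives is to compare message bodies as well as IDs. The sketch below is a heuristic under stated assumptions, not something notmuch or mutt does: it assumes LF line endings, treats everything after the first empty line as the body, and relies on `sha256sum` from coreutils. `body_hash` and `same_body` are hypothetical helper names.

```shell
# Hypothetical helper: hash only the body (everything after the first empty
# line), so differing Received: or mailing-list headers are ignored.
body_hash() {
  sed '1,/^$/d' "$1" | sha256sum | cut -d' ' -f1
}

# Prints "same" if both files have byte-identical bodies, "different" otherwise.
same_body() {
  if [ "$(body_hash "$1")" = "$(body_hash "$2")" ]; then
    echo same
  else
    echo different
  fi
}
```

Two copies whose bodies differ only by a list footer or a re-encoding would still be reported as different, so this can only confirm the obvious duplicates. The rest still needs a manual look.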

Jarkko Sakkinen

jarkko

1 year ago

Reply to @bremner@mathstodon.xyz

@bremner @jani Right, in mutt it was nice because you could limit the view with ~= based on message IDs, quickly delete the most obvious ones and check the rest manually.

So, can notmuch form a query that would be 1:1 match to what ~= does in mutt? In that case I can use that together with :query command in aerc.

Jarkko Sakkinen

jarkko

1 year ago

Reply to @jarkko

@bremner @jani Anyway thanks for the comments! I now at least know what the problem is so this helped.

David Bremner

bremner@mathstodon.xyz

1 year ago

Reply to @jarkko

@jarkko @jani I think what Jani showed is the closest we get. It is unfortunately not part of the query language proper, so needs the CLI.
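
The CLI feature referred to here is presumably the `--duplicate=N` option of `notmuch search`. Jani's message is not reproduced in this thread, so that is an assumption. With `--output=messages`, that option lists only the message IDs that have at least N files. The sketch below folds such a list into a single query string that could be given to aerc's `:query`. `ids_to_query` is a hypothetical helper, and it assumes the IDs need no extra quoting.

```shell
# Hypothetical helper: joins "id:..." terms (one per line on stdin) into a
# single notmuch query of the form "id:a or id:b".
ids_to_query() {
  awk 'NF { printf "%s%s", sep, $0; sep = " or " } END { if (sep) print "" }'
}

# Intended use (requires notmuch, so it is left as a comment):
#   notmuch search --output=messages --duplicate=2 '*' | ids_to_query
# Made-up sample input standing in for the notmuch output:
printf 'id:a@example.org\nid:b@example.org\n' | ids_to_query
```

On a large mailbox the resulting query may become very long, so this is more a proof of concept than a replacement for a real duplicate query.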

Jarkko Sakkinen

jarkko

1 year ago

Reply to @bremner@mathstodon.xyz

Edited 1 year ago

@bremner @jani I'm actually at least looking into whether I could PoC it as a feature in the aerc code base. I have to grow some motivation first, as I've never used Go for anything ;-)

I did read the mutt implementation through and it is not really rocket science.

I also realized (based on this discussion) that the interactive flow in mutt was the thing (to address also false positives issue) and that's why I liked it.

David Bremner

bremner@mathstodon.xyz

1 year ago

Reply to @jarkko

@jarkko @jani You can see in

https://git.notmuchmail.org/git?p=notmuch;a=blob;f=notmuch-search.c;h=327e144564de48e0b339036528505d5a227bc40a;hb=HEAD#l579 that what we do is pretty simple-minded: scan the list of file names for a given message-id.

Jarkko Sakkinen

jarkko

1 year ago

Reply to @bremner@mathstodon.xyz

@bremner @jani thanks!

Jarkko Sakkinen

jarkko

1 year ago

Reply to @jarkko

@bremner @jani

OK, so aerc has {{.MessageId}}, which can be passed e.g. to :term b4 am {{.MessageId}}. In any command, the parser substitutes it with the message ID of the currently selected message. aerc also has notmuch support.

So I should make a notmuch query (which aerc should support) like id:{{.MessageId}} and see if that gives me the set of messages with the same message ID.
