Duplicate emails with different text body
Hi there all,
I am having an issue with deduplicating two email messages. We use and
MD5 hash of various mapi properties, including the text version of the
body of the email.
One of the messages comes from a PST file, the other is exported
directly from an exchange EDB file.
The messages are identical in every sense, same sent date and time,
same author, same recipients same subject and the RTF version of the
email is exactly the same.
However, the text version of the body of the email differs slightly in
that the section of the email that refers to "-----Original
Message-----" in one instance is preceded by "" characters, and in the
other instance is not.
Can anyone explain how, after the message has been sent from a
particular user that the format of the body of the message can change
in this way?
To make it clearer, one email is in the sent items of the originator of
the message, the other message is in the inbox of the recipient of the
message.
Any help appreciated.
Many thanks,
Martin.
|