Skip hidden Microsoft Word files named "~$lename.doc"
Steps to reproduce:
- Have a folder with a few files that have previously been created by Microsoft Word. For a file named
filename.doc
, it created files with the name~$lename.doc
. Those files are hidden under Windows and seem to be some kind of cache for the author or something similar. They do not contain useful data that is not in the originalfilename.doc
file. Those files start with hexadecimal0x14
and have a size of e.g. 162 Bytes. - Have tracker scan the file
- Have a look into syslog.
What happens:
Tracker raises this warning which it writes to syslog:
tracker-extract[41270]: Task for 'file:///home/[username]/path/to/~$lename.doc' finished with error: Could not get any metadata for uri:'file:///home/[username]/path/to/~$lename.doc' and mime:'application/msword'
What should happen:
Tracker should skip the file because it should be hidden anyways, may be left over from already deleted/moved .doc
files and contains no data that is not in the .doc file it belongs to. As a result, tracker should not raise a warning.
Additional info:
This should also be applied to the other binary file formats from other Microsoft Office applications such as Excel (.xls
), PowerPoint (.ppt
) and maybe other file formats. Here is a list of file format specifications.
For ~$lename.docx
files, this issue is not present. This may be due to the fact that tracker tries to parse the file as XML which will immediately fail.
Installed software versions:
libgsf-1.14.47-6.fc37.x86_64
tracker-3.4.1-1.fc37.x86_64
tracker-miners-3.4.1-1.fc37.x86_64
glib2-2.74.1-2.fc37.x86_64
Edited by Christian Stadelmann