Could not insert metadata for JPEGs with bad UTF-8: log spam
I seem to have some JPEGs that cause tracker to choke - they contain metadata with bad UTF-8. The logs are getting spammed with this problem every 20s or so which I had hoped was solved in #6 (closed) but guess not(?)
Tracker versions:
tracker-2.1.1-1.fc28
tracker-miners-2.1.1-1
In the logs, I get lots of these:
Aug 23 10:58:03 heyho tracker-extract[5374]: Could not insert metadata for item "file:///home/me/Pictures/Test/HPIM0064.JPG": 41.43: invalid UTF-8 character
Running tracker extract
against the file by hand with env var TRACKER_VERBOSITY=3 you can see the bad stuff in urn:equipment (7th line):
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix nmm: <http://www.tracker-project.org/temp/nmm#> .
@prefix nie: <http://www.semanticdesktop.org/ontologies/2007/01/19/nie#> .
@prefix nfo: <http://www.semanticdesktop.org/ontologies/2007/03/22/nfo#> .
<urn:equipment:Hewlett-Packard:HP%20Photosmart%20M22%20(V01.00)%20d%FD%ED%DE%FF%01:> nfo:manufacturer "Hewlett-Packard" ;
nfo:model "HP Photosmart M22 (V01.00) d����" ;
a nfo:Equipment .
<file:///home/shirkin/Pictures/Test/HPIM0064.JPG> nmm:exposureTime 0.016666666666666666 ;
nfo:equipment <urn:equipment:Hewlett-Packard:HP%20Photosmart%20M22%20(V01.00)%20d%FD%ED%DE%FF%01:> ;
nmm:fnumber 2.7999999999999998 ;
nmm:isoSpeed 100 ;
nmm:flash nmm:flash-on ;
nmm:meteringMode nmm:metering-mode-center-weighted-average ;
nie:contentCreated "2006-11-19T00:35:23-0800" ;
nfo:verticalResolution 300 ;
nfo:horizontalResolution 300 ;
nfo:width 1728 ;
nmm:focalLength 6.0999999999999996 ;
nmm:dlnaProfile "JPEG_LRG" ;
a nfo:Image , nmm:Photo ;
nmm:dlnaMime "image/jpeg" ;
nfo:height 2304 ;
nmm:whiteBalance nmm:white-balance-auto .