14 comments

  1. Jim says:

    Nice article, thank you. I have a question though about WordPress and metadata. When WordPress makes versions of your images at smaller sizes, does the metadata, like copyright and author, stay with the new images? Often it’s a smaller version used in a page, so it wouldn’t be good if those data were missing.

  2. Carl Seibert says:

    Hi Jim,

    >so it wouldn’t be good if those data were missing.

    Darn right!

    Whether WordPress honors metadata on the various versions of the file create when you upload to Media Library depends on imaging library your server is using. By default, WordPress will use ImageMagick, if it is installed and activated. If that’s the case, you’re good to go. All your versions will have proper metadata. There’s a bunch of information on this in this post: https://www.carlseibert.com/wordpress-honors-metadata-sort-of/

    If ImageMagick isn’t on your server (Like, say, here. We haven’t got it set up on the server that serves this site yet, sadly.) WordPress will use the GD library (that’s the name, not what I think about it) that ships with PHP. GD destroys metadata. Doesn’t do as well with images themselves, either. If you’re stuck with GD, there is a workaround, which is described in this post: https://www.carlseibert.com/wordpress-metadata-workaround/

    Upcoming, I plan on doing a chart of hosting providers that support ImageMagick.

    • Carl Seibert says:

      Good work! I took the liberty of saving a copy of your chart for future reference.

      It’s a pity Microsoft appears to be making progress in the wrong direction.

      When I was working on my Creative Commons posts, I came across several documents from about the time Adobe released XMP as an open standard that were all full of optimism about how everybody was going to migrate to XMP and it would all make sense – very soon – from that point in 2009 or so. Microsoft was mentioned. Of course, it hasn’t exactly worked out that way 🙂

      • Sure, it is a Google Docs document, I am still finding out some behaviors and revise it. Most recently I added some old Digital Image Library keyword tags which WPG also reads, and then, interestingly, moves them to xmp when edited.

        For quite some time I have been digitizing all of my family’s photos, painstakingly cataloging and adding metadata as a hobby. I started using Microsoft Digital Image Suite’s Library app which later evolved into Windows Photo Gallery, which I still use today even if it is now unsupported and adding “People Tags”. I also use GeoSetter, which I highly recommend for geotagging and also adding other metadata. Among the things which I like about GeoSetter is that it uses exiftool and allows you to execute post editing exiftool commands which I use to ensure certain tags are in sync.

        I post from time to time about my metadata struggles on my blog – https://jmoliver.wordpress.com

  3. Joep Bord says:

    Now that I want to start a database for my pictures I came across your fine article. I just started to struggle in comparing all kind of programs. I need two things. A good metadata system and a good program to treat my ARW and CR2 RAW-files with the possibility of local editing. I prefer to combine those two wishes. For metadata I followed your advise to use DigiKam or XnView.
    I found out that I can use Darktable for RAW editing. The problem is that it is very hard to let this programs interchange data with DigiKam. Darktable reads all the keywords, titles etc from Digikam, but after changing/removing a keyword in Darktable, Digikam doesn’t read that back.
    In your comparison you didn’t mention Darktable which is an interesting program.
    What do you think of this program and do you know a solution for this problem It can be one program that does both or a good combination of two programs. What about Capture One for example. Quite expencive for private (but serious) use, but cheaper then Lightroom

    • Carl Seibert says:

      Hi Joep,

      I’ve only briefly played with Darktable, so I’m of no help there.

      Since you mention Capture One, I’m going to guess that you’re on Windows or Mac. In that case, I would commend ON1 Photo RAW to your consideration. It’s an all-in-one, like Lightroom or Capture One, but it has some advantages that I take seriously over either. By default, it does not depend on a central database for its non-destructive editing information. It handles that in sidecar files. That means that you can work on photos and archive them any way you want without having to worry about that database. ON1 is completely standards-compliant with metadata, and its metadata functionality is efficient to use. On the archiving side, you simply set watch folders and it catalogs anything that’s put in them (like the way DigiKam works). I like its RAW editing, too. It’s the program I actually use. I don’t usually use it alone. I do a lot of stuff in Photo Mechanic, which works perfectly with ON1. ON1 is inexpensive. Right now, it’s only $100 USD, and frequently they have it on sale for even less.

      -Carl

      • joep Bord says:

        Hi Carl
        Thanks for your advise. Yes I am on Windows. I will try ON1. BTW I found out that Darktable also makes sidecar files like DigiKam. Although the XMP files have a different structure, there is an interchange of tags/labels between the XMP files of Darktable and DigiKam. You can find the tags made in DigiKam in the XMP files of Darktable and vice versa. But Darkroom itself doesn’t show up with the tags and titles made in DigiKam.
        Both DigiKam and Darktable have problems with connecting my Sony a7III.
        Darktable and camera work reasonable with tethering but not with bringing files direct from camera to the program/computer.
        Joep

      • Joep Bord says:

        Dear Carl,
        I followed your suggestion and tried ON1. It was worthwile a try. ON1 does a perfect job considering metadata and has some cool editing features. The problem with ON 1 is that it is to slow in editing and uses to much computer capacity. The CPU is used often for about 95 % while the GPU is used max 25 %. At this moment I prefer other programs for editing in combination with Digikam or XnView for searching. I hope ON1 will develop soon as a perfect solution. It is promising but not yet there.
        Joep

        • Carl Seibert says:

          Indeed, ON1 is not the fastest bunny in the race. I find it exports significantly slower than Lightroom and (along with every other program, WAY slower than Capture One.) Although, frankly, the slow export hasn’t really bothered me. I don’t see it hogging machine resources on my laptop. Right after starting the program, while it is updating my cataloged folders (of which I have a fair number), it will make the fans spin fast. But it that scan is a low priority process and it doesn’t seem to affect anything, including ON1 itself. Good news is that they make it faster with each release. So, maybe someday….

  4. hackerb9 says:

    Thanks for the informative post!

    I am writing a command line program to quickly view and delete images
    within a terminal window. I wanted to put in the ability to edit
    comments, but had no idea what a can of worms I was opening. After
    reading the IPTC recommendations I was even more confounded —
    “headlines” sound more like what I call “titles” and “titles” are what
    I’d call “filenames” — but this blog post did help, somewhat.

    I do have a couple questions (for you or your knowledgable readers):

    1) What is your suggested order for reading metadata? I was going to
    go with XMP > IIM/IPTC > EXIF, but it seems like I’d be in the
    minority. And, since some programs you mentioned cannot actually
    write XMP, it seems like the EXIF or IPTC data has a better chance
    of being current.

    2) Which tags should I make it easy for people to edit and what should
    I call them? “Caption” (AKA “Description”) seems important, but
    what about “Comment”, “Title”, “Headline”, “Subject”, “Label”, etc?
    Which ones do people actually use? And how do the tag names people
    are familiar with map to the underlying XMP/IPTC attributes?

    • Carl Seibert says:

      Hi,

      Thanks for reading!

      For a reading order, I usually recommend a simple XMP > IIM > Exif. The machinations in the Metadata Workling Group guidelines of 2010, in addition to being complicated, were predicated on assumptions that don’t make sense anymore. (To me, anyway.) Back then, it was assumed that the world would move quickly to XMP and IIM would soon be deprecated. Hasn’t happened. In that world, it was assumed that rogue, out of sync, data would generally be written by an archaic IIM-only-writing program. In today’s world, out-of-sync data is just as likely to be written in the XMP. Exif has always been a political football.

      My logic is that XMP doesn’t pose tag length restrictions and is thus more likely to be authoritative than the possibly truncated IIM. If any descriptive data exists in the Exif, it was probably put there by the camera, would most likely have been superseded by, or transferred to, the IPTC fields. If it’s out of sync, it’s pretty likely NOT to be as authoritative as the more recently written IPTC. Few programs can read it anyway. It’s just not where descriptive data is SUPPOSED to be.

      I have a post here describing all the fields and, IIRC, their many and various names through the years. I have to confess I don’t do much to help the confusion. I often refer to fields by their traditional names, rather than their proper ones.

      (It also doesn’t help that ExifTool uses it’s own naming convention, with IIM fields usually called by their old school names and XMP ones by the new names. The fields, of course, are the very same fields. The field is one entity in the schema; it’s just written in duplicate.

      Fields that I think matter: Caption/Description is obvious. As are the Creator and Copyright fields.

      (Google Images supports Creator, Copyright, and Credit, BTW. Credit is poorly understood and rarely used outside of the publishing and stock photo industries. That Google chose to include it is a testimony to the fact that they were, shall we say “urged” to do the right thing by a pending lawsuit by a company entrenched in both of those industries.)

      Keywords is important, too. I would avoid dallying with the special Lightroom hierarchical keywords field. Just the normal ones will do fine. It’s very important to always put keywords in the standard IPTC place. There are unfortunately programs out there that write them to idiosyncratic places, causing tragic data losses when people migrate away from them.

      Title/Object Name and Headline are interesting in that they are used differently by different organizations. Headline is supposed to be just exactly that. Object Name is supposed to be a “slug”, a human-readable not necessarily unique identifier for the image. But usage in the real world is all over the place. I doubt most individual photographers use either one of them.

      “Subject” is the XMP place for keywords. It is synchronized with the IIM “Keywords” field. The pair are always labeled “Keywords”.

      Rating and Label are the fields that carry the star rating and color label fields that most GUI programs use. Both live in the XMP. Image Rating is an official IPTC field and its use is standardized. Label is not part of the standard and its use is all over the place. I recommend using the Adobe Lightroom color set as it’s something of a lowest common denominator. (I covered labels in, I think, the Photo Mechanic 6 post or video.)

      Comments is actually not an IPTC field. It’s in the Exif and is not terribly widely supported. It makes no sense, frankly. The Exif is generally written to by the camera and how is a camera going to comment on anything? The closest thing to a real comments field is Special Instructions. (Which, refreshingly, is never called anything else.)

      Now that I’ve brought it up, there are three Exif fields that are semantically equivalent to IPTC ones, “Artist”, which is Creator by yet another name, Copyright, and Description. Caption/description is useless because cameras can’t write anything useful there. Cameras can do just fine with Copyright and Creator because those are standing values, the same for every picture. But information in those fields doesn’t do much good unless it is later transferred to the proper IPTC fields.

      The IPTC standard has the current official names for all of the fields and you can glean from it which fields are synchronized between the IIM and XMP. (There’s actually a chart in the standard. But it omits a couple of obscure fields that are actually synchronized, so it isn’t as authoritative as poring over the field descriptions themselves.)

      There is a link here to download the current IPTC standard. Just use the search function.

      And I would be happy to check over your work if you’d like.

      -Carl

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.