Re: [Gimp-developer] GIMP's future internal image data format



> I don't want to hijack Alexandre's thread with the interesting
> discussion that started therein, so here's a new thread for it.

I think you did a good thing by creating this thread.

> 2012/1/27 Bogdan Szczurek<thebodzio gmail com>:
>> W dniu 12-01-27 10:16, Alexandre Prokoudine pisze:
>>> On Fri, Jan 27, 2012 at 11:01 AM, Martin Nordholts wrote:
>>>> Images shall always be composed in 32-bit floating point
>>>> RGBA
>>
>> Which RGB? Is it scRGB of GEGL "guts"? :)
>
> Hi thebodzio
>
> In the end, yes, GIMP's native image data format will likely be 32 bit
> floating point per component/channel, pre-multiplied linear light RGBA
> with the same primaries and white point as scRGB (and consequently
> sRGB).

I thought so, but wasn't sure. Anyway… I *wanted* to be sure, since graphics editing without a solid colorimetric interpretation is just child's play :).

>>>> and then have suitable filters and export mechanisms to deal with
>>>> grayscale and indexed images.
>>
>> …and 16 bits per channel… and 8 bits per channel… and bitmaps (1 bit)… and
>> multichannel… and CMYKs (I wrote some time ago about them on this list)…
>>
>> I see two problems with that.
>>
>> First is memory use. I'll give an example from my own backyard about
>> bitmaps. Contrary to what may appear, bitmaps are still a very important
>> image mode. At my work I use them as both a small and precise means for
>> preparing and storing scanned pages of old books destined to be reprinted.
>> Given I'll be compelled to use 32 bits for each pixel, I'll practically
>> "waste" 31 bits. So my images (of course, at the time of manipulating them
>> in GIMP) will take 32 times more space than necessary. It may seem
>> harmless, but take into account that such bitmaps are oftentimes of sizes
>> about 8000 x 10000 pixels. Pretty much the same crude calculations are
>> true for other "pixel depths" as well (8 bits – 4 times, and so on).
>
> First, I must point out that your arguments need to be derived from
> the GIMP product vision [1], not personal requirements. Saying "I need
> GIMP to do X" is not a valid argument for adding support for X in
> GIMP, you need to say "GIMP needs to do X to fulfill the product
> vision".

What I'm trying hard to say is: "I need GIMP to do X, *because*…" :). You're right – my opinion is personal and it is intended as such. I speak from my experience with graphics and design in the printing business (a tiny company as it is, but still :)). I see it as a discussion between a craftsman and a tool supplier. I need "a hammer shaped this way". The tool supplier answers, "Okay, but I think it would be better to adjust the shape". So a discussion follows :).

At the same time, I can't emphasize enough that I'm by no means trying to usurp the position of holder of the "absolute truth". What's more, I'm strongly against such a stance, so if I'm proven wrong or can't convince others… so be it – maybe in time I'll be able to provide stronger arguments. Anyway, it would be wrong of me not to try to help in the development of an app I see as one of the most important for the whole OSS movement. It would also be selfish not to share my experiences and findings.

Now… excuse me if I sound a bit heretical or provocative (I simply don't know how else to ask this question), but… what exactly is GIMP's vision? I mean, I've seen the roadmap, the features, the short info on gimp.org and the developer wiki, yet I couldn't find any piece of text that could pass as *the* guiding statement of GIMP's development. Granted, I didn't do a thorough investigation, only a quick look into the matter, but still. Maybe there is something I missed – in that case I'm sorry to bring it up – but maybe GIMP's vision exists only as an unwritten idea: clear to devs and frequent followers of this list, but not so much to the rest of the world. If the latter is true, then maybe verbalizing "GIMP's vision" would be a good thing to do? Even if only to put an end to questions like mine.

> Otherwise GIMP is doomed to begin (keep?) delivering an
> inconsistent user experience in some areas.

I agree that a "fixed point" is needed, but as I understand it, it becomes "fixed" as the result of a discussion – hence my post. My arguments may be found insufficient or unconvincing – I'm ready to face that – but I'm sure that at least they'll provide some point of view and provoke a bit of "thinking" :).

> There are two solutions to the memory usage problem:
>
> 1. Buy more RAM. GIMP does not need to run well on a 512 MB RAM
> machine because it is reasonable to expect users of a high-end photo
> manipulation program to acquire sufficient amounts of RAM. NOTE: I'm
> not saying GIMP should be wasteful with memory...

I agree completely! I know RAM is easily accessible at a reasonable price, but, as you've said – that's no reason to be wasteful with it. The latter, especially, is my concern.

I want to make a small "reasoning". If each image is to be stored (stored all the time – not only "converted" to 32-bit fp during manipulation) as 32-bit fp RGBA (the specific colorimetry being a non-issue here), then each pixel ("light" sample) will be represented as a quadruplet of 32-bit values, so each pixel will require 16 bytes (I assume octet bytes, of course). Now I want to consider a 24 MP image of size e.g. 6000 x 4000 px. I think such a size is reasonable to assume, since full-frame DSLRs deliver images of about this size at most. A simple calculation gives: 6000 x 4000 x 16 = 384 000 000 bytes (about 366 MiB). That's just the "raw" image. But now… add to that a couple of layers ("static" or "non-destructive"), some undo levels… Quite recently I had a scanned map to process (RGB). It wasn't the biggest material I've had in my hands, but even so it was about 18000 x 14000 px. It was a "flat" scan – no alpha – but since we intend to add alpha anyway, we can again count it in. If we used 32 bits fp per channel, we'd get 3.76 GiB (IMCAC) for starters – just to hold the image in RAM. With such sizes, RAM is still an issue.
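For concreteness, the arithmetic above boils down to this (a quick sketch; the buffer layout of a real application is of course more involved, with tiles, mipmaps, etc.):

```python
def raw_buffer_bytes(width, height, channels=4, bytes_per_channel=4):
    """Bytes needed for one flat, uncompressed RGBA float buffer
    (no layers, no undo history, no tile overhead counted)."""
    return width * height * channels * bytes_per_channel

print(raw_buffer_bytes(6000, 4000))    # 384000000 bytes, ~366 MiB
print(raw_buffer_bytes(18000, 14000))  # 4032000000 bytes, ~3.76 GiB
```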

> 2. Make GIMP clever. If GIMP encounters a tile with only values 0.0
> and 1.0, the 32 bpc data can be transparently, i.e. without the user
> noticing, replaced with 1 bpc data. As soon as more bits of precision
> is required to avoid loss of data, GIMP can transparently convert the
> tile back to RGBA float. The same kind of optimization can be done for
> completely black, white and transparent tiles too.

This is promising, but an image mode is not just about keeping the image itself as small as possible. It's also about making sure I won't use any colors I don't want to use. E.g. when I'm editing grayscale, I don't want to use, say, red by mistake. A "smart" mechanism would just "convert" the image so it would be able to hold my red, and wouldn't even warn me about it. Why bother? It's not so much of a problem with "screen" graphics, but in print you know exactly that you can use only one colorant for this or that image (presumably process black), and that's it. If you don't keep your palette under control, you're asking for trouble.

The same story goes for bitmap images. If I'm correcting scanned text, I want to keep my colors constrained to white and black only. Antialiasing done with more colors looks nice on screen, but in print it'll cause the RIP to turn some pixels into halftones, and… there go your nice, sharp edges – the text will seem kind of "blurred". If I, by mistake, used a "soft" brush, I'd be in exactly that kind of situation. If colors aren't kept in order by the image mode (or palette constraints), I won't be able to notice some "gray" pixels, and "smart conversion" will put the image into grayscale. Sure, I can use some additional filters before saving, or even while saving/exporting, but in the former case I'd have to remember to do it, and in the latter it'll complicate the save procedure.

And finally… it can be a processing-intensive task.
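Just to make the discussion concrete, the transparent demotion Martin describes could look roughly like this (a sketch with made-up names – nothing like GEGL's actual tile backend):

```python
def classify_tile(samples):
    """Decide the cheapest lossless representation for a tile of float samples.

    Hypothetical helper: returns '1bpc' when every sample is exactly 0.0 or
    1.0 (so one bit per sample would suffice), otherwise 'float32'.
    """
    if all(s in (0.0, 1.0) for s in samples):
        return '1bpc'
    return 'float32'

print(classify_tile([0.0, 1.0, 1.0, 0.0]))  # '1bpc'
print(classify_tile([0.0, 0.5, 1.0, 0.0]))  # 'float32': one gray sample re-promotes
```

Note that the demotion is transparent in both directions: the moment one stray gray pixel appears, the tile silently becomes float again – which is exactly the "no warning" concern above.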

> Maintaining support in GIMP for an internal image data format of 8 bpc
> or adding support for an native image data format of 16 bpc is silly
> because such formats are going to result in rounding errors and lack of
> HDR support, which is not high-end.

HDR or not, most of the time it'll end up on an 8-bit-driven display anyway. Besides, OpenEXR uses exactly 16 bpc (a 10-bit mantissa, to be exact). HDR's power is not in images that inherently look better, but in a greater capability for processing them (e.g. about ±30 EV of dynamic range in OpenEXR!).
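The half-float figures are easy to verify with Python's struct module (the 'e' format code is IEEE 754 binary16, i.e. OpenEXR's "half"):

```python
import math
import struct

# Largest finite half (bit pattern 0x7BFF) and smallest positive *normal*
# half (0x0400); subnormals extend the range a bit further down.
largest = struct.unpack('<e', struct.pack('<H', 0x7BFF))[0]   # 65504.0
smallest = struct.unpack('<e', struct.pack('<H', 0x0400))[0]  # 2**-14

print(math.log2(largest / smallest))  # ~30 stops (EV) of dynamic range
```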

>> I understand "filters" and "export mechanisms" are to be basically means of
>> "cramming" information from fp pixels into "less precise" units. It means
>> it's very probable some information will be lost/distorted. It will happen
>> in more or less dramatic way, but the problem remains: what tools do we need
>> to give to user to enable them *complete* and *precise* control over
>> "conversion" results (meaning: this pixel should have *exactly* this or that
>> value). When you stick to "image modes" as your paradigm in app itself, you
>> don't have to worry much about it – user "gets what he sees" (granted he's
>> into proper CM workflow ;)) or at least have absolute control over sample
>> values right "on the canvas". Moving this control from "canvas" to "export"
>> will result in another layer of complexity with (probably) reduced
>> capabilities for the sake of "simplicity". Most of the time it'll probably
>> work (much like CM), but I think it'll be a hell to debug and enemy of
>> marginal, yet important use cases.
>
> If you want a pixel to be RGB u8 (128, 90, 90) when you export to a
> PNG8, simply paint that pixel RGB u8 (128, 90, 90). There are no
> problems for RGBA float to represent RGB u8. Maybe I don't fully
> understand what you mean, could you give a concrete and clear use case
> that illustrates a problem?

That's the problem too. Let's assume we have RGBA as the internal sample format. Each channel is 32 bits fp. It means the value of each channel is represented by (IEEE 754): 1 bit of sign, 8 bits of exponent, 23 bits of significand. Now, how do we interpret this value colorimetrically?

We can say: 0 – dark, 255 – bright (assuming an additive color model and leaving some less significant details aside). For values between 1 and 2, the float32 spacing is 2^-23 – all 23 explicit significand bits sit after the binary point. Between 254 and 255, however, 7 of those bits are consumed by the integer part, so the spacing grows to 2^-16 – only 16 bits of fractional precision. 16 bits still gives plenty of values, but fewer than 23, so the "subvalues" would be unevenly distributed across the range. Will it affect the resulting image, and when? I don't know, but it's good to take this into account and test it.

We can also state: 0 for dark, 1 for bright (maximum). In this case the "subvalues" would be distributed far more evenly across the usable range, but then again we'd leave most of the exponent's range practically unused – bits that could otherwise make the value even more precise. Having said all that, I wonder whether a) such precision isn't overkill, and b) we wouldn't be better off with integers.

Even if we disregard all else, let's assume we use RGBA, 32 bits fp per channel, and drop "color modes" altogether. What about some really useful color models, like Lab, XYZ, HSV, HSL, CMYK and also "multichannel"? Especially in the last two, there are situations when you need to set "this channel" to "that value exactly". If all data is RGB, how would we provide this possibility? We'd have to temporarily derive the channel value from RGB, modify it, and convert back to RGB. And all that assumes the conversions between RGB and the other color space are bijective – which in practice simply *isn't true* most of the time.
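A concrete illustration of the non-bijectivity, using the naive (non-ICC) device conversion – the function names and formulas here are mine, not from any real CMS:

```python
def cmyk_to_rgb(c, m, y, k):
    """Naive device conversion; real print workflows go through ICC profiles."""
    return tuple(round((1 - v) * (1 - k), 6) for v in (c, m, y))

def rgb_to_cmyk(r, g, b):
    """Naive inverse: maximum gray-component replacement."""
    k = 1 - max(r, g, b)
    if k == 1.0:                      # pure black: the CMY build is unrecoverable
        return (0.0, 0.0, 0.0, 1.0)
    return tuple(round((1 - v - k) / (1 - k), 6) for v in (r, g, b)) + (k,)

rich_black = (0.6, 0.4, 0.4, 1.0)    # a deliberate ink build for deep black
rgb = cmyk_to_rgb(*rich_black)       # -> (0.0, 0.0, 0.0)
print(rgb_to_cmyk(*rgb))             # (0.0, 0.0, 0.0, 1.0): the build is gone
```

Any deliberate rich-black build collapses to K-only black on the round trip – RGB simply has no room for the extra degree of freedom that a fourth ink channel provides.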

I really want to see at least CMYK and multichannel in GIMP. Without them, we can hardly think about using GIMP for serious (though rare) applications in publishing workflows. I won't say more about it (unless requested) – it was discussed before; maybe it's just not a GIMP development objective :).

> / Martin

Great to have constructive discussion with you!

My best!
thebodzio

