[otlLib] Design goals and API #468

behdad · 2016-01-19T12:23:52Z

Starting this thread, to discuss what we want otlLib to be, while it's still tiny and easy to change. I'll put in my thoughts as additional comments.

@brawer @anthrotype @moyogo

behdad · 2016-01-19T12:31:46Z

So, one thing I think we can all agree on is, we don't need a new set of types... So, let's agree that otlLib API converts a set of Python structs (tuple, set, dict) to otData-defined objects and back.

Also, would be easier if the data that goes into each object, is fully defined by one item. Ie, buildSingleSubst() etc are already like that: they take one parameter (mapping) that contains all the data for this data structure. buildAnchors() on the other hand, is currently not like that, as it has this API:

def buildAnchor(x, y, point=None, deviceX=None, deviceY=None)

I think it's fine to continue using named arguments, but perhaps extend it to also have a one-argument mode. Eg, change it to:

def buildAnchor(pos, point=None, deviceX=None, deviceY=None)

where pos is an iterable. If it has two items, those will be x,y. If it has three, the third will be point and point argument must be None. If it has four, the third and fourth will be deviceX/deviceY.

Also, for simple types like Anchors, Values, Device, ..., the rest of the API will accept ot objects for this as well as raw data. Ie, in a buildCursivePos(), you can pass results of buildAnchor(), or pass the one-item input to buildAnchor() directly to buildCursivePos() and it will take care of it. Ie, we'll convert a tree of data items all with one call.

We then add "unbuild" API to reverse this and return the one-item data for each object type. This allows us to go from binary to a representation that is, for lack of a better word, lisp-like. There are many benefits to this: this representation is for the most part, immutable, can be directly compared and printed, and has no noise in there. It only encodes the semantics of the objects involved, no extra attributes, or other object stuff that needs to be filtered out. It lends itself very well to optimizations, merging, diffing, and other such routines.

behdad · 2016-01-19T12:36:47Z

CC @typesupply @adrientetar

behdad · 2016-01-19T13:14:02Z

Another discrepancy between feaLib and mtiLib is worth addressing in otlLib before we put much more code into it:

The current buildSingleSubst, buildMultipleSubst, ... build a subtable at a time. That's what mtiLib encodes, and this prevents certain optimizations from happening. In particular, when user calls buildSingleSubst() to build a subtable, there's only one way to do that, and the compiler uses Format 1 if all substitutions have the same glyphID delta, and Format 2 otherwise.
In feaLib however, it's up to the builder to break the lookup into as many subtables as needed, to produce most optimal output. Indeed, if you have a singleSubst lookup and have the font around, you can calculate the glyphID delta for all the substitutions, and if a significant portion have the same delta, encode those a separate subtable. We can (and should?) even have a compression parameter. The higher the compression number, the smaller the font size be, but slower to build, and slower to shape, so we shouldn't go crazy about it. Moreover, if we have this functionality, it's rather trivial to build a GSUB/GPOS optimizer on top of it.

So, this is what I like to suggest:

Rename buildSingleSubst(), ... to buildSingleSubstSubtable(). These will be very closer to what's in mtiLib (ie. no optimization whatsoever),
- Add buildSingleSubstSubtables(), that returns a list of subtables. It takes data in in the exact same format as the single-subtable case, but optimizes number of subtables. I don't know if the name is too confusing or not. I don't want to call it buildSingleSubstLookup(), as I don't think we want to build the actual ot.Lookup() here. That can stay one level up.

We'll then add buildLookup() that takes a tuple that has lookup-type ("singleSubst", "singlePos", etc), lookup flags + markfilteringset, and tuple / list of lookup subtable data items.

Comments?

brawer · 2016-01-19T13:28:48Z

Are you sure that mtiLib is correct? If the ValueRecords have different value formats, doesn’t mtiLib need to emit multiple subtables as well? (In SinglePos format 2, the format of all ValueRecords needs to be the same). To make this more concrete, I’ve implemented SinglePos according to what we had discussed last week.

behdad · 2016-01-19T13:38:40Z

Are you sure that mtiLib is correct?

Yes, you can just extend valuerecords with missing items set to 0/None. In fact, ot compiler layer does that. So, you just bitwise-or their value formats together. It works.

behdad · 2016-01-19T13:39:33Z

So what you implemented is what I called the optimizing one. Indeed, this whole issue that we need to separate the two was brought to my attention by you showing me your SinglePos code.

behdad · 2016-01-19T13:40:24Z

Are you sure that mtiLib is correct?

Yes, you can just extend valuerecords with missing items set to 0/None. In fact, ot compiler layer does that. So, you just bitwise-or their value formats together. It works.

Needless to say, this is not necessarily the optimal packing... But that's the way mti files are supposed to work. The author takes care of sorting similar adjustments into the same lookup / subtable.

brawer · 2016-01-19T13:41:58Z

Should otlLib._buildSinglePosFormat1/2 be public?

behdad · 2016-01-19T13:44:40Z

Yes. But I prefer that it chooses format automatically. If you see otTables.py, that's what SingleSubst, etc do, just in a different layer. You might have an optional Format that is set to None by default, but I don't see a need for that. Just encode the data in whatever one subtable format that is more compact.

behdad · 2016-01-19T13:45:20Z

That's what mtiLib does BTW. Please check parseSinglePos in mtiLib.init.py

brawer · 2016-01-19T13:52:21Z

Confused. If mtiLib just passed its mapping into otlLib.buildSinglePos(), won’t you get exactly what you want? If not, it might be best to hash this out in person...

brawer · 2016-01-19T14:20:30Z

We hashed it out over a VC. Will submit a series of changes.

behdad · 2016-01-19T14:21:08Z

Decided to name the version that creates one subtable, well, buildSinglePosSubtable(). And the one that returns a list of subtables, buildSinglePos().

#468

brawer · 2016-01-19T22:00:06Z

Should a function like otlLib.buildAttachList() return None for empty input? Personally I’d say yes: it simplifies call sites, which makes the library easier to use. See d1fd788 for an example. Same question for other tables.

behdad · 2016-01-19T22:20:33Z

I agree returning None is useful. Maybe have an argument allowEmpty=False? Or let's wait until someone requests.

Also, buildAttachPoint() can be public as well. Functions to build any OT type are acceptable API.

This simplifies call sites when building GDEF tables. Also, publicly expose the buildAttachPoint() function. #468 (comment)

behdad mentioned this issue Jan 19, 2016

Added otlLib #469

Merged

brawer added a commit that referenced this issue Jan 19, 2016

[otlLib] Implement otlLib.getSinglePosSubtable()

ce7cc43

#468

brawer added a commit that referenced this issue Jan 19, 2016

[otlLib] Return None for empty argument to buildAttachList()

1bb757e

This simplifies call sites when building GDEF tables. Also, publicly expose the buildAttachPoint() function. #468 (comment)

anthrotype mentioned this issue Jan 23, 2016

ft2fea-feature file dumper #479

Open

m4rc1e mentioned this issue Oct 4, 2017

gsub_diff: Compare LookupTypes 2, 3, 4, 7 notofonts/nototools#455

Open

behdad closed this as completed Jul 28, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[otlLib] Design goals and API #468

[otlLib] Design goals and API #468