Difference between revisions of "Context"

From DocDataFlow
Jump to: navigation, search
(Created page with "A context is a collection of data that is relevant for a particular ''granule''. For example, when a text frame in some document is represented by a Granule|''...")
(No difference)

Revision as of 01:16, 28 December 2013

A context is a collection of data that is relevant for a particular granule.

For example, when a text frame in some document is represented by a granule for processing inside a Crawler personality, it is accompanied by its context.

That context will include information like:

  • what page is the text frame on?
  • what is the text frame position on that page?
  • what document is that page on?
  • ...

The data relating to the text frame is split into two parts: the granule itself, with its own raw data ‘inside’, and additional information about the granule and its surroundings in the context.

The context contains all the other data that’s not part of the granule, but is relevant to it.

Contexts are arranged into a hierarchy.

When we look at a text frame granule, it will probably be a sub-granule of a larger ‘page’ granule. The page granule itself is a sub-granule of a larger ‘document’ granule.

Each of those granules will have its own context. There will be a context for the document granule, and another context for the page granule.

The page context will be a subcontext of the document context: i.e. the page context will include all info from the document context, plus its own specific data.

The text frame context will be a subcontext of the page context: i.e. the text frame context will include all info from the page context, plus its own specific data.

The various adapters in a workflow will often pass information to one another by means of the context.

During the Crawler process, we'll often refer to certain information by name. For example, when processing a template snippet the template text contains placeholders, like $$XPOS$$.

Such placeholders are interpreted within the relevant context. A single snippet will normally be used to process many individual granules; each of the granules will come with its own context, and placeholders like $$XPOS$$ will be replaced by different values every time, depending on what the context dictates for the value of XPOS.

If a certain placeholder is not defined within a particular context, Crawler will check the parent context, and the parent's parent and so on.