Choose a configuration approach for Aglyph¶
| Release: | 3.0.0.post1 |
|---|
Aglyph explicitly supports two methods of configuration:
- Declarative XML configuration conforming to the
Aglyph context DTD - Programmatic configuration via The Aglyph Context fluent API
Opinions vary widely on the merits of XML (particularly for configuration), but in fairness there’s also plenty of debate over the merits of “code as configuration” (or “configuration as code” if you prefer) as well. Aglyph strives to not have an opinion one way or the other by supporting either approach.
However, both approaches to Aglyph configuration have strengths and weaknesses that you should understand before choosing one over the other.
Declarative XML configuration¶
An Aglyph context can be defined in an XML document that conforms to the
Aglyph context DTD.
The XML context document is parsed by aglyph.context.XMLContext, which
is just a subclass of aglyph.context.Context that knows how to
populate itself from the parsed XML document. Once populated, this context
can then be used to create an aglyph.assembler.Assembler:
>>> from aglyph.assembler import Assembler
>>> from aglyph.context import XMLContext
>>> my_app_context = XMLContext("my-application-context.xml")
>>> assembler = Assembler(my_app_context)
Note
The
Aglyph context DTD is
provided primarily as a reference. The aglyph.context.XMLContext
class uses a non-validating parser by default.
Developers are encouraged to explicitly validate an application’s context XML document during testing.
See also
- Use a custom XML parser for XMLContext
- This recipe could also be used to force Aglyph to use a validating XML parser.
XMLContext configures mutable builtin objects safely¶
Consider the following example:
<?xml version="1.0" encoding="utf-8"?>
<context id="cookbook">
<component id="cookbook.Example">
<attributes>
<attribute name="mutable">
<list>
<int>1</int>
<int>2</int>
<int>3</int>
</list>
</attribute>
</attributes>
</component>
</context>
Because builtin list objects are mutable, Aglyph will automatically turn
the “mutable” attribute above into an aglyph.component.Evaluator
(which is very similar to a functools.partial). Whenever the
“cookbook.Example” component is assembled, the Evaluator for the
“mutable” attribute is called, which will produce a new list object.
Why is this important? Consider a corresponding programmatic configuration for the same component:
context.prototype("cookbook.Example").set(mutable=[1, 2, 3]).register()
This configuration leads to a (likely) logic error: all objects of the “cookbook.Example” component will share a reference to a single list object. An example illustrates the problem:
>>> example1 = assembler.assemble("cookbook.Example")
>>> example1.mutable
[1, 2, 3]
>>> example1.mutable.append(4)
>>> example2 = assembler.assemble("cookbook.Example")
>>> example2.mutable
[1, 2, 3, 4]
Uh-oh! That’s almost certainly not what we intended. To guard against this behavior, we would need to modify the binding:
from functools import partial
context.prototype("cookbook.Example").set(mutable=partial(list, [1, 2, 3])).register()
Now we will get a “fresh” list every time the component is assembled, so modifying the list on one instance will not affect the lists of any other instances.
(And what if we were actually specifying a list-of-list, or a tuple-of-list, or a list-of-dict? Now we would need to account for mutability of each member!)
This is an easy thing to forget, and can lead to a great deal of (programmatic)
configuration code, which is why
aglyph.context.XMLContext handles it automatically for any <list>,
<tuple>, and <dict> declared in the XML context document.
XMLContext is Unicode-aware and supports automatic character set conversion¶
Aglyph properly handles Unicode text and encoded byte data in XML context documents, regardless of Python version.
Aglyph can also provide your application components with byte data encoded to a user-specified character set.
Consider the following example:
<?xml version="1.0" encoding="utf-8"?>
<context id="cookbook">
<component id="cookbook.TextAndData">
<attributes>
<attribute name="text">
<unicode>ΑΦΔ</unicode>
</attribute>
<attribute name="data1">
<bytes>ΑΦΔ</bytes>
</attribute>
<attribute name="data2">
<bytes encoding="iso-8859-7">ΑΦΔ</bytes>
</attribute>
</attributes>
</component>
</context>
The first thing to notice is that <bytes>ΑΦΔ</bytes> is missing a character
encoding. This can be problematic on Python 2, because the default string
encoding used by the Unicode implementation is typically ASCII:
$ python2.7
Python 2.7.9 (default, Dec 13 2014, 15:13:49)
[GCC 4.2.1 Compatible Apple LLVM 6.0 (clang-600.0.56)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import sys
>>> sys.getdefaultencoding()
'ascii'
>>> from aglyph.context import XMLContext
>>> context = XMLContext("cookbook-context.xml")
Traceback (most recent call last):
...
UnicodeEncodeError: 'ascii' codec can't encode characters in position 0-2: ordinal not in range(128)
One solution would be to add the encoding= attribute. Alternatively, you
can instruct XMLContext to use a different default encoding (it uses the
value of sys.getdefaultencoding() by default):
$ python2.7
Python 2.7.9 (default, Dec 13 2014, 15:13:49)
[GCC 4.2.1 Compatible Apple LLVM 6.0 (clang-600.0.56)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from aglyph.assembler import Assembler
>>> from aglyph.context import XMLContext
>>> context = XMLContext("cookbook-context.xml", default_encoding="UTF-8")
>>> assembler = Assembler(context)
>>> text_and_data = assembler.assemble("cookbook.TextAndData")
>>> text_and_data.text
u'\u0391\u03a6\u0394'
>>> text_and_data.data1
'\xce\x91\xce\xa6\xce\x94'
>>> text_and_data.data2
'\xc1\xd6\xc4'
If we run the same example under Python 3 (which uses “UTF-8” as the default
encoding), we still get correct results, but without the need to explicitly set
the default encoding on the XMLContext:
$ python3.4
Python 3.4.2 (default, Nov 12 2014, 18:23:59)
[GCC 4.2.1 Compatible Apple LLVM 6.0 (clang-600.0.54)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from aglyph.assembler import Assembler
>>> from aglyph.context import XMLContext
>>> context = XMLContext("cookbook-context.xml")
>>> assembler = Assembler(context)
>>> text_and_data = assembler.assemble("cookbook.TextAndData")
>>> text_and_data.text
'ΑΦΔ'
>>> text_and_data.data1
b'\xce\x91\xce\xa6\xce\x94'
>>> text_and_data.data2
b'\xc1\xd6\xc4'
One important thing to notice is the difference in the types of the Unicode and byte strings, dependent upon which version of Python is used.
Unicode and character encoding differences between Python 2 and Python 3¶
The builtin str type has changed significantly between Python 2 and
Python 3 (see Text Vs. Data Instead Of Unicode Vs. 8-bit).
In short: str represented encoded byte data up to and including
Python 2, but representes Unicode text as of Python 3.0:
| Version | Unicode text | Encoded byte data |
|---|---|---|
| Python 2 | unicode |
str |
| Python 3 | str |
bytes |
The Aglyph context DTD defines <bytes>, <str>,
and <unicode> elements that correspond to the types in the table above, but
treats the element content differently depending on the version of Python under
which Aglyph is running:
| Version | <unicode> content |
<str> content |
<bytes> content |
|---|---|---|---|
| Python 2 | unicode |
str |
str |
| Python 3 | str |
str |
bytes |
To summarize the above:
<unicode>is interpreted as aunicodetype in Python 2 and astrtype in Python 3<str>is always interpreted as astrtype<bytes>is interpreted as astrtype in Python 2 and abytestype in Python 3
Note
For clarity in XML context documents, it is always safe to use <bytes>
for encoded byte data and <unicode> for Unicode text (regardless of
Python version), avoiding entirely the ambiguity of <str>.
Warning
Althoug the DTD permits an encoding=”…” attribute on <str> elements,
the attribute is ignored in Python 3 (a WARNING-level log message is
emitted to the aglyph.context.XMLContext channel if it is present).
Programmatic configuration using the Context fluent API¶
New in version 3.0.0.
Objects of aglyph.context.Context now support a chained call “fluent” API.
Please refer to The Aglyph Context fluent API for details.
Custom configuration using Context¶
Do neither declarative XML nor fluent configuration suit your fancy?
An aglyph.context.Context is just a dict that maps component ID
strings (i.e. aglyph.component.Component.component_id) to
aglyph.component.Component instances, so embrace the open source
philosophy and “roll your own” configuration mechanism!
Note
Look at aglyph.context.XMLContext for a starting point - it’s just a
aglyph.context.Context subclass that populates itself from a parsed
XML document.