Schematron Explained

Schematron is a rule-based validation language for making assertions about the presence or absence of patterns in XML trees. It is a structural schema language expressed in XML using a small number of elements and XPath languages. In many implementations, the Schematron XML is processed into XSLT code for deployment anywhere that XSLT can be used.

Schematron is capable of expressing constraints in ways that other XML schema languages like XML Schema and DTD cannot. For example, it can require that the content of an element be controlled by one of its siblings. Or it can request or require that the root element, regardless of what element that is, must have specific attributes. Schematron can also specify required relationships between multiple XML files. Constraints and content rules may be associated with "plain-English" (or any language) validation error messages, allowing translation of numeric Schematron error codes into meaningful user error messages. Users of Schematron define all the error messages themselves.[1]

The current ISO recommendation is Information technology, Document Schema Definition Languages (DSDL), Part 3: Rule-based validation, Schematron (ISO/IEC 19757-3:2020).

Uses

Constraints are specified in Schematron using an XPath-based language that can be deployed as XSLT code, making it practical for applications such as the following:

Adjunct to Structural Validation: By testing for co-occurrence constraints, non-regular constraints, and inter-document constraints, Schematron can extend the validations that can be expressed in languages such as DTDs, RELAX NG or XML Schema.[2]
  • Lightweight Business Rules Engine: Schematron is not a comprehensive, Rete rules engine, but it can be used to express rules about complex structures with an XML document.
  • XML Editor Syntax Highlighting Rules: Some XML editors use Schematron rules to conditionally highlight XML files for errors. Not all XML editors support Schematron.
  • Versions

    Schematron was invented by Rick Jelliffe while at Academia Sinica Computing Centre, Taiwan. He described Schematron as "a feather duster to reach the parts other schema languages cannot reach".

    The most common versions of Schematron are:

    Schematron as an ISO Standard

    Schematron has been standardized by the ISO as Information technology, Document Schema Definition Languages (DSDL), Part 3: Rule-based validation, Schematron (ISO/IEC 19757-3:2020).

    This standard is currently not listed on the ISO Publicly Available Specifications list. Paper versions may be purchased from ISO or national standards bodies.

    Schemas that use ISO/IEC FDIS 19757-3 should use the following namespace:http://purl.oclc.org/dsdl/schematron

    Sample rule

    Schematron rules can be created using a standard XML editor or XForms application. The following is a sample schema: Date rules ContractDate should be in the past because future contracts are not allowed.

    This rule checks to make sure that the XML element has a date that is before the current date. If this rule fails the validation will fail and an error message which is the body of the assert element will be returned to the user.

    Implementation

    Schematron schemas are suitable for use in XML Pipelines, thereby allowing workflow process designers to build and maintain rules using XML manipulation tools. The W3C's XProc pipelining language, for example, has native support for Schematron schema processing through its "validate-with-schematron" step.

    Since Schematron schemas can be transformed into XSLT stylesheets, these can themselves be used in XML Pipelines which support XSLT transformation. An Apache Ant task can be used to convert Schematron rules into XSLT files.

    There exists also native Schematron implementation, like the Java implementation from Innovimax/INRIA, QuiXSchematron, that also do streaming.

    See also

    External links

    Notes and References

    1. Book: Siegel, Erik . Schematron: a language for validating XML . 2022 . XML Press . 978-1-937434-81-6 . Denver, CO.
    2. Schematron - More useful than you'd thought. Philip. Fennell. June 2014. XML London 2014. 10.14337/XMLLondon14.Fennell01. 103–112. 978-0-9926471-1-7. free.