Search

Example query

A search query uses HTTP POST to provide an XML document containing the search criteria as well as information about the response to provide.

The request MUST have a Content-Type: application/xml header.

The following shows for example a search from the demo Bookshelf form:

<search>
    <!-- Free-text search query -->
    <query/>
    <!-- Structured search query -->
    <query name="title"
           path="details/title"
           label="Title"
           type="xs:string"
           control="input"
           search-field="true"
           summary-field="true"
           match="substring">Peace</query>
    <query name="author"
           path="details/author"
           label="Author"
           type="xs:string"
           control="input"
           search-field="true"
           summary-field="true"
           match="substring"/>
    <query name="language"
           path="details/language"
           label="Language"
           type="xs:string"
           control="select1"
           search-field="true"
           summary-field="true"
           match="substring"/>
    <drafts>include</drafts>
    <!-- Paging -->
    <page-size>10</page-size>
    <page-number>1</page-number>
    <!-- Language -->
    <lang>en</lang>
</search>

Query elements

The query element is the most complex element. It is used for full-text and structured search. Full-text search and structured search are exclusive: either Form Runner performs one, or the other.

Full-text search

If the user is performing a full-text search, the first <query> element (which does not have attributes) contains the text of the search. The result details to return is still determined by the subsequent <query> elements with summary-field set to true. See below for more information about the response format. The exact semantic of the full-text search is implementation-dependent.

Structured search

When building a structured search query, Form Runner first looks at all controls in the source form that have the class fr-summary or fr-search. These are used to build the request.

For each such control found, a <query> element is added to the request, with the following attributes and text value:

  • name attribute:
    • contains the control name as entered in Form Builder
  • path attributes:
    • contains the path to the XML element or attribute storing the data in XForms, relative to the root element. A forms may contain repeats, which means controls may have multiple values. The path expression generated by Form Runner, and provided to the persistence layer, contains [1] after element names, to guarantee it only returns one node. This is done to make it easier to implement the persistence layer for databases like MySQL that don't allow multiple values to be handled in a custom way. (Specifically with MySQL, if multiple values are found, extractValue always returns a space separated string.)
  • label attribute:
    • informative only
    • the control's label in the current language used by Form Runner
  • type attribute:
    • contains the datatype associated with the field in Form Builder
    • default: xs:string
    • other possible values include
      • optional/mandatory string: xforms:string/xs:string
      • optional/mandatory date: xforms:date/xs:date
      • optional/mandatory time: xforms:time/xs:time
      • optional/mandatory dateTime: xforms:dateTime/xs:dateTime
    • other values are possible, including XML Schema data types as well as user-defined data types
  • control attribute:
    • informative only
    • the XForms control type used in the form, including:
      • input
      • textarea
      • secret
      • select
      • select1
      • range (rarely used)
  • search-field attribute:
    • whether this is a search field
    • if true, the field might contain a search string entered by the user as text value
    • NOTE: Only the controls with class fr-search are shown to the user to enter search criteria.
  • summary-field attribute:
    • whether this is a summary field
    • if true, search must return a result detail value fo this field (assuming there are search results)
  • match attribute:
    • whether a substring or an exact match is requested
    • possible values
      • exact: an exact match is requested
      • substring: a substring match is requested
    • this is useful only for <query> elements with search-field set to true
    • exact is used only when querying the Form Builder summary page's app and form names, but this might be extended in the future
    • NOTE: The exact semantics for substring is not specified at the moment (the search implementation may use a "starts-with", a "contains" logic, or something else).
  • text value of the <query> element:
    • the search string entered by the user, if any
    • this is useful only for <query> elements with search-field set to true

Search paths

The path attribute can be looked at a search key. Say your documents looks like this:

<form>
    <personal-info>
        <first-name>John</first-name>
        <last-name>Doe</last-name>
        <birth-date>1980-01-01</birth-date>
    </personal-info>
</form>

Then the path personal-info/first-name matches XML elements following the XPath notation, relative to the root element of the document.

Note that for a given form definition, there is a limited number of search paths, which are fully determined by the XPath expressions present on the form definition's control bindings and model binds for controls that have the fr-summary or fr-search classes.

The paths sent by Form Runner are usually simple paths, but there are some exceptions:

  • Form Builder uses more complex paths out of the box, for example:

      xhtml:head/
        xforms:model[@id = 'fr-form-model']/
          xforms:instance[@id = 'fr-form-metadata']/
            */
              description[@xml:lang = $fb-lang]
    
  • Forms where user has entered XPath expressions manually to bind controls to XML data will contain those paths.

If the persistence layer is able to run XPath expressions (as eXist or other XML databases can), then the path provided by the search request can be executed. However, XPath evaluation is not absolutely required. For example with non-XML (relational, key/value) data stores, it is easier to consider the search path as an opaque search key that uniquely identifies the field to search.

Old query elements

  • 2013-05-24: Removal
    • All the previously deprecated elements mentioned below have been removed, and are not sent to the search API.
  • 2011-11-22: Deprecation
    • app and form elements:
      • these elements are deprecated and will be removed in the future. As of Orbeon Forms 3.9, they are present but empty
      • to obtain the app and form name being queried, extract them instead from the search URL
    • sort-key element:
      • this element was present but never used and will be removed in the future

Paging

The page-size and page-number elements control paging.

  • page-size attribute: how many results to return at the most
  • page-number attribute: page number to result, starting with 1

Language

The lang element contains the current language used by the user in the summary page. If the form contains language-dependent fields, this information can be used to refine the search. Only Form Builder makes use of this capability.

For example, Form Builder allows storing a form title and description in more than one language. When searching the form title field, the expectation of the user is likely to be that, if the user interface is in English, the English title will be searched and not, for example, the Italian title. In this case, the search path contains a special variable, $fb-lang, which allows the persistence implementation to discriminate on the language:

xhtml:head/
  xforms:model[@id = 'fr-form-model']/
    xforms:instance[@id = 'fr-form-metadata']/
      */
        description[@xml:lang = $fb-lang]

In persistence implementations that support XPath, the variable can be either scoped or replaced in the path by the value provided by the incoming <lang> element.

In persistence implementations that don't support XPath, this kind of queries are a little more tricky. The path can:

  • either be used as a plain key and ignore the <lang> element (therefore ignoring the language-sensitive search)
  • or be translated to the native query language of the persistence implementations

Drafts

The <drafts> element is used to tell the implementation of the persistence API which documents we're interested in, and thus which documents it must return:

  • exclude: don't include drafts.
  • only: only return drafts
  • include: return both drafts and non-drafts. This is the default if the <drafts> element is missing.

NOTE: exclude is not used in 4.6.2, but it could be used in the future.

The <drafts> element can have the following attributes when the value is only or include:

  • for-document-id="$document-id": return only drafts with the given document id
  • for-never-saved-document="true": return only drafts for documents never saved

This is what Form Runner uses as of Orbeon Forms 4.6.2:

  • In the Summary page
    • No <drafts> element, which means to retrieve drafts and non-drafts alike
    • If drafts-for-never-saved-document=true in the URL
      • <drafts for-never-saved-document="true">only</drafts>
  • In edit mode, if a draft is not explicitly requested in the URL with draft=true
    • <drafts for-document-id="$document-id">only</drafts for-document-id="$document-id"></drafts>
  • In new mode
    • <drafts for-never-saved-document="true"``>only</drafts>

Here are examples of requests/response in edit mode. Only zero or one document can be returned:

<search>
    <drafts for-document-id="fbba3db82e7fb1e0054e97d49026b5d303a1fa2f">only</drafts>
    <page-size>10</page-size>
    <page-number>1</page-number>
    <lang>en</lang>
</search>

<documents total="5" search-total="1">
    <document created="2014-08-15T17:12:04.570Z" last-modified="2014-08-15T17:18:50.993Z" draft="true"
              name="fbba3db82e7fb1e0054e97d49026b5d303a1fa2f" operations="*">
        <details/>
    </document>
</documents>

And in new mode, zero or more documents can be returned:

<search>
    <drafts for-never-saved-document="true">only</drafts>
    <page-size>10</page-size>
    <page-number>1</page-number>
    <lang>en</lang>
</search>

<documents total="4" search-total="2">
    <document created="2014-08-15T17:05:11.563Z" last-modified="2014-08-15T17:05:11.563Z" draft="true"
              name="dac2971cca0e71e36880e890297ab8818a5298e0" operations="*">
        <details/>
    </document>
    <document created="2014-08-15T17:04:55.383Z" last-modified="2014-08-15T17:04:55.383Z" draft="true"
              name="b0e28c1ed4ea6cfab445b40bb9dcb8bc6c296c92" operations="*">
        <details/>
    </document>
</documents>

Query response

The persistence layer must return a document of this form:

<documents total="13" search-total="4" page-size="10" page-number="1" query="">
    <document created="2011-05-06T14:58:40.376-07:00"
              last-modified="2011-09-12T12:05:07.3-07:00"
              name="e8bfd3ba63fa12a8b59cdd5c08369a35"
              draft="false">
        <details>
            <detail>The Terror</detail>
            <detail>Dan Simmons</detail>
            <detail>en</detail>
        </details>
    </document>
    <document created="2011-05-06T14:58:39.611-07:00"
              last-modified="2011-09-12T12:05:06.914-07:00"
              name="9531a191c77b75c417e9874427fa21f7"
              draft="false">
        <details>
            <detail>The Little Prince</detail>
            <detail>Antoine de Saint-Exupéry</detail>
            <detail>fr</detail>
        </details>
    </document>
    <!-- ... more <document> elements ... -->
</documents>

The root element contains these attributes:

  • total attribute:
    • total number of documents in the database for this app/form
  • search-total attribute:
    • number of documents matched by the current search
  • page-size attribute:
    • echo of the query's attribute
    • this might be removed in the future
  • page-number attribute:
    • echo of the query's attribute
    • this might be removed in the future
  • query attribute:
    • echo of the full-text query text
    • this might be removed in the future

For each of the documents found, a <document> element is returned:

  • created attribute:
    • creation date in ISO format
  • last-modified attribute:
    • last modification date in ISO format
  • name attribute:
    • document identifier,
    • this must match the identifier created by Form Runner when saving the data
  • draft attribute:
    • true or false, depending on whether the form data is a draft (autosaved) or not
  • operations attribute:

Each document contains one element in the order determined by the <query> elements with a summary-field set to true in the request. The text value of the <detail> element is the value of the field in the document found.