Q (programming language from Kx Systems) explained

q
Paradigm:Array, functional
Year:2003[1]
Designer:Arthur Whitney
Developer:Kx Systems
Latest Release Version:4.0
Latest Release Date:March 17, 2020[2]
Typing:Dynamic, strong
Influenced By:A+, APL, Scheme, k

Q is a programming language for array processing, developed by Arthur Whitney. It is proprietary software, commercialized by Kx Systems. Q serves as the query language for kdb+, a disk-based and in-memory, column-based database. Kdb+ is based on the language k, a terse variant of the language APL. Q is a thin wrapper around k, providing a more readable, English-like interface. One use case is financial time series analysis, where inexact time matches are possible. For example, a bid can be matched with the most recent ask before it: the two timestamps differ slightly but are matched anyway.[3]
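An inexact time match of this kind is commonly written with the built-in as-of join, aj. The following is a minimal sketch; the tables ask and bid, their column names, and all values are made up for illustration. Note that aj matches the last row at or before each time, not strictly before it.

```q
/ hypothetical sample data, both tables sorted by time
ask:([] sym:`a`a`a; time:09:30:00.000 09:30:05.000 09:30:10.000; askpx:10.2 10.3 10.4)
bid:([] sym:`a`a; time:09:30:06.500 09:30:12.000; bidpx:10.1 10.25)

/ for each bid row, take the last ask row with the same sym and a time <= the bid time
r:aj[`sym`time;bid;ask]
r`askpx   / 10.3 10.4
```

The result keeps the rows and timestamps of bid and adds the askpx column from the matched ask rows.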

Overview

The fundamental building blocks of q are atoms, lists, and functions. Atoms are scalars and include the data types numeric, character, date, and time. Lists are ordered collections of atoms (or other lists) upon which the higher-level data structures, dictionaries and tables, are internally constructed. A dictionary is a map of a list of keys to a list of values. A table is a transposed dictionary of symbol keys and equal-length lists (columns) as values. A keyed table, analogous to a table with a primary key placed on it, is a dictionary where the keys and values are arranged as two tables.

The following code demonstrates the relationships of the data structures. Expressions to evaluate appear prefixed with the q) prompt, with the output of the evaluation shown beneath:

q)`john / an atom of type symbol
`john
q)50 / an atom of type integer
50

q)`john`jack / a list of symbols
`john`jack
q)50 60 / a list of integers
50 60

q)`john`jack!50 60 / a list of symbols and a list of integers combined to form a dictionary
john| 50
jack| 60

q)`name`age!(`john`jack;50 60) / an arrangement termed a column dictionary
name| john jack
age | 50   60

q)flip `name`age!(`john`jack;50 60) / when transposed via the function "flip", the column dictionary becomes a table
name age
--------
john 50
jack 60

q)(flip (enlist `name)!enlist `john`jack)!flip (enlist `age)!enlist 50 60 / two equal length tables combined as a dictionary become a keyed table
name| age
----| ---
john| 50
jack| 60

These entities are manipulated via functions, which include the built-in functions that come with Q (which are defined as K macros) and user-defined functions. Functions are a data type, and can be placed in lists, dictionaries and tables, or passed to other functions as parameters.
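As a brief sketch of functions being used as ordinary values, the following stores functions in a dictionary and passes one to another function. The names square, fs and apply2 are hypothetical and chosen only for this example.

```q
square:{x*x}                / a user-defined function bound to a name
fs:`sq`neg!(square;neg)     / functions stored as the values of a dictionary
r1:fs[`sq][4]               / look up a function and apply it: 16
apply2:{[f;v] f[f[v]]}      / a function that takes another function as a parameter
r2:apply2[square;3]         / square applied twice: 81
r3:square each 1 2 3        / "each" applies the function to every item: 1 4 9
```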

Examples

Like K, Q is interpreted, and the result of evaluating an expression is displayed immediately unless the expression is terminated with a semicolon. The Hello world program is thus trivial:

q)"Hello world!"
"Hello world!"

The following expression sorts a list of strings stored in the variable x descending by their lengths:

x@idesc count each x

The expression is evaluated from right to left as follows:

  1. "count each x" returns the length of each word in the list x.
  2. "idesc" returns the indices that would sort a list of values in descending order.
  3. "@" uses the integer values on the right to index into the original list of strings on the left.
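The three steps can be traced on a small input. The list below is a hypothetical sample, chosen so that every string has a different length.

```q
x:("kiwi";"fig";"banana";"apple")   / hypothetical sample list of strings
count each x                        / 4 3 6 5
idesc count each x                  / 2 3 0 1
x@idesc count each x                / ("banana";"apple";"kiwi";"fig")
```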

The factorial function can be implemented directly in Q as

{prd 1+til x}

or recursively as

{$[x=0;1;x*.z.s[x-1]]}

Note that in both cases the function implicitly takes a single argument called x. In general, a function can use up to three implicit arguments, named x, y and z, or it can declare its arguments explicitly as named local variables.
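The two ways of declaring arguments can be compared in a short sketch; the function names add2, add3 and addn are hypothetical.

```q
add2:{x+y}                  / two implicit arguments, x and y
add3:{x+y+z}                / three implicit arguments, x, y and z
addn:{[a;b;c;d] a+b+c+d}    / explicitly named arguments; more than three are allowed
r1:add2[1;2]                / 3
r2:add3[1;2;3]              / 6
r3:addn[1;2;3;4]            / 10
```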

In the direct implementation, the expression "til x" enumerates the integers from 0 to x-1, "1+" adds 1 to every element of the list and "prd" returns the product of the list.

In the recursive implementation, the syntax "$[condition; expr1; expr2]" is a ternary conditional: if the condition is true then expr1 is returned; otherwise expr2 is returned. The expression ".z.s" is loosely equivalent to 'this' in Java or 'self' in Python: it is a reference to the containing function, and enables functions in q to call themselves.
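The pieces described above can be tried one at a time. The sketch below first traces the direct computation for the input 5, then uses the conditional and ".z.s" in a different recursive function; the Fibonacci function fib is not from the original text and is included only as an illustration.

```q
til 5                                 / 0 1 2 3 4
1+til 5                               / 1 2 3 4 5
prd 1+til 5                           / 120
$[3>2;`yes;`no]                       / `yes, because the condition is true
fib:{$[x<2;x;.z.s[x-1]+.z.s[x-2]]}    / hypothetical example: the function calls itself via .z.s
fib 10                                / 55
```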

When x is an integer greater than 2, the following function will return 1 if it is a prime, otherwise 0:

{min x mod 2_til x}

The function is evaluated from right to left:

  1. "til x" enumerates the non-negative integers less than x.
  2. "2_" drops the first two elements of the enumeration (0 and 1).
  3. "x mod" performs modulo division between the original integer and each value in the truncated list.
  4. "min" finds the minimum value of the list of modulo results.
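As a worked illustration, the steps can be evaluated one by one for the sample input 7, followed by the composite number 9 for comparison.

```q
x:7
til x                 / 0 1 2 3 4 5 6
2_til x               / 2 3 4 5 6
x mod 2_til x         / 1 1 3 2 1
min x mod 2_til x     / 1, so 7 is prime
min 9 mod 2_til 9     / 0, because 9 mod 3 is 0
```

The result for a prime is exactly 1 because x mod x-1 is always 1 when x is greater than 2, and no smaller remainder occurs unless some divisor leaves a remainder of 0.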

The q programming language contains its own table query syntax called qSQL, which resembles traditional SQL but has important differences, mainly because the underlying tables are oriented by column rather than by row.

q)show t:([] name:`john`jack`jill`jane; age: 50 60 50 20) / define a simple table and assign to "t"
name age
--------
john 50
jack 60
jill 50
jane 20
q)select from t where name like "ja*",age>50
name age
--------
jack 60
q)select rows:count i by age from t
age| rows
---| ----
20 | 1
50 | 2
60 | 1
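The column orientation can be seen directly, since each column of a table is an ordinary list. The following sketch reuses the table from the session above; the particular queries are chosen only for illustration.

```q
t:([] name:`john`jack`jill`jane; age:50 60 50 20)
t`age                              / a column is a plain list: 50 60 50 20
exec avg age from t                / 45f
select from t where age=max age    / the single row for jack
flip t                             / the underlying column dictionary
```

Because a column is a list, aggregate functions such as avg and max apply to it in the same way as to any other list.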

Notes and References

  1. Web site: Q Language Widening the Appeal of Vectors. https://web.archive.org/web/20070101213150/http://vector.org.uk/weblog/archive/000036.html (archived January 1, 2007). Retrieved June 1, 2016.
  2. Changes in 4.0. Palo Alto: Kx Systems. Mar 17, 2020. Retrieved Apr 15, 2020.
  3. Web site: Q Reference Card. Retrieved 15 April 2020.