back start next


[start] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [ 19 ] [20] [21] [22] [23] [24] [25] [26] [27] [28] [29] [30] [31] [32] [33] [34] [35] [36] [37] [38] [39] [40] [41] [42] [43] [44] [45] [46] [47] [48] [49] [50] [51] [52] [53] [54] [55] [56] [57] [58] [59] [60] [61] [62] [63] [64] [65] [66] [67] [68] [69] [70] [71] [72] [73] [74] [75] [76] [77] [78] [79] [80] [81] [82] [83] [84] [85] [86] [87] [88] [89] [90] [91] [92] [93] [94] [95] [96] [97] [98] [99] [100] [101] [102] [103] [104] [105] [106] [107] [108] [109] [110] [111] [112] [113] [114] [115] [116] [117] [118] [119] [120] [121] [122] [123] [124] [125] [126] [127] [128] [129] [130] [131] [132] [133] [134]


19

Any time series (inhomogeneous)

Regularizing Operator

Homogeneous time series

of same variable

Any time series z (inhomogeneous)

Operator Q

Time series £2[z], new variable, same time points

(inhomogeneous)

FIGURE 3.1 Different operator types to study time series:

(a) Sampling an inhomogeneous time series at regular time intervals. The resulting homogeneous time series can be treated by standard methods of time series analysis.

(b) Computing a new variable from the initial variable while keeping the initial (in-homogeneous) time points. Example: computing a series of local volatility values from the initial price series.

An important distinction between operators has to be made. We introduce two operator types:

Microscopic operators depend on the actual sampling of the inhomogeneous time series. Eliminating some random ticks leads to very different results.

Macroscopic operators extract average behaviors of their time series argument. They are essentially immune to small variations of the individual ticks, including adding or eliminating few ticks.

A possible technical definition of macroscopic operators is that they have a well-defined limit when the price quotes become infinitely dense. Practically, if the price quotes are sufficiently dense inside the range of the operator, we are close enough to this limit. For inhomogeneous time series, macroscopic operators are better behaved and more robust than microscopic operators. For homogeneous time series, this distinction is unnecessary because the sampling frequency is fixed and there is no reason to take a continuous-time limit or to formally add or remove ticks. Moreover, because homogeneous time series analysis is based on the backward shift operator (which is microscopic), most of the conventional time series analysis becomes unusable for inhomogeneous time series. The classification between operators is further explored by Zumbach and Muller (2001).

Macroscopic operators can be represented by convolutions and are discussed in Section 3.3. The archetype of a macroscopic operator is the exponential moving average (EMA) that computes a moving average with an exponentially decaying weight of the past.

Microscopic operators are presented in Section 3.4. Examples are the time difference St between ticks (e.g., in Stj = tj - f/ i) and the backward shift operator defined in Section 3.4.1.



3.2 VARIABLES IN HOMOGENEOUS TIME SERIES

Basic variables such as the price, the return, the realized volatility, and the spread arc defined in Section 3.2. In order to capture the dynamics of the intraday market, some more variables such as the tick frequency are of interest.

3.2.1 Interpolation

Before defining different variables, the generation of homogeneous time series has to be explained. A homogeneous time series, although taken for granted in time series analysis, is an artifact that has to be constructed from the raw data, which is an inhomogeneous series with times tj and values zj = z(tj). The index j refers to the irregularly spaced sequence of the raw series. By utilizing an interpolation method, we construct a homogeneous time series with values at times to + iAt, regularly spaced by At, rooted at a time to. The index i refers to the homogeneous series.

The time fo + iAt is bracketed by two times tj of the raw series

/ = maxfj \tj < to +iAt) , tj < t0+iAt < fy+i (3-1)

We interpolate between fy and tj+\. The two most important interpolation methods are linear interpolation

to + iAt - tj>

z(to + iAt) = zr + --(zj+i - zr) (3.2)

tj+i - tj

and previous-tick interpolation (taking the most recent value),

z(to+iAt) = Zj (3.3)

which was already proposed by Wasserfallen and Zimmermann (1985).

Both methods which are illustrated by Figure 3.2 have their merits. Previous-tick interpolation respects causality as it exclusively uses information already known at time fo + A?, whereas linear interpolation uses information from time fy+i, which lies in the future of time fo + iAt. When using previous-tick interpolation over a gap (a long period of missing data) in the raw data, a spurious jump of z may be observed at the end of the gap, which may spoil a statistical analysis of extreme returns of z. In this example, linear interpolation would be the appropriate choice. As advocated in Miiller et al. (1990), linear interpolation is the appropriate method for a random process with identically and independently distributed (i.i.d.) increments. Many statistical studies and model estimations can be alternatively done with both interpolation methods, in practice. The difference between the results indicates the sensitivity to the choice of the interpolation method. The difference is often small, even negligible thanks to high-frequency data. In the empirical studies of the book, the choice of the interpolation method is discussed whenever it matters.



figure 3.2 interpolation methods to obtain a homogeneous time series: selectir\ values at equally spaced time points tj, indicated by dotted vertical lines. the inhomogeneous time sequence of raw observations is indicated by ticks below the horizontal time axis and by dashed vertical lines (only for the observations bracketing the time points tj two important interpolation methods are illustrated by empty circles: linear interpolatior (big circles) and previous-tick interpolation (small circles).

The transformation of an inhomogeneous time series to a homogeneous ont can also be understood as the result of a special microscopic time series operate which is discussed in Sections 3.3.1 and 3.4.2.

3.2.2 price

Prices of assets are the most important variables explored in finance. Depending on the market structure and the data supplier, prices are available as quotes tr different forms:

Bid-ask price pairs: and pask

Transaction prices (which may or may not be former bid or ask quotes)

Bid, ask, transaction prices in irregular sequence (not in pairs, not synchronous)

Middle prices

One individual observation at a time tj, also in the case of bid-ask pairs, is callei. a tick.

Bid-ask price pairs are discussed first. FX prices and other asset prices, a: well as nonprice variables such as spot interest rates and implied volatility figure: from option markets, are quoted as bid-ask pairs. The most important variable



[start] [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [ 19 ] [20] [21] [22] [23] [24] [25] [26] [27] [28] [29] [30] [31] [32] [33] [34] [35] [36] [37] [38] [39] [40] [41] [42] [43] [44] [45] [46] [47] [48] [49] [50] [51] [52] [53] [54] [55] [56] [57] [58] [59] [60] [61] [62] [63] [64] [65] [66] [67] [68] [69] [70] [71] [72] [73] [74] [75] [76] [77] [78] [79] [80] [81] [82] [83] [84] [85] [86] [87] [88] [89] [90] [91] [92] [93] [94] [95] [96] [97] [98] [99] [100] [101] [102] [103] [104] [105] [106] [107] [108] [109] [110] [111] [112] [113] [114] [115] [116] [117] [118] [119] [120] [121] [122] [123] [124] [125] [126] [127] [128] [129] [130] [131] [132] [133] [134]