The quality of data collapse

The quality function

In the following, we present a measure by Houdayer & Hartmann [HH04] for the quality of the data collapse. Melchert [Mel09] refers to some alternative measures, for example [BS01], [WBJS08], and to some applications of these measures in the literature.

Houdayer & Hartmann [HH04] refine a method proposed by Kawashima & Ito [KI93]. They define the quality as the reduced \(\chi^2\) statistic

\[S = \frac{1}{\mathcal{N}} \sum_{i,j} \frac{(y_{ij} - Y_{ij})^2}{dy_{ij}^2+dY_{ij}^2},\]

where \(y_{ij}\) and \(dy_{ij}\) are the scaled observations and their standard errors at \(x_{ij}\), and \(Y_{ij}\) and \(dY_{ij}\) are the estimated value of the master curve and its standard error at \(x_{ij}\).

The quality \(S\) is the mean square of the weighted deviations from the master curve. As we expect the individual deviations \(y_{ij} - Y_{ij}\) to be of the order of the individual error \(\sqrt{dy_{ij}^2 + dY_{ij}^2}\) for an optimal fit, the quality \(S\) should attain its minimum \(S_{\min}\) at around \(1\) and be much larger otherwise [BR03].

Let \(i\) enumerate the system sizes \(L_i\), \(i = 1, \ldots, k\) and let \(j\) enumerate the parameters \(\varrho_j\), \(j = 1, \ldots, n\) with \(\varrho_1 < \varrho_2 < \ldots < \varrho_n\). The scaled data are

\[\begin{split}y_{ij} & := L_i^{-\zeta/\nu} a_{L_i, \varrho_j} \\ dy_{ij} & := L_i^{-\zeta/\nu} da_{L_i, \varrho_j} \\ x_{ij} & := L_i^{1/\nu}(\varrho_j - \varrho_c).\end{split}\]
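The scaling step above can be sketched in NumPy as follows. This is a minimal illustration, not the package's implementation: the function name `scale_data` and its argument names are hypothetical, and the observations `a` and their errors `da` are assumed to be arrays of shape `(k, n)`, with one row per system size and one column per parameter value.

```python
import numpy as np


def scale_data(L, rho, a, da, rho_c, nu, zeta):
    """Return the scaled arrays x, y, dy, each of shape (k, n).

    Hypothetical helper: L has length k, rho has length n,
    a and da have shape (k, n).
    """
    L = np.asarray(L, dtype=float)[:, np.newaxis]      # shape (k, 1)
    rho = np.asarray(rho, dtype=float)[np.newaxis, :]  # shape (1, n)
    a = np.asarray(a, dtype=float)
    da = np.asarray(da, dtype=float)

    x = L ** (1.0 / nu) * (rho - rho_c)   # x_ij = L_i^(1/nu) (rho_j - rho_c)
    y = L ** (-zeta / nu) * a             # y_ij = L_i^(-zeta/nu) a_ij
    dy = L ** (-zeta / nu) * da           # dy_ij = L_i^(-zeta/nu) da_ij
    return x, y, dy
```

Because \(L_i^{1/\nu} > 0\) and the \(\varrho_j\) are sorted, each row of `x` is sorted in ascending order, which a search for enclosing points can rely on.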

The sum in the quality function \(S\) only involves terms for which the estimated value \(Y_{ij}\) of the master curve at \(x_{ij}\) is defined. The number of such terms is \(\mathcal{N}\).
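As a rough sketch of how \(S\) and \(\mathcal{N}\) fit together, the function below evaluates the sum once the master curve estimates are known. The name `quality_from_master` is hypothetical, and the convention of marking an undefined master curve value with `NaN` is an assumption made for this example only.

```python
import numpy as np


def quality_from_master(y, dy, Y, dY):
    """Reduced chi-square S over the terms where the master curve is defined.

    Assumption for this sketch: entries of Y and dY are NaN where the
    master curve is undefined; those terms are left out of the sum.
    """
    y, dy, Y, dY = (np.asarray(v, dtype=float) for v in (y, dy, Y, dY))

    defined = np.isfinite(Y) & np.isfinite(dY)
    n_terms = np.count_nonzero(defined)   # this is the normalization N
    if n_terms == 0:
        return np.nan                     # no term contributes

    residual_sq = (y[defined] - Y[defined]) ** 2
    variance = dy[defined] ** 2 + dY[defined] ** 2
    return np.sum(residual_sq / variance) / n_terms
```

With this convention, points outside the overlap of the scaled curves neither add to the sum nor to the count, so they cannot artificially lower \(S\).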

The master curve itself depends on the scaled data. For a given system size \(L_i\), we estimate the master curve at \(x_{ij}\) from the data of all the other system sizes: for each \(i' \neq i\), let \(j'\) be such that \(x_{i'j'} \leq x_{ij} \leq x_{i'(j'+1)}\), and select the two enclosing points \((x_{i'j'}, y_{i'j'}, dy_{i'j'}), (x_{i'(j'+1)}, y_{i'(j'+1)}, dy_{i'(j'+1)})\). If there is no such \(j'\) for some \(i'\), select no points from that system size. If there is no such \(j'\) for any \(i'\), the master curve remains undefined at \(x_{ij}\).
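The selection rule can be sketched with `numpy.searchsorted`, assuming that every row of the scaled array `x` is sorted in ascending order (which holds when the parameters \(\varrho_j\) are sorted). The function name `select_enclosing_points` is hypothetical and the code is an illustration of the rule, not the package's own routine.

```python
import numpy as np


def select_enclosing_points(x, y, dy, i, j):
    """Collect the points (x_l, y_l, dy_l) that enclose x[i, j].

    For every other system size i_other != i, pick the two neighbouring
    points with x[i_other, jp] <= x[i, j] <= x[i_other, jp + 1], if any.
    Returns an array of shape (number_of_points, 3), possibly empty.
    """
    x, y, dy = (np.asarray(v, dtype=float) for v in (x, y, dy))
    x_target = x[i, j]
    selected = []

    for i_other in range(x.shape[0]):
        if i_other == i:
            continue
        row = x[i_other]
        n = row.size
        # Largest index jp with row[jp] <= x_target (or -1 if none).
        jp = np.searchsorted(row, x_target, side="right") - 1
        # x_target sitting exactly on the last point is still enclosed.
        if jp == n - 1 and n >= 2 and row[-1] == x_target:
            jp = n - 2
        if jp < 0 or jp >= n - 1:
            continue                      # no enclosing pair for this size
        for jj in (jp, jp + 1):
            selected.append((row[jj], y[i_other, jj], dy[i_other, jj]))

    return np.array(selected, dtype=float).reshape(len(selected), 3)
```

An empty result corresponds to the case where the master curve remains undefined at \(x_{ij}\).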

Given the selected points \((x_l, y_l, dy_l)\), the local approximation of the master curve is the linear fit

\[y = mx + b\]

with weighted least squares [Str11]. The weights \(w_l\) are the reciprocal variances of the selected points, \(w_l := 1/dy_l^2\). The estimates and (co)variances of the slope \(m\) and intercept \(b\) are

\[\begin{split}\hat{b} &= \frac{1}{\Delta} (K_{xx}K_y - K_xK_{xy}) \\ \hat{m} &= \frac{1}{\Delta} (K K_{xy} - K_x K_y)\end{split}\]

\[\hat{\sigma}_b^2 = \frac{K_{xx}}{\Delta}, \quad \hat{\sigma}_m^2 = \frac{K}{\Delta}, \quad \hat{\sigma}_{bm} = - \frac{K_x}{\Delta}\]

with \(K_{nm} := \sum w_l x_l^n y_l^m\), \(K := K_{00}\), \(K_x := K_{10}\), \(K_y := K_{01}\), \(K_{xx} := K_{20}\), \(K_{xy} := K_{11}\), \(\Delta := KK_{xx} - K_x^2\).
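The closed-form expressions above translate directly into a few weighted sums. The following is a minimal sketch with a hypothetical function name, `weighted_linear_fit`; it assumes at least two selected points with distinct \(x_l\), since otherwise \(\Delta = 0\) and the fit is undefined.

```python
import numpy as np


def weighted_linear_fit(x, y, dy):
    """Weighted least-squares fit of y = m*x + b with weights 1/dy**2.

    Returns (m, b, var_m, var_b, cov_bm). Assumes at least two points
    with distinct x, otherwise delta is zero.
    """
    x, y, dy = (np.asarray(v, dtype=float) for v in (x, y, dy))
    w = 1.0 / dy ** 2

    K = np.sum(w)              # K_00
    Kx = np.sum(w * x)         # K_10
    Ky = np.sum(w * y)         # K_01
    Kxx = np.sum(w * x * x)    # K_20
    Kxy = np.sum(w * x * y)    # K_11
    delta = K * Kxx - Kx ** 2

    b = (Kxx * Ky - Kx * Kxy) / delta
    m = (K * Kxy - Kx * Ky) / delta
    var_b = Kxx / delta
    var_m = K / delta
    cov_bm = -Kx / delta
    return m, b, var_m, var_b, cov_bm
```

For example, the exact line \(y = 2x + 1\) sampled at \(x = 0, 1, 2, 3\) with unit errors gives \(K = 4\), \(K_x = 6\), \(K_{xx} = 14\) and \(\Delta = 20\), so the fit recovers \(\hat{m} = 2\), \(\hat{b} = 1\) with \(\hat{\sigma}_m^2 = 0.2\), \(\hat{\sigma}_b^2 = 0.7\) and \(\hat{\sigma}_{bm} = -0.3\).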

Hence, the estimated value of the master curve at \(x_{ij}\) is

\[Y_{ij} = \hat{m} x_{ij} + \hat{b}\]

with error propagation

\[dY_{ij}^2 = \hat{\sigma}_m^2 x_{ij}^2 + 2 \hat{\sigma}_{bm} x_{ij} + \hat{\sigma}_b^2.\]
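To make the error propagation concrete, the sketch below evaluates the local linear fit and its variance at a given point. The function name `master_curve_at` and its argument names are hypothetical; the fit parameters are assumed to come from a weighted linear fit of the selected points, with `var_m` the variance of the slope, `var_b` the variance of the intercept and `cov_bm` their covariance.

```python
def master_curve_at(x_ij, m, b, var_m, var_b, cov_bm):
    """Value Y and variance dY**2 of the fitted line at x_ij.

    Propagates the slope variance, the intercept variance and their
    covariance through Y = m * x_ij + b.
    """
    Y = m * x_ij + b
    dY_sq = var_m * x_ij ** 2 + 2.0 * cov_bm * x_ij + var_b
    return Y, dY_sq


# Fit parameters of y = 2x + 1 sampled at x = 0, 1, 2, 3 with unit errors.
Y, dY_sq = master_curve_at(1.5, m=2.0, b=1.0, var_m=0.2, var_b=0.7, cov_bm=-0.3)
```

In this example the variance is \(0.2 \cdot 2.25 - 2 \cdot 0.3 \cdot 1.5 + 0.7 = 0.25\). The negative covariance term matters: dropping it would overestimate the error of the master curve near the centre of the fitted points.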

Implementation in the fssa package

Routines

fssa.fssa.quality: Quality of data collapse onto a master curve defined by the data
