# Visualising disease progression on multiple variables with vector plots and path plots

- Stanley E Lazic
^{1, 2}Email author, - Sarah L Mason
^{2}, - Andrew W Michell
^{3}and - Roger A Barker
^{2, 4}

**9**:32

**DOI: **10.1186/1471-2288-9-32

© Lazic et al; licensee BioMed Central Ltd. 2009

**Received: **11 November 2008

**Accepted: **27 May 2009

**Published: **27 May 2009

## Abstract

### Background

It is often desirable to observe how a disease progresses over time in individual patients, rather than graphing group averages; and since multiple outcomes are typically recorded on each patient, it would be advantageous to visualise disease progression on multiple variables simultaneously.

### Methods

A variety of vector plots and a path plot have been developed for this purpose, and data from a longitudinal Huntington's disease study are used to illustrate the utility of these graphical methods for exploratory data analysis.

### Results

Initial and final values for three outcome variables can be easily visualised per patient, along with the change in these variables over time. In addition to the disease trajectory, the path individual patients take from initial to final observation can be traced. Categorical variables can be coded with different types of vectors or paths (e.g. different colours, line types, line thickness) and separate panels can be used to include further categorical or continuous variables, allowing clear visualisation of further information for each individual. In addition, summary statistics such as mean vectors, bivariate interquartile ranges and convex polygons can be included to assist in interpreting trajectories, comparing groups, and detecting multivariate outliers.

### Conclusion

Vector and path plots are useful graphical methods for exploratory data analysis when individual-level information on multiple variables over time is desired, and they have several advantages over plotting each variable separately.

## Background

Clinical studies typically measure multiple outcomes on patients as well as record information on patient characteristics such as age, sex, genotype, disease severity, and age of onset. Many such studies are longitudinal, where initial or baseline values are obtained, and then patients are followed over time to observe how the disease progresses. Often the research question involves a comparison of two or more groups, such as an experimental and control group, or a comparison of progression between subgroups of patients with the disease. Numerous methods are available to analyse multiple observations on subjects over time, such as repeated measures ANOVA, multivariate ANOVA, derived-variable or summary-measure analysis (e.g. slopes, intercepts, area under the curve, etc. [1]), time-series analysis, mixed-effects models [2], and functional data analysis [3, 4], with the data often being graphically presented as either a line or bar graph, where the mean (averaged across subjects) and standard error of the mean are plotted at each time point. Alternatively, separate lines for each patient are occasionally used to show how individual patients change over time.

There is however a comparative lack of graphical methods to visualise more than one variable at multiple time points; this would be useful to help understand how individual patients progress on two or three variables simultaneously, and how each individual compares to the mean of their respective group or to all other patients. Current multivariate methods – both supervised and unsupervised – and associated graphical techniques mainly focus on finding groups, classes, clusters, or structure in the data, but generally do not consider changes over time on these variables [5, 6]. If time is included as a variable then 'multivariate' generally refers to multiple observations on a *single variable*, and the ability to visualise *multiple observations* on *multiple variables* for each individual would be of great use in understanding the results of many biomedical studies. This would be useful for exploratory data analysis (EDA), as it would allow for the detection of bivariate or multivariate outliers – patients whose values on any single variable are within the normal range, but whose values on a combination of variables is unusual. For example, a value of 189 cm (6'2") is well within the normal adult range for height, as is 59 kg (130 lbs) for weight; however, it would be unusual for *the same person* to have a height of 189 cm and a weight of only 59 kg. In practice, height and weight are combined and expressed as a body mass index (BMI = kg/m^{2}), and this individual's very low BMI could be detected with standard methods. However, most combinations of variables do not have such conventions to relate them to each other, and they cannot be easily expressed by a convenient method such as a product or sum. For example, in a patient with Huntington's disease (HD), there is no meaningful way to combine performance on a cognitive test such as the cognitive score of the Unified Huntington's Disease Rating Scale (UHDRS) with dopamine D_{2} receptor density in the striatum, as determined by PET imaging using ^{11}C raclopride [7–9]. Multivariate outlier detection is a necessary quality control step prior to statistical analysis, as it draws attention to data that may have been recorded incorrectly and which would not be detected by examining the values for each variable separately using standard graphical methods such as histograms or quantile plots. In addition, EDA allows for the detection of novel or interesting relationships in the data – relationships that may not have been predicted beforehand, and which might go unnoticed with standard analytical methods.

Due to the work of Tukey [10], Cleveland [11, 12], Cook and Swayne [13], and many others [14], it is now widely recognised that to fully appreciate the structure of data it must be examined visually. This is particularly true of multivariate data, and therefore we have developed several variations of a standard vector plot and used these to visualise disease progression in a cohort of patients with Huntington's disease from a recent paper by Michell et al. [15], especially the data shown in Figure three of that paper.

Vector plots are often used to graph information on wind speed and direction, fluid flow, magnetic fields, or to examine the behaviour of systems of differential equations. Typically, the base of the vectors (arrows) are arranged on a grid, and the length and/or thickness of the vectors encodes information on magnitude (e.g. wind speed), while the direction of the vectors relates to the direction of the phenomenon. In the neuroscience literature, vector plots have been used to represent the response of neurons in the motor cortex to movement in a particular direction, with the length of the vectors corresponding to the firing rate of the neurons [[16], p. 390–391]. In our graphs, the base of the arrows are not arranged on a grid but encode information on the initial values of multiple variables and the tips of the arrows are the final values on these same variables, with one vector for each patient. The length of the arrows therefore encodes the magnitude of change over time from initial to final values, while the direction of the vectors indicates the direction of change (increasing/decreasing, better/worse, etc. depending on what is being graphed). Since each vector represents one individual, it is possible to view how individuals change over time as well as how each individual compares to the rest of the sample. The length and direction of the vectors can be thought of as a disease trajectory, showing how patients progress in 'disease space' on multiple variables (assuming that these variables suitably reflect the disease state). Six pieces of information per patient can be easily visualised: initial and final values for three variables on a 3D graph, and further variables such as group membership (e.g. male vs. female) can be encoded by different types of vectors, for example vectors of different colour. Separate panels can also be used to plot additional categorical or continuous variables. The vector plots graphically display the net change from initial to final observation, but do not provide information on the route or path taken between these two time points. A 'path plot' is therefore introduced which is similar in principle to a vector plot, but it traces the progression of the disease over time.

The usefulness of examining patient-level data is recognised [17], and is particularly suitable for the evaluation of biomarkers, as it is not enough that a marker reliably tracks the progression of the disease on average, but that it does so sufficiently well for each individual patient [18]. In addition, it is likely that combinations of biomarkers may prove to be a more powerful method for following disease progression.

## Methods

### Patients and apparatus

Huntington's disease is a progressive neurodegenerative disorder caused by an increase in the number of glutamine amino acids in the huntingtin protein, due to an expansion in the number of CAG repeats encoding for glutamine in the first exon of the *huntingtin* gene. HD affects approximately 1 in 10,000 people with onset typically occurring in the fourth decade and progresses for some 10–20 years before becoming fatal [19]. Patients with genetically confirmed disease were recruited from the regional HD clinic at the Cambridge Centre for Brain Repair and were assessed every six months on the Unified Huntington's Disease Rating Scale (UHDRS [20]), Total Functional Assessment scale (TFA) and a hand tapping task; further information can be found in Michell et al. [15, 21]. Approval was obtained from the ethical review committee at Addenbrooke's Hospital, Cambridge, (Reference number: LREC95/086) in compliance with the Helsinki Declaration, and informed consent was obtained from the patients.

The tapping apparatus consisted of two buttons 6 cm in diameter, mounted with their centres 30 cm apart. The patients' task was to alternately tap one button after the other as rapidly as possible using the palm of one hand. The total number of taps made in 30 seconds was recorded for each hand, and the data are presented as the mean of the left hand and right hand scores. The UHDRS is a uniform assessment of the clinical features of HD and contains a motor function subscale, which measures patients' ability on a range of motor tasks including eye movements, speech, tongue protrusion, bradykinesia, dystonia, chorea, and gait. Asymptomatic individuals and controls have a value of zero and higher values indicate worse performance, with a maximum score of 120 on the motor subscale. The TFA is series of twenty five questions which assesses patients' functioning on five areas including work, finances, domestic chores, activities of daily living, and the level of care required. The scores range from 0–50, with higher scores indicating worse performance.

### Graphics and analysis

Figures were created with R (version 2.8.0) [22, 23], with code for the bivariate IQR ellipses adapted from Everitt [6]. The initial and final scores are provided in Additional File 1 and longitudinal data are provided in Additional File 2. R functions for some of the graphs are provided in Additional File 3 and information on the R language can be found in Venables and Ripley [24] or Crawley [25] as well as on the R website http://www.r-project.org. A good discussion of R graphing commands can be found in Murrell [26].

## Results and Discussion

### Visualising raw data

The basic vector plot is shown in Figure 2A, where the base of each grey arrow (closed circles) are the initial UHDRS and tapping values, and the tips of the arrows are the final values; the length of the arrow therefore represents the amount of disease progression. Patients progress from the base of the arrow at the initial assessment to the tip of the arrow at the final assessment, with the net change being graphed. The patients in this particular study were not at the same stage of the disease upon entry into the study and therefore some of the variability in the initial scores represents patients at a different stage of the disease. It should be noted that these patients were followed up for different lengths of time (mean = 6.8 years, range = 5–8 years) and therefore the length of the vectors is not directly comparable in Figure 2 and Figure 3A, as one patient may have a longer arrow (i.e. greater apparent disease progression) simply because they have been followed up for a longer time. This is a shortcoming of the dataset and not the graphical method, and can be easily accommodated (see below). Several summary statistics can be added to the basic graph to assist in visualing general trends. The first is the mean vector (black arrow in Fig 2A), which is simply the average of the initial and final values for each variable. If the data are skewed or contain outliers, then the median vector could also be used as a more robust measure of central tendency. An alternate method of displaying the mean (or median) values is to plot the projection of the mean vector onto the *x* and *y* axes. This is shown in Figure 2B, where the blue circles on the *x* and *y* axes represent the mean initial values for the number of taps and UHDRS score, respectively. The red diamonds on the axes represent the mean final values, and the distance between them is the average change. This has the advantage of making it easier to estimate the mean values and has less clutter on the plotting region. Figure 2B also plots the bivariate interquartile range (IQR), which is interpreted in the same way as a univariate IQR: 50% of the initial (blue) and final (red) values lie within their respective ellipses. This gives a visual representation of the dispersion of the values as well as the overlap of the middle portion of the initial and final values. The shape of the ellipses also provides information on the correlation between the two variables at each time point; the ellipses would be circular with no correlation, and the more elongated the ellipses the greater the correlation.

Figure 2C uses two other techniques to highlight characteristics of the data. Instead of standard axes as in the previous graphs, the axes extend from the lowest to the highest values and thus indicate the range of the data [27]. In addition, the data at each time point are enclosed in convex polygons, making it easier to visually cluster the initial and final values. A convex hull is the smallest subset of points that when connected with line segments, enclose the entire set of points, and is conveniently graphed as a shaded polygon. This plot takes a macroscopic view of the data and highlights the range and any extreme scores. For this graph it can be seen that the initial values are within a narrower range than the final values for both the tapping and UHDRS scores, implying that patients are more alike at the initial observation and tend to diverge over time.

### Visualising rate of change or change scores

The previous graphs plotted the raw scores, which has the advantage of visualising the data in the units that they are measured. However, it will often be useful to adjust for initial differences, either because people may be at different stages of the disease upon entry into the study or due to natural heterogeneity in the sample. Adjusting for initial differences also makes it easier to compare changes between patients when they all start with a common baseline. This is shown in Figure 3A, where data for each variable are represented as change scores (final minus initial values). From this graph it can be seen that two patients clearly stand out; one improved on the tapping score and the other improved on the UHDRS score.

As mentioned above, these patients were followed up for different lengths of time and therefore the lengths of the vectors are not directly comparable. However the vectors can be normalised by dividing the change scores by the length of follow-up time for each patient, and this represents the rate of change (i.e. average change per year; Fig 3B). The relative lengths of the vectors has not changed much with this particular dataset, as the follow-up time for each subject, while not identical, was not vastly different.

### Adding a third variable

A third axis can be added to include another outcome variable, allowing six pieces of information per patient to be graphed. An example is shown in Figure 5, where values on the Total Functional Assessment (TFA) score – another clinical measure – are included on a third axis. This requires 3-dimensional graphs, and the ability to rotate the graph in real time is necessary to fully appreciate the orientation of the vectors. The graphs in Figure 5 were created with the rgl package [29], which enables zooming and rotation in any direction. In addition, making the vector plots interactive by integrating them into multivariate visualisation tools such as GGobi [13] would enable techniques such as 'brushing' to be used. Brushing involves using the cursor to select a graphical object in one plot, such as a vector, and data corresponding to the same individual (or a class that the individual is in) are highlighted in other plots. For example, putting the cursor over the only vector in the top right quadrant of Figure 3B would highlight this individual in another graph, which might plot age of onset for each sex separately as a dot plot. One could then check if this patient had a particularly early or late age of onset and their sex. Values for individual patients can be linked up across multiple panels and variables, which provides a powerful method to examine the multivariate nature of data obtained in many clinical studies.

### Tracking disease over time

*path*in Wilkinson's system of describing graphical components [30]. The smoothness of the line can be adjusted to follow the data closely, which allows smaller trends and fluctuations to be detected, but will also pick up 'uninteresting' changes such as the natural variability in the repeated observations and measurement error. Alternatively, a stronger smoothing function can be used to average over the smaller fluctuations and observe only the larger trends in the data. Figure 6 used strong smoothing to highlight the overall trends, and the large black circles in this figure are the initial values, and the end of the paths (lines) are the final values (arrow heads are omitted to avoid clutter). The small dots along the line serve as a time stamp and are six months apart (the intervals at which the data were collected); this allows one to not only observe disease trajectories but how the trajectories evolve over time. Equally spaced dots (alternatively, equal lengths of line segments between dots) indicates a consistent rate of disease progression. Dots that are close together imply little progression over time whereas dots that are farther apart indicate faster progression (or improvement). With this particular dataset, some patients received neural transplantations of human foetal striatal tissue into the striatum. For the transplant patients, the line changes from black to red at the first post transplant assessment, allowing any changes in disease trajectories to be visualised after treatment. More generally, this type of plot is suitable to visualise the effect of any intervention where multiple baseline and post-treatment observations are recorded. In addition, it is a method of distinguishing subgroups of patients that progress differently over time (e.g. steady rate, accelerating, levelling-off), and which may be related to other environmental or biological factors. For example, there is increasing evidence for heterogeneity in Parkinson's disease [31–33], with faster progression in a subgroup of patients that are older, have a non-tremor dominant phenotype, and deficits in semantic fluency [34]. A path plot might be useful to visually classify individuals based on their disease progression, and one could then examine whether the subgroups differ in other respects such as gene expression, imaging results, or known risk factors for the disease. Path plots can also include a third variable but are difficult to visualise in print, and so an animated GIF can be found in Additional File 6, which plots TFA scores on the third axis.

### Advantages of vector and path plots

There are a number of advantages of using these plots for exploratory data analysis. The first, which was already mentioned in the introduction, is that they can facilitate the detection of multivariate outliers, and understanding the structure of the data will assist in the final modelling and analysis. Second, information on individual patients can be graphed, allowing comparisons of individuals with group trends; in clinical studies it is often important to observe data at the level of the individual patient rather than simply averaged responses. Third, summary statistics such as mean vectors and bivariate interquartile ranges can be included on the graph, along with visual guides such as convex polygons to describe the population of vectors. Fourth, since the length of the (normalised) vector is on a ratio scale, its interpretation is straightforward: a person with a vector twice as long as another has progressed twice as fast. Finally, missing values or a different number of observations between patients do not pose any particular difficulty for these plots. This is an important attribute because it is not uncommon to have missing data in longitudinal studies – for example if patients are unavailable for a particular assessment.

### Disadvantages

The main disadvantage is that these plots are not readily available in major statistical or graphical software packages. Some of the R code is provided, but familiarity with the R language is required in order to use it. It is hoped that these methods will become more widely available in the future.

### Further extensions

Vector plots were used in the present paper to compare changes on a clinical measure of motor dysfunction and a simple hand tapping measure that is under evaluation as a potential biomarker in HD. Other potential biomarkers such as quantitative oculometry [35] and olfactory functioning [36] can discriminate between HD patients and controls in cross-sectional studies, and whole-brain atrophy has been used with a six month follow-up period [37]. If longitudinal data were available for these methods, vector and path plots would be useful for determining which biomarker tracks disease progression better. These plots are also suitable for comparing two different assessment methods such as a novel method with a gold-standard, in addition to other existing graphical methods [38–41], and assuming that one has a gold-standard by which to compare these novel methods or potential biomarkers. Another use for these plots is to examine left versus right asymmetries. For example, Parkinson's disease often presents with one side more affected than the other (unilateral onset is a UK Parkinson's Disease Society Brain Bank criteria for diagnosis [42]) and changes in asymmetry could be tracked over time. The data used in this paper were from a longitudinal study with multiple observations, but vector plots can also be used for simple pre versus post designs. In addition, there is nothing restricting such plots to human clinical data and they would also be suitable for many preclinical animal studies.

Instead of plotting the actual values of the outcome variables, the parameters of summary or distributional statistics could also be graphed. For example, in addition to the longitudinal study in the original paper [15], it also contained a cross-sectional study comparing tapping scores between HD patients and controls. In the cross-sectional study, not only were the number of taps determined, but also the variability in the time between successive taps (the inter-tap interval). If this data was also collected for the longitudinal study, then both the number of taps and the variability of the inter-tap interval could have been plotted over time. In other words, the parameters (mean and variance) of a distribution of taps could have been plotted for each subject at different time points (in practice, the interdecile range was used rather than the variance as the distributions had some outliers). While plotting parameter values for distributions is more abstract then plotting raw data values, these plots can be used to visualise changes in parameter space over time and they also have a straightforward interpretation.

The thickness of the vectors could also be used to encode information such as class membership (e.g. male vs. female), in which case only two levels of thickness would be used. Alternatively, the vector thickness could be used to represent a continuous variable such as the variability in the original measurements, which would allow for different patterns of variability to be visualised.

It might be difficult to observe individual values and their trajectories if there are many patients. This can be partly overcome by using semi-transparent vectors (also referred to as *alpha blending* or *splatting*) so that vectors that are underneath others can be partially seen. Alternatively, subsets of patients could be selected and plotted rather than all the patients at once. Subsetting can be achieved by breaking the data down by groups or conditions, or random subsets of the data can be plotted in a number of different panels so that all the data can be seen at once.

## Conclusion

Vector plots – using either raw data or change-scores – and path plots provide novel graphical techniques for visualising how individual patients or subjects change over time on multiple variables. These plots are useful for comparing groups on two or more variables, detecting multivariate outliers, and detecting subgroups of patients that have different disease trajectories. They are a useful addition to standard graphical exploratory data analysis methods and can be used to gain new insights into longitudinal data and thus the natural progression of many conditions, as well as how treatments affect disease trajectories.

## Declarations

### Acknowledgements

SEL is supported by a Cancer Research UK bursary and the Cambridge Commonwealth Trust. AWM was supported through a PDS Clinical Fellowship. RAB is supported by grants from the MRC, PDS, Euro-HD, and an NIHR Biomedical Research Centre Award to Addenbrooke's Hospital Trust. We would also like to thank Prof. Brian Everitt for permission to reproduce the bivariate IQR R function.

## Authors’ Affiliations

## References

- Matthews JN, Altman DG, Campbell MJ, Royston P: Analysis of serial measurements in medical research. BMJ. 1990, 300 (6719): 230-235. 10.1136/bmj.300.6719.230.View ArticlePubMedPubMed CentralGoogle Scholar
- Pinheiro JC, Bates DM: Mixed-Effects Models in S and S-Plus. 2000, London: SpringerView ArticleGoogle Scholar
- Ramsay JO, Silverman BW: Applied Functional Data Analysis: Methods and Case Studies. 2002, New York, NY: SpringerView ArticleGoogle Scholar
- Ramsay JO, Silverman BW: Functional Data Analysis. 2006, New York, NY: Springer, 2Google Scholar
- Hastie T, Tibshirani R, Friedman J: The Elements of Statistical Learning. 2001, New York: SpringerView ArticleGoogle Scholar
- Everitt B: An R and S-Plus Companion to Multivariate Analysis. 2005, London: SpringerView ArticleGoogle Scholar
- Pavese N, Gerhard A, Tai YF, Ho AK, Turkheimer F, Barker RA, Brooks DJ, Piccini P: Microglial activation correlates with severity in Huntington disease: a clinical and PET study. Neurology. 2006, 66 (11): 1638-1643. 10.1212/01.wnl.0000222734.56412.17.View ArticlePubMedGoogle Scholar
- Reuter I, Tai YF, Pavese N, Chaudhuri KR, Mason S, Polkey CE, Clough C, Brooks DJ, Barker RA, Piccini P: Long-term clinical and positron emission tomography outcome of fetal striatal transplantation in Huntington's disease. J Neurol Neurosurg Psychiatry. 2008, 79 (8): 948-951. 10.1136/jnnp.2007.142380.View ArticlePubMedGoogle Scholar
- Politis M, Pavese N, Tai YF, Tabrizi SJ, Barker RA, Piccini P: Hypothalamic involvement in Huntington's disease: an in vivo PET study. Brain. 2008Google Scholar
- Tukey JW: Exploratory Data Analysis. 1977, Addison-WesleyGoogle Scholar
- Cleveland WS: Visualizing Data. 1993, Summit, NJ: Hobart PressGoogle Scholar
- Cleveland WS: The Elements of Graphing Data. 1994, New Jersey: Hobart Press, revised editionGoogle Scholar
- Cook D, Swayne DF: Interactive and Dynamic Graphics for Data Analysis. 2007, New York, NY: Springer, [http://www.ggobi.org]View ArticleGoogle Scholar
- Fayyad U, Grinstein GG, Wierse A, Eds: Information Visualization in Data Mining and Knowledge Discovery. 2002, London: Academic PressGoogle Scholar
- Michell AW, Goodman AO, Silva AH, Lazic SE, Morton AJ, Barker RA: Hand tapping: a simple, reproducible, objective marker of motor dysfunction in Huntington's disease. J Neurol. 2008, 255 (8): 1145-1152. 10.1007/s00415-008-0859-x.View ArticlePubMedGoogle Scholar
- Gazzaniga MS, Ivry RB, Mangun GR: Cognitive Neuroscience: The Biology of the Mind. 1998, New York, NY: W. W. Norton & CompanyGoogle Scholar
- Brown CG, McGuire DB, Beck SL, Peterson DE, Mooney KH: Visual graphical analysis: a technique to investigate symptom trajectories over time. Nurs Res. 2007, 56 (3): 195-201. 10.1097/01.NNR.0000270029.82736.5a.View ArticlePubMedGoogle Scholar
- Michell AW, Lewis SJG, Foltynie T, Barker RA: Biomarkers and Parkinson's disease. Brain. 2004, 127 (Pt 8): 1693-1705. 10.1093/brain/awh198.View ArticlePubMedGoogle Scholar
- Ho LW, Carmichael J, Swartz J, Wyttenbach A, Rankin J, Rubinsztein DC: The molecular biology of Huntington's disease. Psychol Med. 2001, 31: 3-14. 10.1017/S0033291799002871.View ArticlePubMedGoogle Scholar
- Unified Huntington's Disease Rating Scale: reliability and consistency. Huntington Study Group. Mov Disord. 1996, 11 (2): 136-142. 10.1002/mds.870110204.
- Rosser AE, Barker RA, Harrower T, Watts C, Farrington M, Ho AK, Burnstein RM, Menon DK, Gillard JH, Pickard J, Dunnett SB: NEST-UK: Unilateral transplantation of human primary fetal tissue in four patients with Huntington's disease: NEST-UK safety report ISRCTN no 36485475. J Neurol Neurosurg Psychiatry. 2002, 73 (6): 678-685. 10.1136/jnnp.73.6.678.View ArticlePubMedPubMed CentralGoogle Scholar
- Ihaka R, Gentleman R: R: a language for data analysis and graphics. J Comput Graph Stat. 1996, 5: 299-314. 10.2307/1390807.Google Scholar
- R Development Core Team: R: A Language and Environment for Statistical Computing. 2008, R Foundation for Statistical Computing, Vienna, Austria, [http://www.r-project.org]Google Scholar
- Venables WN, Ripley BD: Modern Applied Statistics with S. 2002, New York: Springer, 4View ArticleGoogle Scholar
- Crawley MJ: The R Book. 2007, Chichester: WileyView ArticleGoogle Scholar
- Murrell P: R Graphics. 2005, Boca Raton, FL: Chapman & Hall/CRCView ArticleGoogle Scholar
- Tufte ER: The Visual Display of Quantitative Information. 2001, Cheshire, CT: Graphics Press, 2Google Scholar
- Byth K, Cox DR: On the relation between initial value and slope. Biostatistics. 2005, 6 (3): 395-403. 10.1093/biostatistics/kxi017.View ArticlePubMedGoogle Scholar
- Adler D, Murdoch D: rgl: 3D visualization device system (OpenGL). 2008, [http://rgl.neoscientists.org]Google Scholar
- Wilkinson L: The Grammar of Graphics. 2005, New York, NY: Springer, 2Google Scholar
- Foltynie T, Brayne C, Barker RA: The heterogeneity of idiopathic Parkinson's disease. J Neurol. 2002, 249 (2): 138-145. 10.1007/PL00007856.View ArticlePubMedGoogle Scholar
- Lewis SJG, Foltynie T, Blackwell AD, Robbins TW, Owen AM, Barker RA: Heterogeneity of Parkinson's disease in the early clinical stages using a data driven approach. J Neurol Neurosurg Psychiatry. 2005, 76 (3): 343-348. 10.1136/jnnp.2003.033530.View ArticlePubMedPubMed CentralGoogle Scholar
- Goris A, Williams-Gray CH, Clark GR, Foltynie T, Lewis SJG, Brown J, Ban M, Spillantini MG, Compston A, Burn DJ, Chinnery PF, Barker RA, Sawcer SJ: Tau and alpha-synuclein in susceptibility to, and dementia in, Parkinson's disease. Ann Neurol. 2007, 62 (2): 145-153. 10.1002/ana.21192.View ArticlePubMedGoogle Scholar
- Williams-Gray CH, Foltynie T, Brayne CEG, Robbins TW, Barker RA: Evolution of cognitive dysfunction in an incident Parkinson's disease cohort. Brain. 2007, 130 (Pt 7): 1787-1798. 10.1093/brain/awm111.View ArticlePubMedGoogle Scholar
- Ali FR, Michell AW, Barker RA, Carpenter RHS: The use of quantitative oculometry in the assessment of Huntington's disease. Exp Brain Res. 2006, 169 (2): 237-245. 10.1007/s00221-005-0143-6.View ArticlePubMedGoogle Scholar
- Lazic SE, Goodman AO, Grote HE, Blakemore C, Morton AJ, Hannan AJ, van Dellen A, Barker RA: Olfactory abnormalities in Huntington's disease: decreased plasticity in the primary olfactory cortex of R6/1 transgenic mice and reduced olfactory discrimination in patients. Brain Res. 2007, 1151: 219-226. 10.1016/j.brainres.2007.03.018.View ArticlePubMedGoogle Scholar
- Henley SMD, Frost C, MacManus DG, Warner TT, Fox NC, Tabrizi SJ: Increased rate of whole-brain atrophy over 6 months in early Huntington disease. Neurology. 2006, 67 (4): 694-696. 10.1212/01.wnl.0000230149.36635.c8.View ArticlePubMedGoogle Scholar
- Bland JM, Altman DG: Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986, 1 (8476): 307-310.View ArticlePubMedGoogle Scholar
- Bland JM, Altman DG: Measuring agreement in method comparison studies. Stat Methods Med Res. 1999, 8 (2): 135-160. 10.1191/096228099673819272.View ArticlePubMedGoogle Scholar
- Bland JM, Altman DG: Applying the right statistics: analyses of measurement studies. Ultrasound Obstet Gynecol. 2003, 22: 85-93. 10.1002/uog.122.View ArticlePubMedGoogle Scholar
- Bland JM, Altman DG: Agreement between methods of measurement with multiple observations per individual. J Biopharm Stat. 2007, 17 (4): 571-582. 10.1080/10543400701329422.View ArticlePubMedGoogle Scholar
- Pahwa R, Lyons KE, Koller WC: Handbook of Parkinson's Disease, Differential Diagnosis of Parkinsonism. Edited by: Sethi KD. 2003, New York, NY: Informa Health Care, chap 3: 43-70. 3Google Scholar
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2288/9/32/prepub

### Pre-publication history

## Copyright

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.