DAFS-BR Validation
Abstract
The Direct Assessment of Functional Status-Revised (DAFS-R) is an instrument developed to objectively measure functional capacities
required for independent living. The objective of this study was to translate and culturally adapt the DAFS-R for Brazilian Portuguese
(DAFS-BR) and to evaluate its reliability and validity. The DAFS-BR was administered to 89 older adults previously classified as
normal controls or as having mild cognitive impairment (MCI) or Alzheimer’s disease (AD). The results indicated good internal consistency
(Cronbach’s α = 0.78) in the total sample. The DAFS-BR showed high interobserver reliability (0.996; p < .001) as well as test–retest
stability over a 1-week interval (0.995; p < .001). Correlation between the DAFS-BR total score and the Informant Questionnaire on Cognitive
Decline in the Elderly (IQCODE) was moderate and significant (r = −.65, p < .001) in the total sample, whereas it did not reach statistical
significance within each diagnostic group. Receiver operating characteristic curve analyses suggested that DAFS-BR has good sensitivity and
specificity to identify MCI and AD. Results suggest that DAFS-BR can document degrees of severity of functional impairment among
Brazilian older adults.
Keywords: Functional status; Cross-cultural adaptation; Alzheimer’s disease; MCI; Reliability; Validity
Introduction
Functional impairment is part of the criteria for the diagnosis of dementia syndromes. According to different international
criteria available for the diagnosis of dementia, documentation of functional decline based on subjective or objective assessment
is required. Assessing functionality is also important to determine the level of assistance and supervision necessary for the
patient to continue to perform activities in his/her environment. More recently, with the growing interest in the detection of
preclinical dementia and the diagnosis of Mild Cognitive Impairment (MCI; Petersen et al., 1999), the establishment of
intact functioning has become even more relevant.
Most instruments used to assess functional status of older adults in clinical and research settings focus on information given
by a family member or caregiver. These measurements can under- or overestimate functional decline because they are susceptible
to potential reporter biases such as mood, personality, or burnout (Loewenstein et al., 2001; Onor, Trevisiol, Negro, &
Aguglia, 2006; Tierney, Szalai, Snow, & Fisher, 1996). Therefore, family members’ and nursing staff’s perceptions may be
incongruent in the estimation of the patients’ true abilities (Lukovitz & McDaniel, 1992). An alternative way to assess functioning
in everyday life is to obtain objective information using performance-based tests.
The Direct Assessment of Functional Status-Revised (DAFS-R) for older adults is an instrument developed to directly
evaluate a broad array of functional capacities required for independent living in older patients with and without cognitive
impairment. DAFS-R significantly discriminated patients with Alzheimer’s disease (AD) from normal elderly and from a
© The Author 2010. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: [email protected].
doi:10.1093/arclin/acq029
336 F.S. Pereira et al. / Archives of Clinical Neuropsychology 25 (2010) 335–343
group of elderly outpatients with major depression (Loewenstein et al., 1989). In this article, the authors demonstrated that
DAFS-R is strongly associated with functional measures, based on caregivers’ reports, and it can adequately discriminate
patients who are perceived by clinicians and family members as having deficits in five instrumental activities of daily
living (driving, telling time, remembering lists, and basic and complex financial tasks) from those who do not show such deficits.
It was also sensitive to functional decline after 1 year, and it was useful to establish longitudinal patterns of deterioration
(Loewenstein et al., 1995).
The American version of DAFS-R objectively tests seven domains: Time Orientation, Communication skills, Dealing with
Finances, Shopping skills, Grooming skills, Eating skills, and Driving skills. Reliability and convergent and discriminative validity
have been established by Loewenstein and associates (Loewenstein et al., 1989; Loewenstein & Bates, 2006) in
English-speaking populations. The scale has also been translated into six different languages and has had its validity tested
in different cultures.
Similar functional assessment tools have been developed by occupational therapists, with published manuals and research
evidence on such instruments (Brown, 2009). The Bay Area Functional Performance Evaluation, for instance, uses tasks to
determine a patient’s ability to participate in goal-directed activity and to demonstrate appropriate behaviors (Houston,
Williams, Bloomer, & Mann, 1989). Another similar assessment is the performance assessment of self-care skills (Holm &
Rogers, 2008), a performance-based observational tool which consists of 26 tasks, 5 functional mobility tasks, 3 personal
Eighty-nine older adults were selected to participate (32 cognitively unimpaired, 31 patients with MCI, and 26 with AD).
These individuals are participants in a prospective study on cognitive aging and AD under way at a university-based memory
clinic. They were recruited from community sources, including patients who spontaneously sought assessment for
memory complaints, community-dwelling older adults invited through radio advertisements, and patients referred from
other clinics for the assessment of suspected cognitive decline. All patients undergo regular clinical and neuropsychological
evaluations and have been diagnosed by a team of psychiatrists, neuropsychologists, and geriatricians in consensus meetings
as AD, MCI, or normal controls (NC). Dementia was diagnosed according to the DSM-IV criteria (American Psychiatric
Association, 1994); AD was diagnosed according to the NINCDS-ADRDA criteria (McKhann et al., 1984). Diagnosis of MCI
was made according to Petersen’s criteria (Petersen et al., 1999). Potential participants with other neurological or psychiatric
disorders, other types of dementia, or clinically significant medical conditions (cardiovascular disease, diabetes and hypertension
not adequately compensated by medication, and severe sensory limitations) were excluded from the study and referred to
other clinics. This research project was conducted in accordance with the Declaration of Helsinki and was approved by the
hospital ethics committee; all participants signed the approved informed consent form. The sample included in the
present study was randomly selected within each diagnostic group, and study participants were being followed up regularly
at the memory clinic.
Mental state examination was performed with the Brazilian version of the Cambridge Examination for Mental Disorders in
the Elderly (CAMDEX) semi-structured interview (Bottino et al., 1999; Roth et al., 1986; Nunes et al., 2008), which yields the
scores for the Cambridge Cognitive Test (CAMCOG), the Abbreviated Mental Test (Roth & Hopkins, 1953), the Mini-Mental
State Examination (MMSE; Brucki, Nitrini, Caramelli, Bertolucci, & Okamoto, 2003; Folstein, Folstein, & McHugh, 1975), and
the Hachinski Ischemic Score (Graham et al., 1997). The Clock Drawing Test, which is part of the CAMCOG schedule, was
additionally scored according to Sunderland’s criteria (Sunderland, Hill, & Mellow, 1989). The 21-item Hamilton
Depression Scale (HAM-D) (Hamilton, 1960) was administered to rule out depressive symptomatology.
A complete neuropsychological examination was conducted by trained psychologists and the results were used for
diagnostic purposes. A description of this protocol may be found in previous studies describing this sample (Diniz et al.,
2008; Nunes et al., 2008; Pereira, Yassuda, Oliveira, & Forlenza, 2008). Clinical assessment (including the application of
the CAMCOG and HAM-D) and neuropsychological testing occurred on two separate days and each lasted approximately
90 min. DAFS-BR was completed on a third consecutive visit.
DAFS-R assesses seven different domains of functional abilities by requiring the participant to carry out different tasks. The
In the present study, the methodological steps recommended by internationally recognized publications for the cultural
adaptation of psychometric instruments were followed (Guillemin, 1995). Permission to adapt the DAFS-R to Brazilian
Portuguese and to use it for research purposes was received from its authors. A multidisciplinary panel composed of geriatric
psychiatrists and neuropsychologists was involved in the cultural adaptation.
The American version of DAFS-R was translated into Portuguese by two certified translators not affiliated with the study. After
translation, these two versions were compared and adapted by the panel of specialists, who created a single version in Brazilian
Portuguese (version 1). Version 1 was then back-translated into English by a third certified translator not familiar with the study,
yielding a new English version (version 2). The original and version 2 were compared by the multidisciplinary panel
and found to be equivalent, as there were very few linguistic discrepancies between them.
Version 1 was then revised by the panel in order to adapt the scale stimuli taking into account Brazilian culture, generating
the revised Brazilian version (version 3). All modifications made are reported in the Appendix. The authors attempted to keep
the Brazilian version as similar as possible to the American one, so that cognitive demands would remain comparable. In
this regard, the authors decided to retain the original seven-digit telephone numbers instead of the eight-digit numbers currently used
in Brazil. The authors felt that in some domains maintaining task demands was more important than cultural adjustment.
Pilot testing was then carried out in a group of 10 randomly selected patients from the larger sample of the cognitive study.
Problems related to task comprehension were not identified. However, the researchers soon realized that most participants in the sample had
difficulty interpreting road signs, because they had never driven or had not driven for many years. This was true for a
significant number of participants in the total study sample. On the basis of this information, version 3 was modified and the
Statistical Analysis
Statistical analyses used SPSS for Windows, version 14.0 (SPSS Inc., Chicago, IL, USA). The means for sociodemographic
and clinical variables for the three groups were compared by means of ANOVAs, and pairwise comparisons
were done with post hoc Tukey tests. Parametric tests were used because all variables followed a normal distribution. Internal
consistency was calculated using Cronbach’s α. Intraclass correlation coefficients (ICCs) were used to assess stability over time
and reliability between observers. Spearman’s ρ correlation was used to assess the association between the DAFS-BR and the
IQCODE, due to a lack of normal distribution within diagnostic groups. Receiver operating characteristic (ROC) curves were
used to estimate the best DAFS-BR cut-off scores to discriminate diagnostic groups, considering consensus diagnosis as the
gold standard. The critical p-value for statistical significance was set at p = .05.
Results
Sociodemographic information about this sample has been presented in an earlier study (Pereira et al., 2008). The sample
was predominantly female (75% of controls, 74% of MCI, and 58% of AD), with a mean age of 73.8 (±6.7) years and a mean
of 10.3 (±6.0) years of education. AD patients were older than controls and MCI patients (77.9 ± 6.0, 71.6 ± 5.6, and 72.6 ± 7.0
years of age, respectively, p = .001), although the difference between the latter two groups was not significant (p = .915; post
hoc Tukey test). The current sample included no illiterate participants; although controls were slightly more educated than MCI and
AD patients (13.2 ± 6.0, 8.5 ± 5.5, and 8.8 ± 5.5 years of formal schooling, p = .002), all subjects in the sample had completed at least
elementary school.
Table 1 reports mean scores for cognitive and functional variables. Results indicate that the diagnostic groups differed
on all instruments, with the exception of the MMSE, on which NC and MCI had equivalent mean scores.
Table 1. Mean test scores (SD) for NC and patients with MCI and AD
NC (n = 32)    MCI (n = 31)    AD (n = 26)    p-value^a
Table 2. Mean DAFS-BR subitem scores (SD) for NC and patients with MCI and AD
Subdomain (score range)         NC            MCI             AD              p-value
Time Orientation (0–16)         15.2 (2.7)    15.6 (1.0)      9.7 (4.8)^a     <.001
Communication skills (0–15)     14.1 (2.5)    13.6 (1.4)      9.6 (3.3)^a     <.001
Dealing with Finances (0–32)    27.4 (4.5)    21.3 (5.3)^b    13.8 (6.5)^a    <.001
Shopping skills (0–20)          16.6 (1.2)    14.5 (2.3)^b    7.1 (2.9)^a     <.001
Grooming skills (0–13)          12.6 (2.0)    12.7 (0.7)      11.1 (2.8)^a    .007
Eating skills (0–10)            10.0 (0.0)    10.0 (0.0)      9.9 (0.5)^a     .014
Notes: NC = normal controls; MCI = Mild Cognitive Impairment; AD = Alzheimer’s disease; DAFS-BR = Direct Assessment of Functional Status-Brazilian
version. p-values refer to ANOVAs. The post hoc Tukey test was used for pairwise comparisons of DAFS-BR subscores.
^a AD significantly different from controls and MCI.
^b MCI significantly different from controls.
Notes: ICC = intraclass correlation coefficient; DAFS-BR = Direct Assessment of Functional Status-Brazilian version. For all subdomains, interobserver and
test–retest reliability p-values were <.001.
Table 2 reports the mean scores for the DAFS-BR subitems. AD patients had significantly lower performance in Time
Orientation and Communication skills when compared with MCI and NC participants. Dealing with Finances and
Shopping skills differentiated the three groups. For Grooming and Eating skills, no significant differences were found
among the three groups.
Interobserver reliability and test–retest stability over time were high for all six subdomains and for total scores, and they are
presented in Table 3. For Grooming and Eating skills, test–retest agreement and interobserver agreement were 100%.
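As an illustration of how an intraclass correlation of this kind can be computed, the sketch below implements one common form, ICC(2,1) (two-way random effects, absolute agreement, single rating). The article does not state which ICC form was requested in SPSS, so the choice of form, the function name `icc_2_1`, and the toy ratings are assumptions made for illustration only; they are not study data.

```python
def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rating.

    ratings: list of rows, one row per participant and one column per
    rater (interobserver) or per session (test-retest).
    Assumption: this ICC form is chosen for illustration; the article
    does not say which form was used.
    """
    n = len(ratings)          # number of participants
    k = len(ratings[0])       # number of raters or sessions
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares without replication
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)                # between-participant mean square
    msc = ss_cols / (k - 1)                # between-rater mean square
    mse = ss_err / ((n - 1) * (k - 1))     # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Hypothetical total scores from two observers for five participants
toy_ratings = [[82, 83], [75, 75], [60, 61], [91, 90], [55, 55]]
print(round(icc_2_1(toy_ratings), 3))
```

Values close to 1 arise when almost all of the score variance lies between participants rather than between raters or sessions, which is the pattern the reported coefficients describe.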
The instrument was found to have high internal consistency (Cronbach’s α = .78). The evaluation of each subdomain
revealed that Grooming and Eating skills had the lowest correlations with the other subdomains and the total score, probably
due to the fact that the tasks involved in these domains are overlearned, performed automatically, and recruit limited cognitive
skills. Removing the domain with the lowest correlation (Eating skills) improved internal consistency slightly (Cronbach’s α =
.80). The correlations for each subdomain and the total score were: Time Orientation, r = .76; Communication skills, r = .79;
Dealing with Finances, r = .91; Shopping skills, r = .87; Grooming skills, r = .47; Eating skills, r = .41.
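The internal-consistency figures above follow the standard Cronbach formula, α = k/(k − 1) × (1 − Σ item variances / variance of the total score), where k is the number of subdomains. The sketch below is a minimal illustration of that formula and of the "α if a subdomain is removed" check; the function names and the toy scores are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a participants-by-items score matrix.

    scores: list of rows, one row per participant, one column per
    subdomain. Uses sample variances (n - 1 denominator).
    """
    k = len(scores[0])
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def alpha_if_deleted(scores, j):
    """Alpha recomputed after dropping column j."""
    reduced = [row[:j] + row[j + 1:] for row in scores]
    return cronbach_alpha(reduced)


# Hypothetical toy data: the third column is at ceiling for everyone,
# loosely mimicking a subdomain with no variability.
toy_scores = [
    [4.0, 5.0, 3.0],
    [2.0, 3.0, 3.0],
    [5.0, 5.0, 3.0],
    [1.0, 2.0, 3.0],
]
print(round(cronbach_alpha(toy_scores), 3))
print(round(alpha_if_deleted(toy_scores, 2), 3))
```

In the toy data, dropping the column with no variance raises α, which is the same direction of change the text reports when Eating skills is removed.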
The evaluation of convergent validity was carried out by comparing scores for the DAFS-BR and the IQCODE. In the total
sample, Pearson’s correlations indicated that DAFS-BR scores were moderately but significantly correlated with the IQCODE
scores (r = −.65, p < .001). However, Spearman’s correlations (due to a lack of normal distribution within groups) were not
significant when each diagnostic group was analyzed separately (for NC r = −.25, p = .17; MCI r = −.32, p = .12; and AD
r = −.39, p = .13).
To evaluate the diagnostic accuracy of the DAFS-BR, ROC curve analyses were performed to compare pairs of diagnostic
groups (NC × MCI and NC × AD). Results presented in Table 4 indicate that DAFS-BR has excellent accuracy to discriminate
NC from AD and lower but still high sensitivity and specificity to separate MCI from NC (Figs. 1 and 2).
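To make the ROC-based cut-off selection concrete, the sketch below scans candidate cut-offs and keeps the one that maximizes Youden's J (sensitivity + specificity − 1), and also computes the area under the curve as the proportion of control-patient pairs ranked correctly. The article does not state which criterion defined the "best" cut-off, so Youden's J is an assumption here, as are the function names and the toy scores. Lower scores are treated as indicating impairment, in line with the lower mean scores of the AD group in Table 2.

```python
def best_cutoff(controls, patients):
    """Return (cutoff, sensitivity, specificity) maximizing Youden's J.

    A score <= cutoff is classified as impaired. Youden's J is an
    assumed criterion; the article does not name the one it used.
    """
    best = None
    for c in sorted(set(controls) | set(patients)):
        sens = sum(s <= c for s in patients) / len(patients)
        spec = sum(s > c for s in controls) / len(controls)
        j = sens + spec - 1
        if best is None or j > best[0]:
            best = (j, c, sens, spec)
    return best[1], best[2], best[3]


def auc(controls, patients):
    """Area under the ROC curve via pairwise comparison (ties count 0.5)."""
    wins = sum((c > p) + 0.5 * (c == p) for c in controls for p in patients)
    return wins / (len(controls) * len(patients))


# Hypothetical total scores, not study data
toy_controls = [95, 92, 90, 88, 85, 83]
toy_patients = [86, 80, 78, 75, 70, 60]

cutoff, sens, spec = best_cutoff(toy_controls, toy_patients)
print(cutoff, round(sens, 2), round(spec, 2), round(auc(toy_controls, toy_patients), 2))
```

Other criteria, such as fixing a minimum sensitivity for screening purposes, would generally select a different cut-off from the same curve.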
Discussion
The purpose of this study was to translate and culturally adapt the revised version of the DAFS-R into Brazilian Portuguese
(DAFS-BR) and to evaluate its reliability and validity, so that it can be used for the evaluation of functional performance in
Table 4. Summary of the ROC analyses with cut-off scores for NC × AD and NC × MCI
Brazil. In clinical practice to date, available measures of functionality are based on reports offered by family members,
caregivers, or patients.
Present results suggest that DAFS-BR maintains its original psychometric characteristics. Reliability measures indicated
excellent stability for all subitems and for the total scores of DAFS-BR. In spite of the linguistic and cultural differences,
ICCs were high and similar to those reported in the original study (κ-values for inter-rater agreement = .90 and κ-values
for test–retest stability over time = .89; Loewenstein et al., 1989) and for the German version after cross-cultural adaptation
(inter-rater agreement r = .97; test–retest stability over time r = .98; construct validity r = .86; Hochrein et al., 1996).
Additionally, internal consistency measures demonstrated that the subdomains seem to address the same underlying dimension.
The present study is also one of the few available in the literature to compare AD, MCI, and NC on objective measures of
functional status (Pereira et al., 2008; Wadley, Okonkwo, Crowe, & Ross-Meadows, 2008). On a related note, Griffith and
colleagues (2003) also assessed financial capacity in patients with MCI using a direct approach and found that this group
demonstrates impairment across a range of financial abilities. Other authors have also suggested that diagnostic criteria for
MCI should specify mild functional deficits due to the imprecision observed during the execution of complex tasks
(Giovannetti et al., 2008). Older adults diagnosed with MCI may show subtle impairment in aspects of functionality which
require complex cognitive processing, such as in DAFS-BR Dealing with Finances and Shopping skills. Previous studies
have documented that memory and executive functions play an important role in instrumental activities of daily living
(e.g., Schmitter-Edgecombe, Woo, & Greeley, 2009). Therefore, it is plausible that MCI patients present subtle functional
limitations due to cognitive deficits. Present results also suggest that some DAFS-BR subdomains seem more sensitive to cognitive
decline.
The evaluation of convergent validity revealed a statistically significant correlation between the DAFS-BR and the
IQCODE for the total sample. Yet, within diagnostic groups, correlations were not significant. This result seems to
suggest that there may be a fair amount of disagreement between objective and subjective evaluations of functionality,
and the level of agreement may be influenced by the degree of cognitive impairment. Disagreement between the two instruments
is higher when cognitively unimpaired older adults are evaluated on functional status, perhaps because subtle
Funding
The present work was supported by Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP, Project 02/13633-7).
The Laboratory of Neuroscience (LIM-27) receives financial support from Associação Beneficente Alzira Denise Hertzog da
Silva (ABADHS).
Conflict of Interest
None declared.
References
Abreu, I. D., Nunes, P. V., Diniz, B. S., & Forlenza, O. V. (2008). Combining functional scales and cognitive tests in screening for mild cognitive impairment at
a university-based memory clinic in Brazil. Revista Brasileira de Psiquiatria, 30(4), 346–349.
Allaire, J. C., Gamaldo, A., Ayotte, B. J., Sims, R., & Whitfield, K. (2009). Mild cognitive impairment and objective instrumental everyday functioning: The
everyday cognition battery memory test. Journal of the American Geriatrics Society, 57(1), 120–125.
American Psychiatric Association. (1994). Diagnostic and statistical manual of mental disorders (4th ed.). Washington, DC: Author.
Bottino, C. M. C., Almeida, O. P., Tamai, S., Forlenza, O. V., Scalco, M. Z., & Carvalho, I. A. M. (1999). Entrevista Estruturada para Diagnóstico de
Transtornos Mentais em Idosos. PROTER (Structured Interview for the Diagnosis of Mental Disorders in the Elderly), Instituto de Psiquiatria do
Hospital das Clínicas da Faculdade de Medicina da USP.
Brown, C. (2009). Functional assessment and intervention in occupational therapy. Psychiatric Rehabilitation Journal, 32(3), 162–170.
Brucki, S. M. D., Nitrini, R., Caramelli, P., Bertolucci, P. H. F., & Okamoto, I. H. (2003). Sugestões para o uso do Mini-Exame do Estado Mental no Brasil
(Suggestions for the use of the Mini Mental State Examination). Arquivos de Neuro-Psiquiatria, 61(3-B), 777–781.
Desai, A. K., Grossberg, G. T., & Sheth, D. N. (2004). Activities of daily living in patients with dementia: Clinical relevance, methods of assessment and effects
of treatment. CNS Drugs, 18, 853–875.
Diniz, B. S., Nunes, P. V., Yassuda, M. S., Pereira, F. S., Flaks, M. K., Viola, L. F., et al. (2008). MCI: Cognitive screening or neuropsychological assessment?
Revista Brasileira de Psiquiatria, 30(4), 316–321.
Folstein, M. F., Folstein, S. E., & McHugh, P. R. (1975). Mini-Mental State: A practical method for grading the cognitive state of patients for the clinician.
Journal of Psychiatric Research, 12, 189–198.
Giovannetti, T., Bettcher, B. M., Brennan, L., Libon, D. J., Kessler, R. K., & Duey, K. (2008). Performance-based analysis of everyday action performance in
mild cognitive impairment. Dementia and Geriatric Cognitive Disorders, 25, 359–365.
Graham, J. E., Rockwood, K., Beattie, B. L., Eastwood, R., Gauthier, S., Tuokko, H., et al. (1997). Prevalence and severity of cognitive impairment with and
without dementia in an elderly population. Lancet, 349, 1793–1796.
Griffith, H. R., Belue, K., Krzywanski, S., Zamrini, E., Harrell, L., & Marson, D. C. (2003). Impaired financial abilities in mild cognitive impairment: A direct
assessment approach. Neurology, 60, 449–457.
Guillemin, F. (1995). Cross-cultural adaptation and validation of health status measures. Scandinavian Journal of Rheumatology, 24(2), 61–63.
Hamilton, M. (1960). A rating scale for depression. Journal of Neurology, Neurosurgery and Psychiatry, 23, 56–62.
Hammond, A. (1996). Functional and health assessments used in rheumatology occupational therapy: A review and United Kingdom survey. British Journal of
Occupational Therapy, 59(6), 254–259.
Hochrein, A., Jonitz, L., Hock, L., Bell, V., Plaum, E., & Engel, R. R. (1996). Quantification of dementia-related disabilities in daily behavior with the DAFS
(Direct Assessment of Functional Status): Reliability and validity of a German test version. Zeitschrift für Gerontologie und Geriatrie, 29(3), 216–222.
Holm, M. B., & Rogers, J. C. (2008). Performance assessment of self-care skills. In B. Hemphill-Pearson (Ed.), Assessments in occupational therapy mental
health: An integrative approach (2nd ed., pp. 101–112). Thorofare, NJ: Slack Incorporated.
Houston, D., Williams, S. L., Bloomer, J., & Mann, W. C. (1989). The Bay Area Functional Performance Evaluation: Development and standardization.
American Journal of Occupational Therapy, 43(3), 170–183.
Jorm, A. F., & Jacomb, P. A. (1989). The Informant Questionnaire on Cognitive Decline in the Elderly: Sociodemographic correlates, reliability, validity and
some norms. Psychological Medicine, 19, 1015–1022.
Loewenstein, D. A., Amigo, E., & Duara, R. (1989). A new scale for the assessment of functional status in Alzheimer’s disease and related disorders. Journals
of Gerontology, 4, 114–121.
Loewenstein, D. A., Argüelles, S., Bravo, M., Freeman, R. Q., Argüelles, T., Acevedo, A., et al. (2001). Caregivers’ judgments of the functional abilities of the