key: cord-0903404-8am53cm3 authors: Basannar, D. R.; Singh, Sumeet; Yadav, Jyoti; Yadav, Arun Kumar title: Quantifying Age Heaping and Age Misreporting in a Multicentric Survey date: 2022-03-16 journal: Indian J Community Med DOI: 10.4103/ijcm.ijcm_1179_21 sha: de7866dc2fa67d534ea616367547b9c2de953629 doc_id: 903404 cord_uid: 8am53cm3 INTRODUCTION: Demographic indices known as the age-heaping indexes were used to explore the patterns of age misreporting in a multicentric survey. METHODS: The data of 3252 individuals were analyzed, and measurement of errors in age for the sampled data has been evaluated by Whipple's Index (WI), Myer's Blended Index, and United Nations Age–Sex Accuracy Score which comprises Sex Ratio Score, Male Age Ratio Score (ARS), and Female ARS. RESULTS: Out of total 3252 participants, 828 (25.5%) were female. The mean statistical division age of our population was 34 (8.5) years and ranged from 15 to 65 years. The percentage of female ages ending with digits 0 or 5 is 23.55% and percentage of male ages ending with digits 0 or 5 is 23.28%. The calculated WI was 117.75 and 116.34 for males and females, respectively. The calculated Myer's Index for females and males is 10.53 and 25, respectively. CONCLUSION: The study provides evidence that probably age-heaping bias is less of problem in the conducted study. A large number of multicentric surveys are being continuously conducted among Armed Forces personnel, for which the data on age are regularly collected. Armed Forces personnel differ in many ways from the census population of India in being having gone through mandatory criteria of qualification, rigorous training, and specialized military education. As of now, there is no study in the Armed Forces population to estimate age heaping. This would have an implication on all the surveys undertaken as well as future surveys as the study is based upon the randomization on the last digit of age. Keeping in view the importance of age heaping, the present study has been undertaken to assess the level of age-heaping bias among 3252 serving personnel in a study of seroprevalence of COVID antibodies who are deployed in different geographical locations across the country. The study was conducted on the secondary data collected for a multicentric study for serosurvey in Armed Forces. [8] The principal investigator of each center collected the data on the age at each center by asking the completed age in years. Measurement of errors in age for the sampled data has been evaluated by Whipple's Index (WI), Myer's Blended Index, and United Nations Age-Sex Accuracy Score. WI is the simplest and most widely used index and is a measure of age heaping on ages ending in 0 or 5. [9] The Whipple score is calculated by dividing the sum of the populations at single ages by one-fifth of the sum of the populations from age 23-62 and multiplied by 100. The total score ranges from 100 to 500, with a score <105 being highly accurate, and it decreases as the score rises, with >175 being very rough. Symbolically, It is one of the most accurate measure heaping toward certain digits by blending the population in a particular way that each terminal digit has an equal sum, based upon the assumption that the population is equally distributed among the different ages. It is calculated for the age above 10 years and expressed as percentages. The first step is the sum of populations ending in each digit over the whole range starting with the lower limit of the range, for example, 10, 20, 30, 40…; 11, 21, 31, and so on. It is followed by ascertaining the sum excluding the first population combined in the first step, for example, 20, 30, 40.; 21, 31, 41, and so on. The next step is to weigh the sums in steps 1 and 2 and add the results to obtain a blended population. This distribution is then converted into percentages and takes the deviation of each percentage from 10.0, which is the expected value for each percentage. Finally, the summary index of preference for all terminal digits is derived as one-half of the sum of the deviations from 10.0%, each without regard to signs. The combined score of Myer's Index ranges from 0 to 180, whereas for individual digit, it varies from 0 to 90. Accuracy is inversely proportional to the score. The lower the score, the higher the accuracy and the deviation indicates the heaping. Symbolically, it is obtained by arranging the matrix of the population in a standard format, as shown in Table 1 . [4] Age Ratio Score Age Ratio Score (ARS) is defined here as the ratio of the population in a given age group to one-third the sum of the population in that age group and in the preceding and following groups, multiplied by 100. It is calculated for age up to 74 years. Symbolically, [ ] 5 a a 5 a-5 5 a 5 a+5 P ×100 ARS for 5 P = 1/ 3 P + P + P When the quality of data is good, there is a gradual change from one age group to another. However, data show fluctuations in the case of poor quality. It is mainly used for comparative analysis in cross-sectional studies. There are no minimum or maximum values, and the weight of three attached to sex ratios is arbitrary based on empirical findings. Symbolically, U. N. Joint Score (JS) =3 (sex reassignment surgery) + Male ARS + Female ARS (FARS). JS of <20 is considered accurate, 20-40 is inaccurate, and over 40 is highly inaccurate. For the study, age and sex data were collated in MS Excel, and all the scores and indexes were calculated in MS Excel itself. The two authors independently calculated all the scores. The discrepancy was resolved with mutual discussion. Out of total 3252 participants, 828 (25.5%) were female. The mean (standard division) age of our population was 34 (8.5) years and ranged from 15 to 65 years. The age-and sex-wise distribution of the population is shown in Figure 1 . This study was conducted to ascertain age-heaping bias in a multicentric study among Armed Forces personnel. We found that as per WI, the data quality is fairly accurate and Myer's Index illustrates low age heaping. As mentioned earlier, Armed Forces personnel differ from the census population, so the gradual change in ARS is not expected as all study participants are from the selected age group, i.e., from age 18 to 45 years. In comparison with the National Census Report 2011 which showed poor quality of age data due to heaping, this study illustrates better quality of age data which can contribute to more literacy level of participants compared to average literacy rate of India and the meticulous system of data collection in Armed Forces. A study conducted among 4304 people in 823 households in a community survey in the Yavatmal district of Maharashtra state showed that the collected age data are of poor quality with age heaping at ages with terminal digits. [5] One's unawareness of its own age manifests as age-heaping bias. The chances of misreporting of age are rarely stated in the research. Although approximation of the age is common, but this approximation is not random or the individual do not choose age randomly but they have a preference for the digits. With some individual's preference is for the digits 0 and 5, while other having preference for even numbers. Hence, age heaping indicates lack of awareness of age in the society, and a strong correlation has been found with age-heaping bias and illiteracy and can also be used as an indicator of human capital. [10] The misclassification of the age by bias can lead to wrong assessment of denominator in age groups and calculation of wrong epidemiological rates, leading to erroneous planning and policies. In our study population, the accuracy of age data was acceptable. This may be because our population is more educated and has yearly events such as Annual Medical Examination, Periodic Medical Examination, and career-oriented courses where individuals would recall their age. Nevertheless, age heaping is less of problem in survey conducted in this particular study, but it is strongly recommended that checking errors while acquiring age data should be done periodically and the researcher should have an inbuilt system in the data collection tool to minimize age rounding/heaping, especially in the community field surveys where age heaping and misreporting can significantly distort the measures of association/treatment effect. Nil. There are no conflicts of interest. ----------------9 X 9 Y 9 10 0 Z 10 Z 10 /∑Z i P 10 Total* ∑X i ∑Y i ∑Z i ∑P i *Total irrespective of sign Bias, confounding, and interaction: Lions and Tigers, and Bears Research Methods in Health: Investigating Health and Health Services. 4 th ed The anatomy and physiology of clinical research Bias, confounding, and effect modification Age heaping and accuracy of age data collected during a community survey in the yavatmal district, maharashtra Age heaping patterns in Mosaic data Age heaping in recent national censuses Seropositivity of severe acute respiratory syndrome coronavirus 2 infection among healthcare workers of the Armed Forces medical services, India: A multicentric study Quality of age reporting: Extension and application of the modified Whipple's index Quantifying quantitative literacy: Age heaping and the history of human capital