Comparison of text messaging data collection vs face-to-face interviews for public health surveys: a cluster randomized crossover study of care-seeking for childhood pneumonia and diarrhoea in rural China

Michelle Helena van Velthoven; Wei Wang; Qiong Wu; Ye Li; Robert W Scherpbier; Xiaozhen Du; Li Chen; Yanfeng Zhang; Josip Car; Igor Rudan

doi:10.7189/jogh.08.010802

Childhood diarrhoea and pneumonia are the leading infectious causes of death in children under five globally [1]. Coverage of interventions that could prevent many deaths is too low in low- and middle-income countries [2,3]. Information on care-seeking is essential to implement interventions but is limited in these countries where health information systems are weak [4-6]. Household surveys are the primary source of child health indicators in low- and middle-income countries [7]. However, the current face-to-face surveys are costly and time-consuming to undertake and cannot be conducted on very large samples or frequently.

Costs are one of the biggest constraints for designing and implementing household surveys. Costs for field visits include personnel, transport, accommodation, equipment, consumables and other costs. Recruitment and training of interviewers and supervisors is labor-intensive, time-consuming and expensive [8]. Moreover, interviewers can have difficulty accessing households because of poor transportation, caregivers’ unavailability and security concerns. There are difficulties in gaining access to particular areas, for example during the wet season when roads are difficult or impossible to pass [9]. Caregivers often work outside home during day-time or are too busy to be interviewed. Moreover, some cultural customs prevent interviewers from visiting households. For example, in rural China people believe that it is not good for newborns and mothers to be visited by people within the first months after birth [10]. It is usually not feasible to conduct household surveys that include very large samples of participants. However, a large sample size is needed for disaggregated analysis by sex, age and socio-economic position [11] and for obtaining adequate denominators to support coverage measurement when the two-week point prevalence of events is very low [3]. Household surveys cannot often be conducted frequently. Regular household surveys carried out according to minimum standards are required to provide frequent data for programme monitoring [12].

In addition, these surveys have issues with interviewer variance and influence, privacy, confidentially and data quality control [13]. Most household surveys are sample surveys in which representative samples are preselected with each household having a known chance to be selected. In the Chinese study context, a list with names of children is obtained through routine health information systems. A common problem is the low quality of health information systems and lists of names of people are often incomplete and inaccurate [4,5]. Interviewers can introduce bias in household surveys. Demographic and Health Surveys and Multiple Indicator Cluster Surveys have minimum requirements for selecting interviewers. This means that interviewers need at least a high school diploma and cannot directly be involved in the management or provision of health services to avoid potential conflict of interest [7]. In addition, all interviewers are trained according to the survey protocol and evaluated before conducting field work. However, interviewer bias cannot be completely avoided, because interviewers can influence interviews and introduce response bias and socially desired answers. Additionally, concerns have been raised about data quality issues and interviewers fabricating data in low- and middle-income countries [14]. Experiences indicate that interviewers sometimes modify children’s age, for example by transferring children to an age group of over five years, to exclude them from the survey sample and thereby reduce their workload [7].

Self-administered surveys based on mobile phones offer a promising alternative to the current interviewer-administered household surveys [15]. By the end of 2016, there were an estimated 7.5 billion mobile phone subscriptions [16]. Text messaging is popular and user-friendly, because of its immediacy, low costs and non-intrusiveness [17]. Text messaging could be used for real-time data collection at the household level and has the potential for scale-up at low cost in different settings [13]. Text messaging can overcome geographical barriers and be used to contact people who are difficult to reach [18].

However, the mode of data collection can have effects on data quality [19]. While self-administered surveys are not biased by an interviewer and can be completed at a private location, these surveys often have a lower response rate than interviewer-administered surveys [19]. There is a greater risk that respondents misinterpret questions in self-administered surveys and thus these surveys require excellent questionnaire design [20]. Furthermore, text messaging data collection faces several challenges including coverage of networks, access to mobile phones, payment, and usage of text messaging [21].

To inform this study on current evidence for use of text messaging for health data collection, one reviewer searched the English literature (MV). One reviewer (QW) searched Chinese databases (Wanfang Data and China National Knowledge Infrastructure) but found no studies. Relevant search terms ((phon*, mobil* OR mHealth OR “m health” OR m-health OR eHealth OR telemedicine [MeSH]) AND (data OR information OR collect* OR gather* OR obtain* OR monitor* OR data collection [MeSH])) were used. Both English language databases (The Cochrane Central Register of Controlled Trials, PubMed, EMBASE, World Health Organization Global Health Library regional index, PsycINFO, Web of Science, MobileActive and Royal Tropical Institute “KIT” Information Portal mHealth in Low-Resource Settings). This review excluded studies where health workers were using mobile phones to collect information and our studies that were published after the review was conducted [22,23].

A total of 19 studies reported in 21 papers [24-44] evaluated use of text messaging for collection of health-related information, which showed to be acceptable to participants [28,30,33,34,43,44] and easy to use [36, 37] (Table S1, S2 and S3 in Online Supplementary Document). The completion rate varied widely from 15% [41] to 100% [37]. Eight out of 10 studies that compared text messaging with other forms of data collection assessed data equivalence [26,28,34-36,39,41,42]. Data equivalence was high when comparing text messaging to: telephone interviews on influenza vaccination [41]; telephone interviews and health visitor interviews on infant feeding [28]; telephone interviews for 1-week and 1-month recall of low back pain [34]; an Internet survey and paper survey on sexual behaviour [35]; verbal assessment of pain in presence of physician [36]; a baseline Internet survey on usual quantity of drinks [39]; and paper cards on satisfaction of primary care consultation [42]. Data equivalence was low for the 1-year recall of low back pain by telephone interviews [34], and for a baseline Internet survey about the number of drinks [39] compared with data collected via text messaging. However, many studies did not compare data collection methods, used small samples, or had a non-randomized study design. Moreover, few studies took place in a low- or middle-income country [24,33,37]. No studies were found on using text messaging for household data collection in LMIC. We conducted two feasibility studies on the collection of information related to infant feeding via text messaging in China [22,23].

To address these gaps in the literature, build on our previous work and inform the use of text messaging data collection in LMIC, we aimed to assess the use of text messaging for a survey on childhood diarrhoea and pneumonia in rural China by addressing the following objectives. First, to assess for the text messaging survey: (i) the response rate; (ii) differences between responders and non-responders; and (iii) the error proportion. Second, to compare text messaging to the standard face-to-face interview method in terms of: (i) data equivalence; (ii) the amount of information in responses; and (iii) reasons for differences in responses. A cluster randomized design was used as participants could not be randomized on an individual level for organizational reasons.

METHODS

Design

This study was part of a larger project described elsewhere, of which overall aim was to explore the application of mHealth-based collection of information relevant to childhood diarrhoea and pneumonia in rural China [45]. We compared a face-to-face survey (reference standard) to a text messaging survey (novel method). We randomised villages (clusters) into two groups. Participants were caregivers of young children. Participants either completed the face-to-face survey first and then the text messaging survey (group 1) or the other way around (group 2). We used a crossover design to allow assessment of whether differences were mode related.

Study setting and sample

The study took place in Zhao County which is located in Hebei Province, situated in the northern part of the North China Plain with an area of 190 000 km² (for comparison, the size of the United Kingdom is 245 000 km²), bordering the capital Beijing. Zhao County has a total population of 571 000, with 518 000 people (90.7%) living in rural areas. Zhao County covers an area of 675 km² and is located 40 km south of Shijiazhuang City. The socioeconomic development of Zhao County is similar to Hebei Province, similar to the national average. The annual per capita net income was ¥ 6464 (about US$ 1026) for residents in Zhao County, ¥ 5958 (about US$ 946) for residents of Hebei Province, and the national average was ¥ 5919 (about US$ 946) in 2010 [46,47]. The female illiteracy rate is low (3.8%) and the main ethnic group is Han (99.9%) (data from 2010 provided by the Zhao County Statistics Bureau, unpublished). A survey among 1601 caregivers of young children in Zhao County showed that 99.4% of households had at least one mobile phone. Moreover, 61.2% of the households owned computers, with 54.8% having access to Internet (QW, personal communication). Zhao County has four hospitals at county level: (i) a public general hospital; (ii) a public maternal and child health hospital; (iii) a public traditional Chinese medicine hospital; and (iv) a private general hospital. Each of the 16 townships has a public township hospital and all the 281 villages have a village clinic. Village clinics are often privately-owned by village doctors who receive small subsidies from the government for providing public health services. Village doctors live in the communities they serve and have a good relationship with villagers. Village doctors provide primary health care at village level and are trained and supervised by staff at township and county-level. Education and training of village doctors varies, but usually they have at least primary school or junior high school and short basic medical training [48,49]. The study took place in one township, Zhaozhou Township, which has both a semi-urban and rural area and 46 villages.

Village doctors gathered caregivers in their village clinics where interviewers recruited them. Views of village doctors and caregivers on participation in this study are presented elsewhere [50]. Clusters were included if the village doctor was present and willing to participate as without their help caregivers could not be recruited for the study. The participant inclusion criteria were: had a child in the family younger than five years (the youngest child if there was more than one child), have access and able to use a mobile phone, able to text message, and provided consent. Family members, including grandparents, often take care of the children in China and were included to maximize the recruitment rate.

We aimed to recruit a reasonable number of caregivers in the study setting and generate parameters for sample size calculations in future studies. It was not possible to conduct an accurate sample size calculation for the crossover study, because it was a feasibility study and precise estimates from previous research that could inform a calculation were unavailable. The local township hospital provided a list of 4170 names of children younger than five years who lived in Zhaozhou Township (almost all women deliver in hospitals in China). However, the accuracy of this list of names was not known and based on previous research experiences we expected that a significant number of children on the list were not living in the villages of the list of names.

Randomization

The local health officials provided consent before randomization. Our statistician (WW) randomized villages using SAS version 9.2 [51]. Figure 1 shows that the 46 villages were randomized as follows: 16 villages (with 1600 children) into group 1 and 30 villages (with 2570 children) into group 2. The villages were ranked based on the size of their under-five population into three strata of 15, 15 and 16 villages each. An independent statistician provided a list of random numbers to determine the strata that had 16 villages. WW used block-randomization with a ratio of 1:1.6 to allocate a larger proportion of participants to group 2 and to account for the expected higher drop-out (participants had to come back to the village clinic for the face-to-face survey). WW enrolled the villages and assigned them to group 1 and group 2. Differences between group 1 and 2 were the sequence of the face-to-face and text messaging surveys and group 2 were asked about their reasons for differences in responses.

Figure 1. Flow diagram of study participants.

Questionnaires

The face-to-face questionnaire included questions on identification (including mobile phone number) and demographics (about 30 questions), and 17 on care-seeking for childhood diarrhoea and pneumonia (Table S4 in Online Supplementary Document). These questions were taken from the World Health Organization Maternal, Newborn and Child Health Household survey that has been used in China since 2010 (2009 version, unpublished). In addition, the questionnaire had questions on mobile phone usage developed by the research team.

A subset of 17 questions on care-seeking was also administered via text messaging and compared between the methods (Table S4 in Online Supplementary Document). Moderate changes had to be made to the face-to-face questionnaire to make it appropriate for a text messaging survey format. It was aimed to develop a text message questionnaire in which the text message questions were interpreted by caregivers in a similar way as the face-to-face questions. Cognitive interviewing and usability testing was an adequate research strategy for this purpose. The questions are reported by the number of the text message in which they were asked (text message 4 to 20) [45]. The text messaging survey had a total of 19 questions. In addition to the 17 comparison questions, first we asked caregivers to confirm that they were willing to participate (text message 2) and about the relationship between the caregiver and the child (text message 3). In addition to these questions, text message 1 greeted and text message 21 thanked participants.

The text message questions were developed in the following way. First, we converted the face-to-face survey questions into questions suitable for text messages. Second, we tested the questions using cognitive interview techniques to check understanding of the meaning of the questions [52]. Third, we conducted a pilot in which we sent text messages to 217 caregivers from another township in Zhao County (Shahedian Township) and revised the questions based on the results [45].

Recruitment and data collection

The study took place between 16 and 28 March 2013. We asked village doctors to gather all caregivers of young children in their village clinic.

In group 1, trained interviewers obtained informed consent and administered the entire face-to-face questionnaire. After one day, we sent the text messages for two days during which participants had to complete the survey, with 2 reminder text messages if they did not respond.

In group 2, interviewers obtained informed consent and administered the face-to-face questionnaire, but did not ask the questions about care-seeking (see Figure 2). Then we sent the text messages. Participants who responded to text message 4 (first question that was compared between the methods) were invited to visit the village clinic one for the face-to-face questions on care-seeking one day after finishing the text messaging survey (similar recall period as in group 1). Directly after completing the face-to-face questions, participants who did not give the same response in both surveys were asked about their reasons for this.

Figure 2. Item response rate (%) and completion rate for the total group. (N = 1014), group 1 (n = 371) and group 2 (n = 643).

Interviewers used smartphones to record responses in the face-to-face interview [53]. Participants received a towel (worth ¥ 5, about US$0.82, £ 0.52, € 0.62) for completing the face-to-face survey. One researcher (YL) manually sent and received text messages using a Chinese text messaging system (Shāng jī bǎo 商机宝). A second trained researcher checked the text messages that were sent out to prevent errors. We paid participants ¥ 1 back for text message costs (sending one text message in China costs ¥ 0.10) and ¥ 5 when they completed the text message survey (thus participants received ¥ 1 when they sent at least one text message and ¥ 6 when they answered all questions). All data were wirelessly and securely transferred into an Excel database via an internet server and could only be accessed by the researchers involved in this study.

Outcomes and analysis

SPSS version 16.0 [54] and SAS version 9.2 [51] were used for analysis. We assessed the item response rate as the proportion of participants who participated and responded to individual questions and completion rate as the proportion of participants who participated and completed the survey. We compared participants who responded to those who did not respond to text message 2 (responders vs non-responders) and participants who completed the survey to those who did not complete the survey (completers vs non-completers) in group 1. Characteristics were compared using the Pearson χ² test and Fisher Exact test for nominal variables and Mann-Whitney U/ Wilcoxon W (MWU/WW) test for not normally distributed continuous variables and ordinal variables. We calculated the error proportion of the text messaging method, which was evaluated by incorrect text message questions that were sent to participants and incorrect text message answers that were received from participants. We counted the reasons for differences in responses.

We calculated kappa values and the proportion of agreement for data equivalence and the amount of information. For these analyses, we included the participants who responded to text message 4 and were the same caregivers participating in both surveys. We calculated the results for group 1 and 2 together and for the two groups separately. We present data equivalence for 15 nominal questions of which 10 were dichotomous (text message 4, 5, 6, 7, 8, 11, 14, 15, 16, 18) and 5 were non-dichotomous (text message 12, 13, 17, 19, 20). We present Cicchetti-Allison and Fleiss-Cohen weighted kappa (two commonly used kappa weights) for two ordinal questions (text message 9 and 10a) [55]. Kappa values have the following meaning: <0.00 poor; 0.00-0.20 slight; 0.21-0.40 fair; 0.41-0.60 moderate; 0.61-0.80 substantial; and 0.81-1.00 almost perfect [56].

Ethics approval

The Ethical Committee of the Capital Institute of Pediatrics in Beijing provided ethical approval (No. SHERLL 2013009). All participants provided written informed consent for both the face-to-face and text messaging survey prior to their inclusion in the study. We anonymized participant identifiable information for data analysis and reporting.

RESULTS

Participants

Figure 1 shows the flow diagram of participants. Of the 4170 children on list of names, we were able to recruit 1026 caregivers in 42 villages and we included 1014 participants: 371 in 15 villages in group 1 and 643 in 27 villages in group 2 (Table S5 in Online Supplementary Document).

The remaining 3144 caregivers of children (4170 minus 1026) could not be recruited, because they were not present in the village, the list of names was incorrect, they did not meet the inclusion criteria or village doctors were unavailable (in one village in group 1 and in three villages in group 2). The specific numbers for each reason were not possible to register because the accuracy of the name list was unknown. We excluded 12 caregivers for the following reasons: the child of one caregiver just reached the age of five; we did not send text messages to three caregivers in group 1 by an administrative mistake, and we could not identify which child belonged to the text message responses for eight caregivers.

Of 1014 included participants, there were 796 mothers (78.5%), 141 fathers (13.9%), 53 grandmothers (5.2%), 22 grandfathers (2.1%) and 2 aunts (0.2%). Out of 16 characteristics that we had available, 7 had significant differences between the groups (Table 1). In group 1 there were more children with an urban Hukou (P < 0.001), a higher proportion of participants were mothers (P = 0.003), a higher proportion of participants were the primary caregiver of the child (P = 0.001), mothers (P = 0.03) and fathers (P = 0.007) had a higher education level, fathers had a higher median number of years of education (P = 0.046), the family income was lower (P = 0.03) compared to group 2. Out of the 13 indicators related to mobile phone usage, 1 had a significant difference between the groups (Table S6 in Online Supplementary Document).

Table 1. Characteristics of children under-five and caregivers

Variables	Total (N = 1014)	Group 1 (n = 371)	Group 2 (n = 643)	Comparison
Variables	Total (N = 1014)	Group 1 (n = 371)	Group 2 (n = 643)	Statistics*	P-value
Gender, n (%):				χ² = 0.03	0.87
Boy	561 (55.3)	204 (55.0)	357 (55.5)
Girl	453 (44.7)	167 (45.0)	286 (44.5)
Age child groups, n (%):				MWU/WW z = -0.41	0.68
0-11 months	212 (20.9)	74 (19.9)	138 (21.5)
12-23 months	279 (27.5)	112 (30.2)	167 (26.0)
24-59 months	523 (51.6)	185 (49.9)	338 (52.5)
Number of children, n (%):				MWU/WW z = -0.49	0.62
1	470 (46.4)	174 (46.9)	296 (46.0)
2	521 (51.4)	192 (51.8)	329 (51.2)
3	21 (2.1)	4 (1.1)	17 (2.6)
4	2 (0.1)	1 (0.2)	1 (0.2)
Mother’s age in years, median (Q1-Q3)	28 (26-31)	28 (26-31)	28 (26-32)	MWU/WW z = -0.30	0.76
Mother’s education level, median (Q1-Q3)†	3 (3-3)	3 (3-4)	3 (3-3)	MWU/WW z = -2.26	0.03
Mother’s number of years of education, median (Q1-Q3)	9 (9-9)	9 (9-11)	9 (9-9)	MWU/WW z = -1.41	0.16
Mother’s occupation, n (%):				Fisher exact test	0.33
Home	496 (48.9)	171 (46.1)	325 (50.5)
Work	515 (50.8)	199 (53.6)	316 (49.1)
Do not know	3 (0.3)	1 (0.3)	2 (0.4)
Father’s age in years, median (Q1-Q3)	29 (26-32)	29 (27-32)	29 (26-32)	MWU/WWz-0.15	0.88
Father’s education level, median (Q1-Q3)†	3 (3-3)	3 (3-4)	3 (3-3)	MWU/WW z = -2.68	0.007‡
Father’s number of years of education, median (Q1-Q3)	9 (9-9)	9 (9-12)	9 (9-9)	MWU/WW z = -2.00	0.046‡
Father’s occupation, n (%):				Fisher exact test	0.51
Home	12 (1.2)	6 (1.6)	6 (0.9)
Work	998 (98.4)	363 (97.8)	635 (98.8)
Do not know	4 (0.4)	2 (0.6)	2 (0.3)
Relationship to the child, n (%):				Fisher exact test	0.003‡
Mother	796 (78.5)	300 (80.9)	496 (77.1)
Father	141 (13.9)	58 (15.6)	83 (12.9)
Grandmother	53 (5.2)	9 (2.4)	44 (6.8)
Grandfather	22 (2.2)	4 (1.1)	18 (2.8)
Other§	2 (0.2)	0 (0.0)	2 (0.4)
Participant is primary caregiver, n (%):				χ2 = 14.87	0.001‡
Yes	687 (67.8)	279 (75.2)	408 (63.5)
No	327 (32.2)	92 (24.8)	235 (36.5)
Registered as urban or rural, n (%):				χ² = 15.96	<0.001‡
Urban	55 (5.4)	34 (9.2)	21 (3.3)
Rural	959 (94.6)	337 (90.8)	622 (96.7)
Family net income in last year in ¥, median (Q1-Q3)¶	25 000 (20,000- 40 000)	20 000 (15,000- 35 000)	25 000 (20,000- 40 000)	MWU/WW z = -2.24	0.03‡
Do not know family income, n (%)	252 (24.9)	104 (28.0)	148 (23.0)
Family living expenses in the last year in ¥, median (Q1-Q3)	20 000 (10,000- 20 000)	20 000 (10,000- 20 000)	20 000 (10,000- 20 000)	MWU/WW z = -0.55	0.58
Do not know family living expenses, n (%)	209 (20.6)	84 (22.6)	125 (19.4)

*χ²-test, Mann-Whitney U/Wilcoxon W (MWU/WW), quartile (Q), z-score (z).

†3 = junior high school, 4 = senior high school/technical school.

‡P < 0.05.

§Child’s aunt (wife of brother of father), child’s aunt (wife of brother of mother).

¶n = 646 for total group: n = 248 for group 1 and n = 398 for group 2.

Response rate

Of 1014 participants, 65.3% (n = 662) responded to text message 2 (asking about willingness to participate). Eleven participants responded that they were unwilling to participate and 651 were willing to participate (Figure 2 and Table S7 and S8 in Online Supplementary Document). The item response rates were only significantly different between the groups for two questions (text message 15 about the child having an illness with a cough, P = 0.03 and text message 18 about seeking care for a child with a fever or cough, P = 0.049)out of the total of 19 text message questions. Of the 651 participants willing to participate, 54.7% (n = 356) completed the text message survey, which was marginally significantly higher in group 1 (59.8%, n = 137) than group 2 (51.9%, n = 219) (P = 0.05).

Characteristics of responders vs non-responders

There was a significantly higher proportion of caregivers from rural areas in Zhao County in the non-responder group compared to the responder group (P = 0.004). In addition, the median number of text messages received was significantly lower in the non-responder group compared to the responder group (P = 0.046) (Table S9 in Online Supplementary Document). There were more children with diarrhoea in the non-completers group (P = 0.03) compared to the completers group (Table S10 in Online Supplementary Document).

Error proportion

We manually sent a total of 7971 text messages of which 145 (1.8%) were in an incorrect format by mistake: 70 with an additional number (minor editing error), 38 wrong questions and 37 repeated questions. We received a total of 4033 text message replies from participants of which 1358 text messages (33.7%) were in an incorrect format and had to be assessed (Figure 3). After the assessment, we clarified the meaning of 1181 text messages (29.3%) and we did not have to send another text message to follow these responses up. For the remaining 177 text messages (4.4%), we could not clarify the meaning of the text messages and we had to resent our text message question (Figure S1 in Online Supplementary Document). This process of resending text messages involved a total of 116 out of 662 participants responding to text message 2 (17.5%).

Figure 3. Incorrect text messages sent.

Data equivalence

Total group

Kappa values were moderate or substantial for most of the 15 nominal dichotomous questions. However, for questions answered by a small number of participants, kappa was fair (text message 6 and 11, for which the 95% CI also overlaps) or could not be calculated (text message 5 and 7). The proportion of agreement was as follows for 10 nominal dichotomous questions: >90% for five questions; between 80.0% and 90.0% for four questions; and 70.0% for one question. Out of five non-dichotomous questions, the proportion of agreement and kappa could not be calculated for four questions (text message 12, 13 19 and 20) that had fewer than 10 participants (Table 2 for total group, Table 3 for group 1 and 2, and Table S11-S15 in Online Supplementary Document). Weighted kappa values were substantial for Cicchetti-Allison and moderate for Fleiss-Cohen for text message 9. Both weighted kappa values were fair for text message 10a (Table 4 and Table S16 and S173 in Online Supplementary Document).

Table 2. Nominal questions, agreement and kappa coefficients for total group (N = 409)

Text message: brief content	Total group
Text message: brief content	No. of pairs	% agreement	Kappa	95% CI	P-value
4: child had diarrhoea	409	96.8	0.76	0.64-0.89	<0.001‡
5: child had blood in stools	21	100.0*	–	–	–
6: child drank Oral Rehydration Salts	19	84.2	0.31	-0.27-0.90	0.16
7: child drank recommended fluids	16	87.5*	–	–	–
8: child drank other fluids	16	87.5	0.74	0.41-1.00	0.002‡
11: sought care for diarrhoea	10	70.0	0.35	-0.24-0.94	0.26
12: why care was not sought for diarrhoea†	2	2/2 agree*	–	–	–
13: where care was sought for diarrhoea†	4	3/4 agree*	–	–	–
14: child had fever	365	95.1	0.71	0.59-0.84	<0.001‡
15: child had illness with cough	332	88.9	0.70	0.61-0.79	<0.001‡
16: child breathed fast or difficultly	308	97.1	0.65	0.44-0.86	<0.001‡
17: cause of fast or difficult breathing†	8	6/8 agree*	0.58	0.16-1.00	0.03‡
18: sought care for child during fever or cough	55	94.5	0.77	0.52-1.00	<0.001‡
19: why care was not sought for fever or cough†	2	2/2 agree*	–	–	–
20: where care was sought for fever or cough†	40	65.0*	–	–	–

CI – confidence interval

*Kappa cannot be calculated.

†Non-dichotomous.

‡P < 0.05.

Table 3. Nominal questions, agreement and kappa coefficients comparison between group 1 (n = 183) and group 2 (n = 226)

Text message: brief content	Group 1			Group 2
Text message: brief content	No. of pairs	Kappa or agreement	95% CI	No. of pairs	Kappa or agreement	95% CI
4: child had diarrhoea	183	0.90	0.77-1.00	226	0.68	0.50-0.86
5: child had blood in stools	10	10/10 agree*	–	11	11/11 agree*	–
6: child drank Oral Rehydration Salts	10	0.41 (8/10 agree)	-0.18-1.00	9	8/9 agree*	–
7: child drank recommended fluids	9	7/9 agree*	–	7	7/7 agree*	–
8: child drank other fluids	9	0.57 (7/9 agree)	0.10-1.00	7	7/7 agree*	–
11: sought care for diarrhoea	5	4/5 agree*	–	5	3/5 agree*	–
12: why care was not sought for diarrhoea†	0	–	–	2	2/2 agree*	–
13: where care was sought for diarrhoea†	4	3/4 agree*	–	0	–	–
14: child had fever	164	0.77	0.61-0.93	201	0.66	0.47-0.84
15: child had illness with cough	156	0.66	0.53-0.79	176	0.74	0.62-0.86
16: child breathed fast or difficultly	146	0.70	0.42-0.98	162	0.60	0.28-0.92
17: cause of fast or difficult breathing†	5	0.38 (3/5 agree)	-0.21-0.96	3	3/3 agree*	–
18: sought care for child during fever or cough	31	0.71	0.34-1.00	24	0.83	0.52-1.00
19: why care was not sought for fever or cough†	1	1/1 agree*	–	1	1/1 agree*	–
20: where care was sought for fever or cough†	23	13/23 agree*	–	17	13/17 agree*	–

*Kappa cannot be calculated.

†Non-dichotomous.

Table 4. Ordinal questions, agreement and weighted kappa coefficients for total group (N = 409) and comparison between group 1 (n = 183) and group 2 (n = 226)

Text message: brief content	Total group					Group 1			Group 2
Text message: brief content	No. of pairs	% agreement	Cicchetti-Allison and Fleiss-Cohen kappa	95% CI	P-value	No. of pairs	Cicchetti-Allison and Fleiss-Cohen kappa	95%CI	No. of pairs	Cicchetti-Allison and Fleiss-Cohen kappa	95% CI
9: how much did child drink during diarrhoea	14	85.7	0.66, 0.60	0.23-1.00, 0.07-1.00	<0.001, 0.01†	8	0.53, 0.44	0.005-1.00, -0.18-1.00	6	1.00, 1.00	1.00-1.00, 1.00-1.00
10a: how much did child eat during diarrhoea	11	45.5	0.37, 0.37	0.05-0.70, -0.08-0.83	0.02†, 0.09	5	0.70 0.83	0.34-1.00, 0.58-1.00	6	0.13, 0.05	-0.10-0.36, -0.26-0.36
10a: how much did child eat during diarrhoea*	6	4/6 agree	0.64, 0.74	0.21-1.00, 0.35-1.00	0.03, 0.049†	–	–	–	–	–	–
10: child had been introduced to complementary foods	12	8/12 agree	–	–	–	–	–	–	–	–	–

CI – confidence interval

*We excluded children whose caregivers said that they had not been introduced to complementary feeding by either face-to-face interview or text messaging.

†P < 0.05.

Comparison between group 1 and 2

Five nominal questions had overlapping 95% confidence interval (CI) (text message 4, 14, 15, 16 and 18). Eight questions had too small number of participants to calculate 95% CIs: two questions had perfect agreement (text message 5 and 19) and six questions had fairly similar agreement (text message 6, 7, 8, 11, 17 and 20). The remaining two questions could not be compared (Table 3). For text message 9, weighted kappa values were moderate for group 1 but perfect for group 2. For text message 10a, weighted kappa was substantial for Cicchetti-Allison kappa and almost perfect for Fleiss-Cohen kappa in group 1, but slight for both in group 2 (Table 4).

Amount of information

Three out of four participants provided the same number of places where care was sought for diarrhoea (text message 13). Forty participants responded to text message 20 for which Fleiss-Cohen weighted kappa indicated non-significant (P = 0.38) slight agreement and Cicchetti-Allison borderline significant (P = 0.046) fair agreement (Table 5, Table S18 and S19 the Online Supplementary Document).

Table 5. Amount of information, number of places recorded for total group (N = 409)

Text message: brief content	n of pairs	% agreement	Cicchetti-Allison and Fleiss-Cohen kappa	95% CI	P-value
13:where sought care for diarrhoea	4	3/4 agree*
20:where sought care for fever or cough	40	75.0	0.23, 0.13	-0.06 to 0.53, -0.23 to 0.49	0.046†, 0.38

*Kappa cannot be calculated.

†P < 0.05.

Reasons for differences in responses

Of the 226 participants in group 2, 51 participants provided 42 reasons for disagreement and 9 reasons were missing. For eight questions (text message 4, 6, 10a, 11, 12, 14, 15 and 16) there was disagreement between the face-to-face and text messaging answers. There were 17 reasons related to the text messaging method including 8 participants did not see the date (there were dates in the text messages to ask about the past two weeks), 3 misunderstood the question and 3 replied carelessly and did not pay attention. There were 22 reasons related to the face-to-face method including 8 participants did not know the accurate definition of a symptom, 7 misunderstood the question and 4 did not understand the date clearly. There were three other reasons that could not be directly related to the methods: two participants could not recall the information that was asked and one participant had a change of mind. (Table S20 in Online Supplementary Document).

DISCUSSION

Principal results

This study showed that almost two-thirds of participants responded to the first text message about willingness to participate. Of the participants who responded that they were willing to participate, more than half completed the text messaging survey. A significantly higher proportion of participants not responding to the first question had children with a rural Hukou and received a lower median number of text messages. There were more children with diarrhoea in the non-completers group compared to the completers group, which may be explained by that caregivers of a child with diarrhoea had to answer the largest number of questions (at least 14 questions).A large number of text messages were manually sent of which only a few were in an incorrect format (researcher-related errors). Of the text message replies received, about a third were in an incorrect format (participant-related errors), but only a small proportion required follow-up. Overall, text message data were moderately to substantially equivalent to face-to-face data. The proportion of agreement was as follows for the 10 nominal dichotomous questions: >90% for 5 questions; between 80% and 90% for 4 questions; and 70.0% for 1 question. Kappa was as follows for these 10 nominal dichotomous questions: substantial for 6 (text message 4, 8, 14, 15, 16 and 18); fair for whether care was sought for diarrhoea (text message 11) and whether an ORS was given (text message 6); and could not be calculated for 2 questions (text message 5 and 7) that had an insufficient number of values for each cell in the 2x2 classification table. A calculation after the study finished showed that power was >0.95 for question 4, 14 and 15 which had kappa values of at least 0.70, while question 16 only had a power of approximately 0.60 with a kappa value of 0.65. However, we were unable to calculate agreement for the remaining questions because power was insufficient for these questions with small sample sizes. Finally, both the face-to-face and text message surveys generated errors in responses that caregivers gave.

Strength and limitations

This is the first study using a cluster randomised crossover design to assess the validity of using text messaging for data collection in a middle-income country. Mobile phone use was high in our setting [53] and mobile phones were generally kept personally by the user. However, recruitment of caregivers was challenging and the text message response rate was insufficient for use in household surveys; these aspects were evaluated in another publication [50]. This allowed us to reach participants directly and ensure privacy. The findings of this study can be generalized to other rural areas in China with similar socio-economic characteristics, and levels of mobile phone use.

Despite stratified randomization, there were small differences in characteristics of the participants between group 1 and group 2. This may be explained by the cluster randomised design of the study in which it was more difficult to achieve balance between groups than in an individual randomised controlled study [57]. Participants in group 2 were less educated, lived more often in rural areas and were less likely to be the primary caregiver than participants in group 1, which may have contributed to a lower completion rate in group 2.

Four outcomes were part of six indicators for data quality [19,58]: data equivalence, amount of information, item response rate and completion rate. We could not assess two quality indicators: a validity check of the data against the true values and the absence of social desirability bias. We did not attempt to verify the reported diarrhoea or pneumonia symptoms because this was not the focus of our study.

Some of the kappa calculations were driven by low prevalence and we did not calculate a prevalence and bias adjusted kappa [59].

Comparison with prior work

The previous text message data collection study in Zhao County found a lower initial response and completion rate, which may be explained by differences in study designs. However, both studies found substantial data agreement and similar reasons for differences [23]. Other studies that assessed data agreement mainly took place in high-income countries and also found a high degree of agreement [28,34-36,39,41,42].

Few grandparents could be included in the crossover study, because they were often unable to text message. This may be less of a problem in other settings where elderly people are more commonly able to text message [60]. Previous mHealth-based data collection studies frequently only included younger participants [28,29,32,33,35,37,61]. While in other settings socio-economic factors may influence usage of mobile phones [62], this did not seem to be of large influence in the study setting as the costs of text messaging were very low.

A study in the United Kingdom used an automated text messaging system, but out of 2952 text messages sent in total, still 214 (7%) text messages had to be sent manually [28]. As a result of system or researcher errors, about 6% of participants were sent the wrong number of text messages (too many or too few), while for about 2% of participants there were other problems [28]. However, an automated system would be a huge benefit for researchers. Sending text messages manually was a labour intensive process as one researcher (YL) was continuously sending text messages 12 hours a day (from 9 am till 9 pm) during the study period (14 days). Also before and after sending text messages, she had to do additional work for the study (communicate with the field workers and preparation and checking work). A private software company was used to provide support for the text messaging system. Open-source software is an important issue for mHealth-based research in low- and middle income countries, because limited budgets would benefit from free software. However, in some circumstances private software has value too [63].

Future research

This study was an early attempt in a middle-income setting and thus more studies are required to address remaining questions. First, the response rate requires improvement as it was insufficient according to the requirements for a household survey (minimum response rate of 90%) [7]. There are many approaches that can increase the response rate which should be evaluated [13,23,50]. Text messaging could be a useful tool in remote rural areas, but our study showed that more rural participants refused to participate compared to people living in urban areas, which led to a lower response rate in rural areas. Second, while data equivalence is satisfactory for the questions that we were able to evaluate, there are still some questions with few responses where data equivalence has not yet been fully established. Further research is needed to establish the equivalence those questions. Third, cost-effectiveness is an important under-explored advantage of text messaging over current data collection methods and could reduce the need for resources, particularly in rural remote areas [13,22]. Fourth, operational guidelines need to be developed and tested to facilitate implementation of text messaging data collection [13].

CONCLUSIONS

This study shows that text messaging produces data similar to data collected by face-to-face interviews in a middle-income setting, but several questions remain. Improving the response rate is important because text messaging surveys could be of greater value in rural remote areas due to the cost-saving potential [15].

Acknowledgements

We would like to thank the study participants, interviewers, Dr Xinglei Sun and other colleagues from Zhao County Maternal and Child Health Hospital, Zhao County Health Bureau and Zhaozhou Township Hospital for their support for the fieldwork and Swarna Khare for help with statistics.

Notes

[1] Funding: The study was funded by the Capital Institute of Pediatrics. The Department of Primary Care and Public Health at Imperial College is grateful for support from the National Institute for Health Research Biomedical Research Centre Funding scheme, the National Institute for Health Research Collaboration for Leadership in Applied Health Research and Care scheme, and the Imperial Centre for Patient Safety and Service Quality. MV is funded by Imperial’s Global eHealth Unit and received a small grant from Santander for travelling to China. The funders had no role in study design, data collection, and analysis, decision to publish, or preparation of the manuscript.

[2] Authorship declaration: MV designed the study and drafted the article. MV, WW, QW, YL, XD, and LC collected the data. MV and WW jointly conducted the analyses. MV, WW, QW, YL, RS, XD, LC, YZ, JS, and IR made a substantial contribution to conception and design of the study and revised the article critically for important intellectual content.

[3] Competing interests: All authors have completed the Unified Competing Interest form at www.icmje.org/coi_disclosure.pdf (available on request from the corresponding author). The authors declare research funding from the Capital Institute of Pediatrics, National Institute for Health Research Biomedical Research Centre Funding scheme, the National Institute for Health Research Collaboration for Leadership in Applied Health Research and Care scheme, and the Imperial Centre for Patient Safety and Service Quality and Santander. The authors declare no financial relationships with other organizations that might have an interest in the submitted work in the previous three years, and no other relationships or activities that could appear to have influenced the submitted work. IR is the Editor-in-Chief of the Journal of Global Health. To ensure that any possible conflict of interest relevant to the journal has been addressed, this article was reviewed according to best practice guidelines of international editorial organisations.

REFERENCES

[1] L Liu, S Oza, D Hogan, J Perin, I Rudan, and JE Lawn. Global, regional, and national causes of child mortality in 2000-13, with projections to inform post-2015 priorities: an updated systematic analysis. Lancet. 2015;385:430-40. DOI: 10.1016/S0140-6736(14)61698-6. [PMID:25280870]

[2] J Bryce, R Black, and C Victora. Millennium Development Goals 4 and 5: progress and challenges. BMC Med. 2013;11:225 DOI: 10.1186/1741-7015-11-225. [PMID:24228742]

[3] J Bryce, F Arnold, A Blanc, A Hancioglu, H Newby, and J Requejo. Measuring coverage in MNCH: new findings, new strategies, and recommendations for action. PLoS Med. 2013;10:e1001423 DOI: 10.1371/journal.pmed.1001423. [PMID:23667340]

[4] UNICEF. Pneumonia and diarrhoea: Tackling the deadliest diseases for the world’s poorest children. 2012. Available: http://www.unicef.org/media/files/UNICEF_P_D_complete_0604.pdf Accessed: 23 October 2017.

[5] HK Herbert, AC Lee, A Chandran, I Rudan, and AH Baqui. Care seeking for neonatal illness in low- and middle-income countries: a systematic review. PLoS Med. 2012;9:e1001183 DOI: 10.1371/journal.pmed.1001183. [PMID:22412355]

[6] A Chandran, HK Herbert, AC Lee, I Rudan, and AH Baqui. Assessment of the proportion of neonates and children in low and middle income countries with access to a healthcare facility: A systematic review. BMC Res Notes. 2011;4:536 DOI: 10.1186/1756-0500-4-536. [PMID:22166258]

[7] A Hancioglu and F Arnold. Measuring coverage in MNCH: tracking progress in health for women and children using DHS and MICS household surveys. PLoS Med. 2013;10:e1001391 DOI: 10.1371/journal.pmed.1001391. [PMID:23667333]

[8] Yansaneh IS, Chapter XII. An analysis of cost issues for surveys in developing and transition countries. Household Sample Surveys in Developing and Transition Countries. New York: United Nations; 2005.

[9] N Bruce, D Pope, B Arana, C Shiels, C Romero, and R Klein. Determinants of care seeking for children with pneumonia and diarrhea in guatemala: implications for intervention strategies. Am J Public Health. 2014;104:647-57. DOI: 10.2105/AJPH.2013.301658. [PMID:24524510]

[10] JH Raven, Q Chen, RJ Tolhurst, and P Garner. Traditional beliefs and practices in the postpartum period in Fujian Province, China: a qualitative study. BMC Pregnancy Childbirth. 2007;7:8 DOI: 10.1186/1471-2393-7-8. [PMID:17584930]

[11] H Campbell, S El Arifeen, T Hazir, J O’Kelly, J Bryce, and I Rudan. Measuring coverage in MNCH: challenges in monitoring the proportion of young children with pneumonia who receive antibiotic treatment. PLoS Med. 2013;10:e1001421 DOI: 10.1371/journal.pmed.1001421. [PMID:23667338]

[12] JH Requejo, H Newby, and J Bryce. Measuring coverage in MNCH: challenges and opportunities in the selection of coverage indicators for global monitoring. PLoS Med. 2013;10:e1001416 DOI: 10.1371/journal.pmed.1001416. [PMID:23667336]

[13] Y Zhang, L Chen, MH van Velthoven, W Wang, L Liu, and X Du. mHealth Series: Measuring maternal newborn and child health coverage by text messaging - a county-level model for China. J Glob Health. 2013;3:020402 DOI: 10.7189/jogh.03.020402. [PMID:24363920]

[14] Birnbaum B, DeRenzi B, Flaxman AD, Lesh N. Automated quality control for mobile data collection. Proceedings of the 2nd ACM Symposium on Computing for Development; Atlanta, Georgia. 2160603: ACM; 2012.

[15] MH van Velthoven, J Car, Y Zhang, and A Marusic. mHealth series: New ideas for mHealth data collection implementation in low- and middle-income countries. J Glob Health. 2013;3:020101 DOI: 10.7189/jogh.03.020101. [PMID:24363911]

[16] WorldBank. Mobile cellular subscriptions 2016. Available: https://data.worldbank.org/indicator/IT.CEL.SETS Accessed: 23 October 2017.

[17] WA Kaplan. Can the ubiquitous power of mobile phones be used to improve health outcomes in developing countries? Global Health. 2006;2:9 DOI: 10.1186/1744-8603-2-9. [PMID:16719925]

[18] JE Maher, K Pranian, L Drach, M Rumptz, C Casciato, and J Guernsey. Using text messaging to contact difficult-to-reach study participants. Am J Public Health. 2010;100:969-70, author reply 70. DOI: 10.2105/AJPH.2009.188391. [PMID:20395564]

[19] A Bowling. Mode of questionnaire administration can have serious effects on data quality. J Public Health (Oxf). 2005;27:281-91. DOI: 10.1093/pubmed/fdi031. [PMID:15870099]

[20] Kasprzyk D, Chapter IX. Measurement error in household surveys: sources and measurement. Household Sample Surveys in Developing and Transition Countries. New York: United Nations; 2005.

[21] SS Hu, L Balluz, MP Battaglia, and MR Frankel. The impact of cell phones on public health surveillance. Bull World Health Organ. 2010;88:799 DOI: 10.2471/BLT.10.082669. [PMID:21076555]

[22] X Du, W Wang, MH Velthoven, L Chen, RW Scherpbier, and Y Zhang. mHealth Series: Text messaging data collection of infant and young child feeding practice in rural China - a feasibility study. J Glob Health. 2013;3:020403 DOI: 10.7189/jogh.03.020404. [PMID:24363921]

[23] Y Li, W Wang, MH van Velthoven, L Chen, J Car, and I Rudan. Text messaging data collection for monitoring an infant feeding intervention program in rural China: feasibility study. J Med Internet Res. 2013;15:e269 DOI: 10.2196/jmir.2906. [PMID:24305514]

[24] KL L’Engle, HL Vahdat, E Ndakidemi, C Lasway, and T Zan. Evaluating feasibility, reach and potential impact of a text message family planning information service in Tanzania. Contraception. 2013;87:251-6. DOI: 10.1016/j.contraception.2012.07.009. [PMID:22935322]

[25] LG Macedo, CG Maher, J Latimer, and JH McAuley. Feasibility of using short message service to collect pain outcomes in a low back pain clinical trial. Spine. 2012;37:1151-5. DOI: 10.1097/BRS.0b013e3182422df0. [PMID:22146289]

[26] B Suffoletto, C Callaway, J Kristan, K Kraemer, and DB Clark. Text-message-based drinking assessments and brief interventions for young adults discharged from the emergency department. Alcohol Clin Exp Res. 2012;36:552-60. DOI: 10.1111/j.1530-0277.2011.01646.x. [PMID:22168137]

[27] J Baird, ML Ranney, and MJ Mello. Comment on the paper by Suffoletto et al. entitled: Text-message-based assessments and brief intervention for young adults discharged from the emergency department. Alcohol Clin Exp Res. 2012;36:395-7. DOI: 10.1111/j.1530-0277.2011.01710.x. [PMID:22236062]

[28] HM Whitford, PT Donnan, AG Symon, G Kellett, E Monteith-Hodge, and P Rauchhaus. Evaluating the reliability, validity, acceptability, and practicality of SMS text messaging as a tool to collect research data: results from the Feeding Your Baby project. J Am Med Inform Assoc. 2012;19:744-9. DOI: 10.1136/amiajnl-2011-000785. [PMID:22539081]

[29] M Moller, J Attermann, G Myklebust, and N Wedderkopp. Injury risk in Danish youth and senior elite handball using a new SMS text messages approach. Br J Sports Med. 2012;46:531-7. DOI: 10.1136/bjsports-2012-091022. [PMID:22554848]

[30] I Axén, L Bodin, G Bergstrom, L Halasz, F Lange, and PW Lovgren. The use of weekly text messaging over 6 months was a feasible method for monitoring the clinical course of low back pain in patients seeking chiropractic care. J Clin Epidemiol. 2012;65:454-61. DOI: 10.1016/j.jclinepi.2011.07.012. [PMID:22169083]

[31] M Magee, A Isakov, HT Paradise, and P Sullivan. Mobile phones and short message service texts to collect situational awareness data during simulated public health critical events. Am J Disaster Med. 2011;6:379-85. DOI: 10.5055/ajdm.2011.0076. [PMID:22338318]

[32] SM Schembre and J Yuen. Project TwEATs. A feasibility study testing the use of automated text messaging to monitor appetite ratings in a free-living population. Appetite. 2011;56:465-8. DOI: 10.1016/j.appet.2011.01.014. [PMID:21251941]

[33] JE Haberer, J Kiwanuka, D Nansera, IB Wilson, and DR Bangsberg. Challenges in using mobile phones for collection of antiretroviral therapy adherence data in a resource-limited setting. AIDS Behav. 2010;14:1294-301. DOI: 10.1007/s10461-010-9720-1. [PMID:20532605]

[34] B Johansen and N Wedderkopp. Comparison between data obtained through real-time data capture by SMS and a retrospective telephone interview. Chiropr Osteopat. 2010;18:10 DOI: 10.1186/1746-1340-18-10. [PMID:20500900]

[35] MS Lim, R Sacks-Davis, CK Aitken, JS Hocking, and ME Hellard. Randomised controlled trial of paper, online and SMS diaries for collecting sexual behaviour information from young people. J Epidemiol Community Health. 2010;64:885-9. DOI: 10.1136/jech.2008.085316. [PMID:19767322]

[36] G Alfvén. SMS pain diary: a method for real-time data capture of recurrent pain in childhood. Acta Paediatr. 2010;99:1047-53. DOI: 10.1111/j.1651-2227.2010.01735.x. [PMID:20219052]

[37] S Kew. Text messaging: an innovative method of data collection in medical research. BMC Res Notes. 2010;3:342 DOI: 10.1186/1756-0500-3-342. [PMID:21172018]

[38] A Kongsted and C Leboeuf-Yde. The Nordic back pain subpopulation program–individual patterns of low back pain established by means of text messaging: a longitudinal pilot study. Chiropr Osteopat. 2009;17:11 DOI: 10.1186/1746-1340-17-11. [PMID:19919715]

[39] E Kuntsche and B Robert. Short message service (SMS) technology in alcohol research–a feasibility study. Alcohol Alcohol. 2009;44:423-8. DOI: 10.1093/alcalc/agp033. [PMID:19482879]

[40] A Roberts and A Gorman. Short message service for outpatient data collection. Br J Anaesth. 2009;102:436 DOI: 10.1093/bja/aen400. [PMID:19218377]

[41] C Bexelius, H Merk, S Sandin, A Ekman, O Nyren, and S Kuhlmann-Berenzon. SMS versus telephone interviews for epidemiological data collection: feasibility study estimating influenza vaccination coverage in the Swedish population. Eur J Epidemiol. 2009;24:73-81. DOI: 10.1007/s10654-008-9306-7. [PMID:19082745]

[42] DM Haller, LA Sanci, GC Patton, and SM Sawyer. Text message communication in primary care research: a randomized controlled trial. Fam Pract. 2009;26:325-30. DOI: 10.1093/fampra/cmp040. [PMID:19542193]

[43] D Haller, L Sanci, S Sawyer, C Coffey, and GRU Patton. OK 2 TXT 4 RESEARCH?–feasibility of text message communication in primary care research. Aust Fam Physician. 2006;35:175-6. [PMID:16525536]

[44] J Anhřj and C Moldrup. Feasibility of collecting diary data from asthma patients through mobile phones and SMS (short message service): response rate analysis and focus group evaluation from a pilot study. J Med Internet Res. 2004;6:e42 DOI: 10.2196/jmir.6.4.e42. [PMID:15631966]

[45] MH van Velthoven, Y Li, W Wang, X Du, Q Wu, and L Chen. mHealth Series: mHealth project in Zhao County, rural China - Description of objectives, field site and methods. J Glob Health. 2013;3:020401 DOI: 10.7189/jogh.03.020401. [PMID:24363919]

[46] Statistical Communiqué of the People's Republic of China on the 2010 National Economic and Social Development. Beijing: Beijing National Bureau of Statistics of China; 2010.

[47] Historical currency rates. 9 May 2014. Available: http://www.oanda.com/currency/historical-rates/. Accessed: 23 October 2017.

[48] D Hipgrave. Communicable disease control in China: From Mao to now. J Glob Health. 2011;1:224-38. [PMID:23198121]

[49] C Weiyuan. China’s village doctors take great strides. Bull World Health Organ. 2008;86:914-5. DOI: 10.2471/BLT.08.021208. [PMID:19142288]

[50] MH van Velthoven, Y Li, W Wang, X Du, L Chen, and Q Wu. mHealth Series: Factors influencing sample size calculations for mHealth-based studies - A mixed methods study in rural China. J Glob Health. 2013;3:020404 [PMID:24363922]

[51] SAS version 9.2. Marlow, UK: SAS Institute Inc; 2008.

[52] Willis G. Cognitive Interviewing, A “How To” Guide 1999 9 May 2014. Available: http://www.dfpp.univr.it/documenti/OccorrenzaIns/matdid/matdid823948.pdf Accessed: 23 October 2017.

[53] S Zhang, Q Wu, MH van Velthoven, L Chen, J Car, and I Rudan. Smartphone versus pen-and-paper data collection of infant feeding practices in rural China. J Med Internet Res. 2012;14:e119 DOI: 10.2196/jmir.2183. [PMID:22989894]

[54] SPSS for Windows version 16.0. Chicago: SPSS Inc; 2007.

[55] S Vanbelle and A Albert. A note on the linearly weighted kappa coefficient for ordinal scales. Stat Methodol. 2009;6:157-63. DOI: 10.1016/j.stamet.2008.06.001

[56] JR Landis and GG Koch. The measurement of observer agreement for categorical data. Biometrics. 1977;33:159-74. DOI: 10.2307/2529310. [PMID:843571]

[57] S Hahn, S Puffer, DJ Torgerson, and J Watson. Methodological bias in cluster randomised trials. BMC Med Res Methodol. 2005;5:10 DOI: 10.1186/1471-2288-5-10. [PMID:15743523]

[58] De Leeuw E, van der Zouwen J. Data quality in telephone and face-to-face surveys: a comparative meta-analysis. In: Groves R, Biemer P, Lyberg L, editors. Telephone survey methodology. New York: John Wiley and Sons; 1988. p. 283-300.

[59] L Flight and SA Julious. The disagreeable behaviour of the kappa statistic. Pharm Stat. 2015;14:74-8. DOI: 10.1002/pst.1659. [PMID:25470361]

[60] LD Hughes, J Done, and A Young. Not 2 old 2 TXT: there is potential to use email and SMS text message healthcare reminders for rheumatology patients up to 65 years old. Health Informatics J. 2011;17:266-76. DOI: 10.1177/1460458211422019. [PMID:22193827]

[61] R Schnall, A Okoniewski, V Tiase, A Low, M Rodriguez, and S Kaplan. Using text messaging to assess adolescents’ health information needs: an ecological momentary assessment. J Med Internet Res. 2013;15:e54 DOI: 10.2196/jmir.2395. [PMID:23467200]

[62] A Wesolowski, N Eagle, AM Noor, RW Snow, and CO Buckee. Heterogeneous mobile phone ownership and usage patterns in Kenya. PLoS One. 2012;7:e35319 DOI: 10.1371/journal.pone.0035319. [PMID:22558140]

[63] M Tomlinson, W Solomon, Y Singh, T Doherty, M Chopra, and P Ijumba. The use of mobile phones as a data collection tool: a report from a household survey in South Africa. BMC Med Inform Decis Mak. 2009;9:51 DOI: 10.1186/1472-6947-9-51. [PMID:20030813]

Background

Methods

Results

Conclusions

Design

Study setting and sample

Randomization

Questionnaires

Recruitment and data collection

Outcomes and analysis

Ethics approval

Participants

Response rate

Characteristics of responders vs non-responders

Error proportion

Data equivalence

Total group

Comparison between group 1 and 2

Amount of information

Reasons for differences in responses

Principal results

Strength and limitations

Comparison with prior work

Future research