Effective management of chronic diseases is multifactorial and includes health system components related to prevention, effective screening and diagnosis, pharmacological/non-pharmacological treatment strategies and then lifelong, routine monitoring for continued patient health. The often long-term nature of chronic disease, especially when health systems prioritise early diagnosis, results in considerable strain on health service resources. Consequently, increasing burdens of chronic diseases pose huge challenges in LMIC health systems, which have been developed mainly for acute conditions and single communicable disease vertical care systems (eg, Malaria, or HIV) and so are ill-equipped to deal with the costs of disease management and long-term treatment required for effective care of people with chronic conditions [1,2]. Depending on age, sex and chronic condition profile, diagnosis with a chronic condition is expected to result in increase in health care services use and costs [3] due to increased physician visits, hospital care, and prescription drugs [4], with these costs lying squarely on the country’s health system, private insurance or individual patients as out-of-pocket payments [5].
Key tools that aid in chronic disease management in both policy and practice, from clinical decisions at the bedside, and patient choices, to governance of health facilities and spending by government and health insurers are clinical practice guidelines [6]. International and national clinical practice guidelines are meant to contain pragmatic and standardized actions or recommendations to be taken by various stakeholders in order to improve the quality of care and subsequently patient health outcomes [7-10]. Guidelines based care has been promoted given that successful implementation often leads to reductions in both mortality and morbidity [7], and costs [11]. In spite of possible benefits for clinical providers to adhere to guideline-based care [7-10], most measures of implementation fidelity and adherence to guidelines are poor [8,10]. In LMICs factors such as poor health literacy, limited health budgets, limited facilities, inadequate clinical expertise and personnel, poor drug supply and out-of-pocket health expenditures are thought to predominate [12], whereas in HIC, complexity of guidelines, poor access to the guidelines for clinicians, time and resource constraints have also been cited as reasons for non-adherence to clinical practice guidelines [10]. In both settings a combination of health system factors and individual factors (both provider and patient) have a role, as does how the guidelines were developed [13-15].
Due to a paucity of well-designed guidelines in LMICs, there is a general reliance on guidelines developed in HICs by third-party guideline production organisations (eg, UK’s National Institute for Health and Clinical Excellence), professional associations (eg, American Diabetes Association) and government health departments or those developed by international organisations (eg, World Health Organisation) [13]. However, the extent to which guidelines developed for HIC settings and patient representation, are relevant to LMIC settings, is a major challenge [9]. Comparisons of guidelines for T2DM and hypertension in LMIC vs HIC settings found that most LMIC guidelines were lacking in terms of applicability, clarity and socioeconomic contextualization as they were adapted from existing HIC guidelines [16 ,17], casting light on the difficulty of high implementation fidelity in such settings.
This analysis describes how major guidelines, particularly national clinical care recommendations are developed, and summarises the strength of evidence, comparing LMIC and HIC settings for three important chronic diseases: HIV, hypertension and type 2 diabetes mellitus (T2DM). The focus on these chronic conditions is based on how they are a major cause for concern for low- and middle- income countries (LMICs) [18,19]. Specifically, the direct and indirect relationships between cardiovascular disease, hypertension, and T2DM with HIV and antiretroviral therapy (ART) [20], along with rapid urbanisation, and dietary and lifestyle changes [21], are projected to lead to substantial increases in the prevalence of non-communicable diseases rates in LMIC, but particularly in countries with high HIV burden. Currently, sub-Saharan Africa carries almost two-thirds of the HIV burden globally [22] and is predicted, along with Asia, to see the greatest increase in T2DM prevalence by 2045 [23]. Both regions are also among the regions with the highest prevalence of hypertension, a major modifiable risk factor for cardiovascular disease which is the leading cause of death worldwide [24].
The focus of this study is on the guidelines for the post-diagnosis management and monitoring of disease due to the significant health system cost of long-term monitoring. Specifically, the study will focus on the use of biomarkers, which in the broadest definition, are physical measures of individual health commonly used to monitor disease progression, treatment efficacy, and to determine risk of progression or complication [11]. For the three chronic diseases in question, biomarker-based monitoring is conducted using nearly direct and immediate measures of disease like viral load (VL) for HIV and blood pressure (Systolic blood pressure/Diastolic blood pressure) for hypertension, or indirect, medium term markers; Haemoglobin A1c (HbA1c) for T2DM.
Search strategy
Searches for guideline documents were conducted on PubMed, Google, and health ministry websites. The terms ‘diabetes’, ‘hypertension’ or ‘HIV/AIDS’, combined with ‘guideline’ or ‘recommendations’ or ‘consensus’, and the country names were used to identify guideline documents. Country selection was based on the classifications by income level, as given by the 2020 World Bank classifications [25]. As a starting point, the countries identified as having guidelines for T2DM or hypertension by Owolabi et al [16, 17] were chosen. These included; LIC: Afghanistan, Ethiopia, Liberia, Malawi, Tanzania, Uganda, Zimbabwe; Lower-MIC: Ghana, India, Kenya, Nigeria, the Philippines, Sri Lanka, Swaziland, Zambia; Upper-MIC: Botswana, Brazil, China, Colombia, Fiji, Jamaica, Libya, Malaysia, Mexico, South Africa, Turkey; HIC: Argentina, Australia, Bahrain, Bermuda, Canada, Greece, Hong Kong, Korea Republic, New Zealand, Scotland, Singapore, Sweden, United Kingdom, U.S.A. Additionally, guidelines produced for each condition by regional (Europe and Latin America) or global organisations (eg, World Health Organisation, International Diabetes Federation) were also included, with only recommendations or guidelines published between 2010-2020 included. For guidelines published multiple times during this period, the most recent guideline was summarised with reference to older guidelines. Guidelines that were not in English were translated using Google translate.
Data extraction
As guidelines vary in their scope, we focused on sections that covered the treatment targets and the frequency of biomarker monitoring among non-pregnant adults for each of the three conditions. The specific recommendations, the strength of the recommendations and the quality of the evidence were all extracted and retained for further analysis.
Analysis
The identified guidelines were assessed for the basis of the guidelines (were they specifically based on international guidelines or consensus documents), specific recommendations for treatment targets of non-pregnant adults and the frequency of biomarker monitoring, the strength of the recommendations and the quality of the evidence. Recommendations focusing on special populations were excluded from this analysis. All results were presented with frequencies and percentages. Univariate analysis was carried out to assess differences in selected variables by income classification, with a P value <0.05 deemed significant. All analyses were done in R version 3.6 (R Core Team, Vienna, Austria).
A total of 89 full text guidelines from the 40 countries, and 8 full-text guidelines from international/regional organisations were reviewed. Specifically, 37 T2DM guidelines, 33 hypertension guidelines and 27 HIV guidelines were reviewed for this study. Less than half of the countries had national guidelines for all three conditions (n = 18, 42.9%), while 10 (23.8%) had guidelines for 2 of the conditions and a third only had guidelines for only one condition (n = 14, 33.3%). Fully, 85% (n = 34) of the 40 countries included had guidelines for T2DM and included specific recommendations for long-term management. Nearly three-quarters of countries (n = 30, 71.4%) had hypertension guidelines that specified treatment targets, and just over half of the countries (n = 25, 60%) had HIV guidelines that also outlined biomarker-based treatment targets. A summary of the country of origin, year of publication, source of the guidelines, recommendations, the strength of the recommendations, and the quality of the evidence for control targets, and the frequency of monitoring are reported in Tables S1-S3 in the Online Supplementary Document .
Treatment targets
Treatment targets for all three conditions vary across guidelines as shown in Figure 1 . Most T2DM guidelines recommend an HbA1c target of ≤7.0% (n = 26, 72.2%) or ≤6.5% (n = 7, 19.4%) as the ideal glycaemic target for non-pregnant adults. Hypertension guidelines recommend blood pressure (systolic blood pressure/diastolic blood pressure) targets of <140/90 mm Hg (94% of the guidelines) and <130/80 mm Hg (6% of the guidelines) for the same population showing remarkable consistency across countries and settings. For HIV, approximately two-thirds (n = 18, 66.7%) of the identified guidelines define virologic failure as HIV viral load >1000 copies/mL, with 26% (n = 7/27), mostly HICs, defining virologic failure as a HIV viral load >200 copies/mL. While there is no discernible relationship between the choice of treatment target and country income classification (HIC vs LMIC) for T2DM (P = 0.436), and hypertension (P = 0.685), there does appear to be a difference between the HIV virologic failure definition and guideline setting (P = 0.001).
Overall, only 14 (42.4%) of the identified hypertension guidelines, including the global and regional guidelines, provide the strength of the recommendation or the quality of the evidence supporting the adoption of the treatment goal, with many of the guidelines being from HICs (9 of the guidelines) or Upper-MICs (3 of the guidelines). For T2DM guidelines, 20 (54%) of the identified guidelines provide the strength of the recommendation or the quality of the evidence supporting the adoption of the glycaemic target, with HICs and Upper-MICs accounting for 18 (90%) of these, while only 10 (37%) of the HIV guidelines provide the quality of evidence supporting the definition of treatment failure. Comparing the availability of the strength of the recommendation or the quality of the evidence in guidelines between HIC and LMIC, 9 (82%) of the hypertension guidelines from HIC provided both as opposed to 3 (16%) LMIC guidelines (P < 0.001). All HIV guidelines from HIC provided information on the strength of the recommendation or evidence rating as opposed to 1 guideline from LMIC (P < 0.001), while 11 (79%) of the T2DM guidelines from HIC as opposed to 8 (40%) from LMIC (P = 0.026).
Monitoring frequency
In most of the identified guidelines, for all three conditions, recommendations for the frequency of monitoring the relevant biomarkers were different for newly diagnosed patients (on treatment) or patients not meeting targets, and patients meeting targets on stable, unchanging therapy. Recommendations for the frequency of monitoring for any diagnosed patients were available in 18 (55%) of the hypertension guidelines, 25 (93%) of HIV guidelines, and 27 (73%) of the T2DM guidelines. Of these, only 5 (28%) of the hypertension guidelines, 4 (16%) of the HIV guidelines and 11 (41%) of the T2DM guidelines provide the strength of the recommendation and the quality of the evidence.
Newly diagnosed patients or patients not meeting targets often require more frequent monitoring than for patients meeting targets as there is a need to assess a patient’s response to treatment and possible adverse effects before achieving control [11]. This difference is reflected in most of the identified guidelines, for all three conditions. As an example, recommendations for monitoring HbA1c among newly diagnosed patients or patients not meeting targets ranges from 2 to 6 months, while stable patients can be monitored between 2 months and 1 year ( Figure 2 and Figure 3 ). Comparing the recommended frequency of monitoring for HIC vs LMIC, there is no significant difference for T2DM and hypertension, while there appears to be a difference between income classification and the recommended frequency of monitoring of HIV viral load for both newly diagnosed patients (P = 0.005) and stable patients (P = 0.017).
Strength of the recommendations and the quality of evidence
When provided, the rating scales of both strength of the recommendation and the quality of evidence often differed depending on guideline source(country) and condition. The strength of the recommendation is dependent, primarily, on the quality of the clinical and economic evidence supporting the recommendation [26]. Other considerations include patient values and preferences, applicability of published evidence to the target population, the balance of benefits and harms of the options, and economic costs [13]. Generally, the grading of recommendations ranges from strong, conditional, optional, or weak. Consistent, patient-oriented evidence from systematic reviews, meta analyses on high-quality RCTs with a low risk of bias, or high-quality individual RCTs are generally associated with the strong recommendations, while consensus guidelines, usual practice, expert opinion and disease-oriented evidence are generally associated with weak recommendations. The glycaemic targets, HbA1c <7.0% and HbA1c <6.5% both had strong recommendations in T2DM guidelines, based on high quality evidence, as did blood pressure targets <130/80 mm Hg and <140/90 mm Hg in hypertension guidelines. The most common definitions of viral failure, VL>200 copies/mL and VL>1000 copies/mL, both had a strong recommendation though the latter is based on low-quality evidence. For all three conditions, the frequency of monitoring in most of the guidelines was based on consensus and expert opinion and had weak recommendations.
By focusing on treatment targets and the frequency of monitoring after initiating treatment, we have provided valuable insight into the differences and similarities of the recommendations adopted in clinical practice guidelines in HICs and LMICs for the management of HIV, hypertension and T2DM. Moreover, this study has also been able, in part, to identify how current recommendations were arrived at for the three conditions. Treatment targets are largely based on systematic reviews and meta analyses on high-quality RCTs with a low risk of bias, or high-quality individual RCTs, often on populations from high income settings. The frequency of monitoring is often based on expert opinion or clinical consensus with little to no consideration made for factors like genomics, socioeconomic context or resource availability despite significant disparities among developed and developing countries. For all three conditions, there are questions concerning the choice of target and thresholds for treatment failure, while validation of the frequency of monitoring in different settings has been the subject of ongoing research.
Treatment targets
There is general agreement on the target for glycaemic control with most guidelines recommending HbA1c <7.0%, while a few recommend a more stringent target of HbA1c <6.5%. In all the recommendations for treatment targets, allowance was made factors like age, risk of hypoglycaemia, duration of diabetes and comorbidities. Targets for glycaemic control are generally based on the same randomised trials, the United Kingdom Prospective Diabetes Study (UKPDS,1977-1997), the Action to Control Cardiovascular Risk in Diabetes (ACCORD, 2001-2005) study, the Kumamoto study (1995), and the Action in Diabetes and Vascular Disease: Preterax and Diamicron MR Controlled Evaluation (ADVANCE, 2008) study [27-30 ]. In these studies, glycated haemoglobin (A1C) levels below 7.0% were found to be associated with reductions in clinical microvascular and cardiovascular events. The demographics of the study participants, however, suggest that generalization to LMICs may be inappropriate [31]. The UKPDS had 83% of participants with European ethnicity with a median age of 54 years (IQR 48–60 years), while the ACCORD trial had a mean age of 62 with 64.8% being white. Given the variability in HbA1c for factors like age, race, ethnicity and the existence of comorbidities, assigning a fixed target might not be appropriate thereby giving credence to the Diabetes Canada’s recommendation for individualised targets [32].
Targets for monitoring BP were also consistent at <140/90 mm Hg for most countries and regional bodies and were based on evidence from the ACCORD study [30]. The only exception, however, was the most recent recommendation from the American College of Cardiology (ACC) and American Heart Association (AHA) which was more stringent in defining hypertension and the treatment target [33]. The 2017 ACC/AHA guidelines, which recommend systolic BP<130 or diastolic BP<80 mm Hg, was based on evidence from the SPRINT (Systolic Blood Pressure Intervention Trial) which found that adopting the new definition leads to a reduced rate of major adverse cardiovascular events [34]. This, however, has been questioned by many experts. Poulter et al [35]. on behalf of the International Society of Hypertension, questioned the relevance of these guidelines from a global perspective, especially in low- and middle-income countries. Issues raised included the low global control rates (15%) which would be even lower under the ACC/AHA guidelines and an increase in the prevalence of hypertension. Additional concerns raised were the dangers of inappropriately labelling people as hypertensive which may cause anxiety and over-inflated hypertension treatment in low-risk younger people (especially women) and inadvertent drug associated adverse events.
Looking at the impact of adopting the new blood pressure targets on the South African population, Rayner et al [36] estimated that the prevalence of hypertension would increase from 35.1% in South Africa in 2018 under the current definition, to 50.2% if the new definition is applied. This was calculated assuming that the increase in the number of patients with diagnosed hypertension under this new guideline would be the same as the increase by reported in the USA (43%) [37]. Additionally, Rayner et al highlighted how the new targets would lead to increased demands on already overburdened and under-resourced health services. Specifically, it would lead to increases in; health visits to monitor patients, use of antihypertensives to achieve the lower target, use of laboratory services to monitor for adverse effects. Moreover, it would require a major retraining of health workers. In terms of the financial implications, Wang et al [38] mention how this definition also leads to substantial increases in the direct medical costs with China needing an additional US$ 3.6 billion direct medical costs in each fiscal year.
For viral load, most LMICs adopted the WHO recommendation of VL>1000 copies/mL on 2 consecutive viral load measurements as indicative of failure [39]. The chosen target is based on the fact that the risk of HIV transmission and disease progression is very low when viral load is lower than 1000 copies/mL, and that below this threshold, viral blips or intermittent low-level viraemia (50–1000 copies/mL) can occur during effective treatment but have not been associated with an increased risk of treatment failure [40-42]. Intuitively, if LMICs used the definition used by HICs this would also lead to an increase in the number of individuals with treatment failure, hence requiring more frequent monitoring and testing and potentially unnecessary changes in treatment.
Monitoring frequency
As monitoring frequency recommendations for all three conditions were based on clinical experience or expert consensus, there is a gap in the evidence on the benefits of repeated testing on patient outcomes [43]. There are, however, several studies investigating the evidence on the optimal monitoring strategies and testing intervals for the three conditions in question. For T2D, studies investigating the relationship between frequency of HbA1c monitoring and glycaemic control have found that lower frequency of monitoring generally led to poor glycaemic control [44,45 ]. For newly diagnosed patients and patients not meeting targets, the optimum testing frequency was between 2-4 months, with a testing frequency greater than 6 months being associated with a deterioration in control [44-47]. Much like for T2DM patients, patients with newly diagnosed hypertension and those not meeting hypertension treatment targets are expected to have more frequent health care facility visits until they are well-controlled. For blood pressure monitoring, there is evidence to suggest that there are no significant true blood pressure changes within a 12-month period for well controlled patients [48-50 ], however, reviews every 6 months allow for reinforcement of lifestyle changes, assessment of new risk factors, a review of medication adherence and repeat prescriptions [51].
For HIV, frequent VL monitoring has been found to enable early and accurate diagnosis of treatment failure before the development of drug resistance mutations, thereby improving the quality of care that HIV patients receive [52]. Some studies have shown that visit intervals longer than 3 months did not worsen viral load, and that gaps in care of less than 9 months led to non-significant viral load increases [53-55], especially in clinically stable patients with suppressed viral load. It should be noted that most of the studies mentioned here were in high income settings. As such, there is a need for further investigations into different monitoring frequencies in lower- and middle-income settings.
While there is not much difference between the frequency of monitoring HbA1c in T2DM patients and blood pressure in patients with diagnosed hypertension for LMICs vs HIC, intervals chosen for newly diagnosed HIV patients on ART, patients not meeting targets, and patients meeting targets are also wider for LMICs compared to HICs for individuals on ART. The WHO recommendation of monitoring 6 and 12 months after ART initiation then annually once viral suppression is attained has been adopted by many LMICs. This recommendation, however, has been questioned, with Rafie et al [56] arguing that the use of annual intervals rather than 6-monthly intervals likely increased the time patients remained on a failing (standard) regimen by an additional 6 months, thus increasing the risk of developing antiretroviral resistance mutations.
Most HIC guidelines and Upper-MIC, which were developed by professional associations or third-party guideline production organisations, were clear about the source of the recommendations. Recommendations and the quality of evidence were often graded using the Grading of Recommendations, Assessment, Development and Evaluations (GRADE) [57] or the Strength of Recommendation Taxonomy (SORT) [58] tools. The use of the Appraisal of Guidelines Research & Evaluation (AGREE II) instrument [59] to evaluate and appraise the guidelines was also reported. Another tool which was reported is the ADAPTE methodology, which is used when adapting guidelines produced elsewhere for local use and essentially involves taking the best or most appropriate recommendations and repackaging them into a new local guideline [60 ]. In contrast, it can be difficult to ascertain how recommendations were arrived at in some LICs and Lower-MICs as guidelines were vague about the source of recommendations [16,17]. In essence, most guidelines from LMICs were largely adopted or adapted from existing HIC guidelines or international and regional organisation guidelines. However, considerations for resource availability, contextual factors, logistical issues and general feasibility are often not made hence affecting implementability of guidelines in these areas. The paucity of locally derived evidence, poorly designed clinical studies, and low weight of evidence [17] have been identified as possible bottlenecks for developing recommendations that are contextually relevant.
Strengths and limitations of the study
This is the first study to compare specific guideline recommendations for the frequency of monitoring and treatment targets among patients with T2DM, HIV/AIDS or hypertension in a range of countries. Despite its achievements, this study is not without limitations. First, the study did not make use of the instrument that is considered the gold standard for practice guideline evaluation, the Appraisal of Guidelines for Research and Evaluation (AGREE II) [59 ]. This is because the aim of this study was not to evaluate the quality of the guidelines as a whole, but to compare specific recommendations between countries and to identify the evidence guiding the adoption of the recommendation. Second, the focus on just monitoring frequency and treatment targets can be seen as a weakness as chronic disease management entails more than just monitoring and treatment targets. Third, the country selection procedure was not exhaustive as it was based on previous studies. We, however, believe that the aim of our study is still met, and the analysis is not altered greatly by the countries not selected. Lastly, monitoring often includes aspects of routine health care and/or routine screening for other conditions which may bring indirect benefit to individuals and populations. This was not accounted for in this study.
While clinical practice guidelines seek to address the clear medical need for guidance in the management of chronic conditions, current recommendations for the frequency of monitoring and targets after initiating treatment of T2DM, HIV or hypertension may not meet the needs of local populations at the national or provincial level in LMICs because they are usually drawn from guidelines developed and based on studies carried out in HIC. It might therefore be necessary to generate LMIC-specific fine tuning of recommendations in order to meet the chronic care needs of these populations with an understanding of the socioeconomic context. To generate evidence, in the absence of well designed, adequately powered, randomised, and nonrandomised trials in LMICs, it might be worthwhile to investigate the increased use of simulation studies. It may also be necessary to develop studies that validate current recommendations in LMICs, especially with regards to the frequency of monitoring.