{"id":45792,"date":"2014-11-03T16:08:11","date_gmt":"2014-11-03T21:08:11","guid":{"rendered":"http:\/\/blogs.nejm.org\/cardioexchange\/?post_type=voices&#038;p=45792"},"modified":"2014-11-03T16:08:11","modified_gmt":"2014-11-03T21:08:11","slug":"a-big-data-approach-to-phenotype-based-clustering-of-heart-failure-patients","status":"publish","type":"post","link":"https:\/\/blogs.nejm.org\/cardioexchange\/2014\/11\/03\/a-big-data-approach-to-phenotype-based-clustering-of-heart-failure-patients\/","title":{"rendered":"A \u201cBig Data\u201d Approach to Phenotype-Based Clustering of Heart Failure Patients"},"content":{"rendered":"<p><i>CardioExchange\u2019s <b>Harlan M. Krumholz<\/b> and <b>John Ryan<\/b> interview <b>Tariq Ahmad<\/b> about his research group\u2019s study of clinical phenotypes in patients with chronic systolic heart failure. <a href=\"http:\/\/content.onlinejacc.org\/article.aspx?articleID=1918794\">The article, published in JACC,<\/a> includes the complete list of assessed variables.<\/i><\/p>\n<p><b><i>Krumholz and Ryan:<\/i><\/b><b> Please summarize your findings for our readers.<\/b><\/p>\n<p><b><i>Ahmad:<\/i><\/b> We performed a cluster analysis of 45 clinical variables from roughly 1600 patients with systolic heart failure who, in the <a href=\"http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/19351941\">HF-ACTION clinical trial<\/a>, had been randomly assigned to exercise training or usual care. Using this approach, we identified four phenotypically distinct, clinically meaningful groups of patients:<\/p>\n<ul>\n<li><i>Cluster 1:<\/i> predominantly elderly Caucasian men with ischemic cardiomyopathy and a high burden of comorbidities, advanced disease, and the highest mortality rate;<\/li>\n<li><i>Cluster 2:<\/i> the youngest patients, largely African Americans, with nonischemic cardiomyopathy and milder disease overall, but with high hospitalization rates in the face of lower overall mortality;<\/li>\n<li><i>Cluster 3:<\/i> patients with ischemic cardiomyopathy and severe angina symptoms;<\/li>\n<li><i>Cluster 4:<\/i> primarily Caucasian patients with nonischemic cardiomyopathy and milder disease.<\/li>\n<\/ul>\n<p>The groups differed in their risk for clinical outcomes and their response to exercise therapy. For example, Clusters 2 and 3 had significant improvement in peak VO<sub>2<\/sub> with exercise, whereas the other clusters did not. These findings highlight significant heterogeneity within the syndrome we call heart failure, as well as the potential for \u201cbig data\u201d approaches to improve phenotyping of the syndrome.<\/p>\n<p><b><i>Krumholz and Ryan:<\/i><\/b><b> Was the cluster-analysis software something you had been trained in before? Most of us think of cluster analysis in terms of genetic studies. How difficult was it to perform this in a clinical analysis?<\/b><\/p>\n<p><b><i>Ahmad:<\/i><\/b> I had not been trained in this approach, but I had experience with it because of my background in \u201comics\u201d research, where the approach is often used to make sense of large amounts of biological data. I was just completing a project on metabolomics of heart failure when this idea occurred to me. I consulted Dr. Michael Pencina at the Duke Clinical Research Institute, and luckily he had used this approach on clinical data from the Framingham study. He and Dr. Philip Schulte, both statisticians, worked with me on this project, using clustering procedures from SAS\/STAT software (PROC CLUSTER). They had extensive training in SAS and were able to implement the programming on clinical variables.<\/p>\n<p><b><i>Krumholz and Ryan:<\/i><\/b><b> Please explain what the software does.<\/b><\/p>\n<p><b><i>Ahmad:<\/i><\/b> This was an agglomerative, hierarchical clustering of patients. Agglomeration is a \u201cbottom up\u201d approach, which means we start with each patient as his or her own singleton cluster and iteratively combine clusters until all patients exist in one large cluster. The process is hierarchical in that when two clusters are combined, they are never separated or rearranged in later iterations. At any particular iteration in the process, we define a distance between every pair of clusters (using Ward\u2019s minimum variance method), and the pair with the smallest distance is merged together to form a new cluster. The iterative process can be stopped according to various criteria, before merging together heterogeneous clusters. The bottom line (in non-mathematical terms) is that cluster analysis groups together sets of objects (in this case, patients) in such a way that those in the same group are more similar to one another than they are to those in other groups.<\/p>\n<p><b><i>Krumholz and Ryan:<\/i><\/b><b> Cluster 3 patients had a lower mortality risk but perhaps a higher risk for hospitalization, compared with other clusters. Did this surprise you, and what conclusions can you draw from this observation?<\/b><\/p>\n<p><b><i>Ahmad:<\/i><\/b> This was a very interesting observation. Cluster 3 fit the profile of patients we commonly see in practice \u2014 those with ischemic cardiomyopathy and profound angina who have frequent admissions for chest pain and get treated for \u201cacute coronary syndromes.\u201d We found that these patients, even though their overall mortality risk might have been intermediate, had a terrible quality of life and a profound degree of angina. Indeed, if these realities were driving their frequent admissions, a focused approach on improving symptoms might keep them out of the hospital and improve their overall quality of life. The disease process in these patients may differ greatly from that of patients with nonischemic cardiomyopathy or those with multiple comorbidities, such as COPD and renal failure. Nonetheless, we currently apply similar interventions across the board. In the future, perhaps subcategorizing heart failure patients according to data-driven phenotypes rather than current subjective classification systems, and modifying or developing therapies accordingly, might improve patients\u2019 quality of life and clinical outcomes.<\/p>\n<p><b><i>Krumholz and Ryan:<\/i><\/b><b> How do you apply your cluster groups to decision making in clinical practice? What about validation \u2014 are your findings replicable?<\/b><\/p>\n<p><b><i>Ahmad:<\/i><\/b> Dr. Krumholz ably described the clinical implications of this approach in his extremely <a href=\"http:\/\/www.ncbi.nlm.nih.gov\/pubmed\/25006142\">well-written article on big data in <i>Health Affairs<\/i><\/a> earlier this year. He wrote:<\/p>\n<blockquote><p><i>\u2026 researchers can use approaches that are designed to reveal clusters of patient groups that might suggest new taxonomies of disease based on how similar they are according to a broad range of characteristics, including outcomes. It may be, for instance, that based on biological, clinical, behavioral, and outcomes data there are many more types of diabetes than previously appreciated. The empirical classification could be shown to have value in selecting treatment strategies and predicting outcomes. This knowledge can be useful even in advance of understanding the underlying mechanisms of disease and response to therapy.<\/i><\/p><\/blockquote>\n<p>Clinicians have known for some time that heart failure is not a single disease but, rather, a syndrome that comprises several disease subtypes. Historically, we have classified heart failure according to measures (e.g., LVEF and NYHA class) that do not adequately capture its phenotypic variation. However, now that we have access to large amounts of patient data, as well as the computational capability to analyze those data, I suspect we will use similar approaches to redefine the syndrome in a way that is closer to \u201cthe truth.\u201d Clusters of heart failure are likely to vary according to the patient population being examined. For example, Yale might have different clusters of patients than Duke, and a risk-factor score or a therapy developed from a large trial might apply differently to those two patient groups. Currently, we do not take this possibility into account when caring for patients. However, this challenge is not too different from what companies such as Target, Netflix, or Amazon face when they consider user- and location-specific information to tailor their interactions with individual customers. Perhaps, in the near future, we will be able to do the same when we care for patients with heart failure.<\/p>\n<p><b>JOIN THE DISCUSSION<\/b><\/p>\n<p><b>Share your reactions to Dr. Ahmad\u2019s data-driven analysis of clinical clusters of heart failure patients.<\/b><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Tariq Ahmad discusses his research group\u2019s study of clinical phenotypes in patients with chronic systolic heart failure.<\/p>\n","protected":false},"author":406,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14],"tags":[2413,287,2412],"class_list":["post-45792","post","type-post","status-publish","format-standard","hentry","category-heart-failure","tag-cluster-analysis","tag-heart-failure-2","tag-phenotypes"],"_links":{"self":[{"href":"https:\/\/blogs.nejm.org\/cardioexchange\/wp-json\/wp\/v2\/posts\/45792","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.nejm.org\/cardioexchange\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.nejm.org\/cardioexchange\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.nejm.org\/cardioexchange\/wp-json\/wp\/v2\/users\/406"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.nejm.org\/cardioexchange\/wp-json\/wp\/v2\/comments?post=45792"}],"version-history":[{"count":0,"href":"https:\/\/blogs.nejm.org\/cardioexchange\/wp-json\/wp\/v2\/posts\/45792\/revisions"}],"wp:attachment":[{"href":"https:\/\/blogs.nejm.org\/cardioexchange\/wp-json\/wp\/v2\/media?parent=45792"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.nejm.org\/cardioexchange\/wp-json\/wp\/v2\/categories?post=45792"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.nejm.org\/cardioexchange\/wp-json\/wp\/v2\/tags?post=45792"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}