{"id":46628,"date":"2021-01-15T00:00:00","date_gmt":"2021-01-15T08:00:00","guid":{"rendered":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/"},"modified":"2025-11-13T12:55:09","modified_gmt":"2025-11-13T20:55:09","slug":"exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r","status":"publish","type":"post","link":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/","title":{"rendered":"Exploratory Data Analysis and Visualization using the BRFSS Dataset in R"},"content":{"rendered":"<h2>About the Dataset<\/h2>\n<p>BRFSS stands for Behavioural Risk Factor Surveillance system. The objective of the BRFSS is to assess behavioural risk factors for non-institutionalized adults (age>=18) residing in the United States. To collect these health instances, a total number of 50 states within the US orchestrates telephone-based surveys which selects a random adult from the household.<\/p>\n<p>Based on this information, we could say that Random Sampling is followed. Therefore, the conclusion obtained from this sample could be generalized to a larger and relevant population.<\/p>\n<p>However, since we\u00e2\u20ac\u2122re conducting surveys, which means it would also depend on the adult whether or not to volunteer, we can not say with certainty that the assignment is completely random. Establishing a causal relation between two parameters based on this data, thus, would not be appropriate.<\/p>\n<p>There are a total of 330 parameters in the dataset. These parameters are responsible for noting down the age, physical health, mental health, medication, etc. The total number of instances present in the dataset are 4,91,775.<\/p>\n<p>We would be using this public dataset for the year 2013 which can be downloaded here from <a href=\"https:\/\/www.kaggle.com\/cdc\/behavioral-risk-factor-surveillance-system?select=2013.csv\">Kaggle<\/a>. The website has also provided more recent versions of this <a href=\"https:\/\/www.kaggle.com\/cdc\/behavioral-risk-factor-surveillance-system\">dataset<\/a> from 2015 both in CSV and JSON formats, nevertheless, this tutorial would be extensible to any dataset.<\/p>\n<h2>Environment Setup<\/h2>\n<p>The following tutorial is carried out in RStudio (R version 4.0.2) on Windows 10. However, the syntax would remain the same irrespective of the Operating System used.<\/p>\n<h3>Install and Load the necessary packages<\/h3>\n<div class=\"clipboard\">\n<pre><code class=\"lang-python\">install.packages(\"dplyr\")\ninstall.packages(\"ggplot2\")\nlibrary(dplyr)\nlibrary(ggplot2)<\/code><\/pre>\n<\/div>\n<h3>Load the Dataset<\/h3>\n<p>The dataset needs to be in the same directory as your R Working directory, otherwise, the below-mentioned command would output an error message.<\/p>\n<p><code>load(\"brfss2013.RData\")<\/code><\/p>\n<p><code>.RData<\/code> is an extension of datasets particularly belonging to R. <\/p>\n<h3>Troubleshooting<\/h3>\n<p>Get your working directory using:<\/p>\n<p><code>getwd()<\/code><\/p>\n<p>In case your working directory is different from the location of your dataset, set your working directory using the following command. Alternatively, move the dataset to the above output location.<\/p>\n<p><code>setwd(dir)<\/code><\/p>\n<p>Where dir is a character string specifying your desired path. For more information on how to set your path, <a href=\"https:\/\/support.rstudio.com\/hc\/en-us\/articles\/200711843-Working-Directories-and-Workspaces\">see here<\/a>.<\/p>\n<h2>An easy alternative &#8211; GridDB<\/h2>\n<p>The above-mentioned dataset is nearly 600 MB. While this may not seem a big deal, uploading such an enormous amount of data on RCloud does not result in a fruitful outcome. Instead, the server takes a lot of time uploading this dataset due to which it eventually times out. This chunk of information is important because the tutorial we have carried out needs to be done using your local system. This does not leave room for scalability, especially if you\u00e2\u20ac\u2122re working in an organization which more often than not involves collaborating with multiple people. <\/p>\n<p>To solve this problem, we have a highly scalable and just the right solution for your data &#8211; <a href=\"https:\/\/griddb.net\/en\/\">GridDB<\/a>. GridDB is a highly scalable, reliable and relatively faster tool for your data storage. It also supports a number of programming languages including Java, C, Python, etc.<\/p>\n<p>Check out our quick tutorial on setting up Python-GridDB client <a href=\"https:\/\/docs.griddb.net\/gettingstarted\/python\/\">here<\/a>. Alternatively, you can <a href=\"https:\/\/griddb.net\/en\/downloads\/\">manually download<\/a> this open-source database especially optimized for IoT and Big Data.<\/p>\n<h2>Filtering and Visualization<\/h2>\n<p>Our foremost research focus for this tutorial is to visualize the distribution of a person\u00e2\u20ac\u2122s general health and whether or not it varies with a person\u00e2\u20ac\u2122s marital status. This research question would focus on two parameters from this dataset: General Health parameter (denoted by genhlth in the dataset) and the marital status (denoted by marital in the dataset).<\/p>\n<p>General Health of a person is subdivided into 5 main categories &#8211; Excellent, Very Good, Good, Fair, Poor and for some of the entries, this information might not be available (<code>&lt;NA&gt;<\/code>).<\/p>\n<p>Similarly, the possible values for the marital status are: Married, Divorced, Widowed, Separated, Never Married, or a member of an unmarried couple. Again, some of the participants might not have responded and those entries would have been filled with <code>&lt;NA&gt;<\/code>.<\/p>\n<p>Let us first review what is the distribution of people under the general health category.<\/p>\n<div class=\"clipboard\">\n<pre><code class=\"lang-python\">brfss2013 %>% \n    group_by(genhlth) %>% \n    summarise(count = n())<\/code><\/pre>\n<\/div>\n<p>Gives the following output:<\/p>\n<pre>## # A tibble: 6 x 2\n##   genhlth    count\n##   <fct>      <int>\n## 1 Excellent  85482\n## 2 Very good 159076\n## 3 Good      150555\n## 4 Fair       66726\n## 5 Poor       27951\n## 6 <NA>        1985<\/pre>\n<p>We can see that 1985 people have not responded to the question at hand. Hence, we will be filtering out such entries.<\/p>\n<div class=\"clipboard\">\n<pre><code class=\"lang-python\">brfss2013 %>% \n  filter(genhlth != \"NA\") %>%\n  group_by(genhlth) %>% \n  summarise(count = n())<\/code><\/pre>\n<\/div>\n<p>The output should look something like this now.<\/p>\n<pre>## # A tibble: 5 x 2\n##   genhlth    count\n##   <fct>      <int>\n## 1 Excellent  85482\n## 2 Very good 159076\n## 3 Good      150555\n## 4 Fair       66726\n## 5 Poor       27951<\/pre>\n<p>We shall now have a look at the other parameter of interest: Marital Status. Again, we will be filtering out any missing entries.<\/p>\n<div class=\"clipboard\">\n<pre><code class=\"lang-python\">brfss2013 %>% \n  filter(marital!=\"NA\") %>%\n  group_by(marital) %>% \n  summarise(count = n())<\/code><\/pre>\n<\/div>\n<pre>## # A tibble: 6 x 2\n##   marital                          count\n##   <fct>                            <int>\n## 1 Married                         253329\n## 2 Divorced                         70376\n## 3 Widowed                          65745\n## 4 Separated                        10662\n## 5 Never married                    75070\n## 6 A member of an unmarried couple  13173<\/pre>\n<p>Now, for visualizing every possible combination, we would need to aggregate this data based on these two parameters somehow. For this purpose, the dplyr library offers a function count which will be used to count people with excellent health and compare their marital status.<\/p>\n<div class=\"clipboard\">\n<pre><code class=\"lang-python\">agg = count(brfss2013, genhlth, marital) %>%\n  filter(genhlth!=\"NA\", marital!=\"NA\")<\/code><\/pre>\n<\/div>\n<p>Let\u00e2\u20ac\u2122s see what agg has stored.<\/p>\n<pre>head(agg)\n-->\n##     genhlth                         marital     n\n## 1 Excellent                         Married 49682\n## 2 Excellent                        Divorced 10049\n## 3 Excellent                         Widowed  7007\n## 4 Excellent                       Separated  1221\n## 5 Excellent                   Never married 14419\n## 6 Excellent A member of an unmarried couple  2364<\/pre>\n<p>Now that we have the count, let\u00e2\u20ac\u2122s see the proportion of married men\/women to their health.<\/p>\n<div class=\"clipboard\">\n<pre><code class=\"lang-python\">ggplot(agg %>%\n         filter(marital==\"Married\")) + geom_bar(aes(x = genhlth, y = n), stat = \"identity\")<\/code><\/pre>\n<\/div>\n<p>The above command gives the following output:<\/p>\n<p><a href=\"https:\/\/griddb.net\/wp-content\/uploads\/2021\/01\/chart.png\"><img fetchpriority=\"high\" decoding=\"async\" src=\"https:\/\/griddb.net\/wp-content\/uploads\/2021\/01\/chart.png\" alt=\"\" width=\"940\" height=\"538\" class=\"aligncenter size-full wp-image-27196\" srcset=\"\/wp-content\/uploads\/2021\/01\/chart.png 940w, \/wp-content\/uploads\/2021\/01\/chart-300x172.png 300w, \/wp-content\/uploads\/2021\/01\/chart-768x440.png 768w, \/wp-content\/uploads\/2021\/01\/chart-150x85.png 150w, \/wp-content\/uploads\/2021\/01\/chart-600x343.png 600w\" sizes=\"(max-width: 940px) 100vw, 940px\" \/><\/a><\/p>\n<p>Similarly, you could visualize this proportion for people with marital status other than married. Go ahead, plot some fancy plots with this simple code!<\/p>\n<h2>Conclusion<\/h2>\n<p>In this tutorial, we analyzed the public <a href=\"https:\/\/www.kaggle.com\/cdc\/behavioral-risk-factor-surveillance-system\">BRFSS dataset<\/a> using the R programming language. We discussed some limitations of using a huge dataset on your local system and an easy, much faster <a href=\"https:\/\/griddb.net\/en\/\">alternative<\/a>.<\/p>\n<p>Following our discussion, we also saw how to filter out the missing information in R after which we plotted a Bar plot based upon two parameters using the \u00e2\u20ac\u02dc<a href=\"https:\/\/ggplot2.tidyverse.org\/reference\/ggplot.html\">ggplot<\/a>\u00e2\u20ac\u2122 library.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>About the Dataset BRFSS stands for Behavioural Risk Factor Surveillance system. The objective of the BRFSS is to assess behavioural risk factors for non-institutionalized adults (age>=18) residing in the United States. To collect these health instances, a total number of 50 states within the US orchestrates telephone-based surveys which selects a random adult from the [&hellip;]<\/p>\n","protected":false},"author":41,"featured_media":26421,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[121],"tags":[],"class_list":["post-46628","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-blog"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.1.1 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Exploratory Data Analysis and Visualization using the BRFSS Dataset in R | GridDB: Open Source Time Series Database for IoT<\/title>\n<meta name=\"description\" content=\"About the Dataset BRFSS stands for Behavioural Risk Factor Surveillance system. The objective of the BRFSS is to assess behavioural risk factors for\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Exploratory Data Analysis and Visualization using the BRFSS Dataset in R | GridDB: Open Source Time Series Database for IoT\" \/>\n<meta property=\"og:description\" content=\"About the Dataset BRFSS stands for Behavioural Risk Factor Surveillance system. The objective of the BRFSS is to assess behavioural risk factors for\" \/>\n<meta property=\"og:url\" content=\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/\" \/>\n<meta property=\"og:site_name\" content=\"GridDB: Open Source Time Series Database for IoT\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/griddbcommunity\/\" \/>\n<meta property=\"article:published_time\" content=\"2021-01-15T08:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-11-13T20:55:09+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/wp-content\/uploads\/2020\/03\/chart.png\" \/>\n\t<meta property=\"og:image:width\" content=\"739\" \/>\n\t<meta property=\"og:image:height\" content=\"457\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"griddb-admin\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@GridDBCommunity\" \/>\n<meta name=\"twitter:site\" content=\"@GridDBCommunity\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"griddb-admin\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/\"},\"author\":{\"name\":\"griddb-admin\",\"@id\":\"https:\/\/griddb.net\/en\/#\/schema\/person\/4fe914ca9576878e82f5e8dd3ba52233\"},\"headline\":\"Exploratory Data Analysis and Visualization using the BRFSS Dataset in R\",\"datePublished\":\"2021-01-15T08:00:00+00:00\",\"dateModified\":\"2025-11-13T20:55:09+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/\"},\"wordCount\":882,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/griddb.net\/en\/#organization\"},\"image\":{\"@id\":\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/#primaryimage\"},\"thumbnailUrl\":\"\/wp-content\/uploads\/2020\/03\/chart.png\",\"articleSection\":[\"Blog\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/\",\"url\":\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/\",\"name\":\"Exploratory Data Analysis and Visualization using the BRFSS Dataset in R | GridDB: Open Source Time Series Database for IoT\",\"isPartOf\":{\"@id\":\"https:\/\/griddb.net\/en\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/#primaryimage\"},\"thumbnailUrl\":\"\/wp-content\/uploads\/2020\/03\/chart.png\",\"datePublished\":\"2021-01-15T08:00:00+00:00\",\"dateModified\":\"2025-11-13T20:55:09+00:00\",\"description\":\"About the Dataset BRFSS stands for Behavioural Risk Factor Surveillance system. The objective of the BRFSS is to assess behavioural risk factors for\",\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/#primaryimage\",\"url\":\"\/wp-content\/uploads\/2020\/03\/chart.png\",\"contentUrl\":\"\/wp-content\/uploads\/2020\/03\/chart.png\",\"width\":739,\"height\":457,\"caption\":\"GridDB Performance using Google Cloud Platform\"},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/griddb.net\/en\/#website\",\"url\":\"https:\/\/griddb.net\/en\/\",\"name\":\"GridDB: Open Source Time Series Database for IoT\",\"description\":\"GridDB is an open source time-series database with the performance of NoSQL and convenience of SQL\",\"publisher\":{\"@id\":\"https:\/\/griddb.net\/en\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/griddb.net\/en\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/griddb.net\/en\/#organization\",\"name\":\"Fixstars\",\"url\":\"https:\/\/griddb.net\/en\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/griddb.net\/en\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/griddb.net\/wp-content\/uploads\/2019\/04\/fixstars_logo_web_tagline.png\",\"contentUrl\":\"https:\/\/griddb.net\/wp-content\/uploads\/2019\/04\/fixstars_logo_web_tagline.png\",\"width\":200,\"height\":83,\"caption\":\"Fixstars\"},\"image\":{\"@id\":\"https:\/\/griddb.net\/en\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/griddbcommunity\/\",\"https:\/\/x.com\/GridDBCommunity\",\"https:\/\/www.linkedin.com\/company\/griddb-by-toshiba\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/griddb.net\/en\/#\/schema\/person\/4fe914ca9576878e82f5e8dd3ba52233\",\"name\":\"griddb-admin\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/griddb.net\/en\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/5bceca1cafc06886a7ba873e2f0a28011a1176c4dea59709f735b63ae30d0342?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/5bceca1cafc06886a7ba873e2f0a28011a1176c4dea59709f735b63ae30d0342?s=96&d=mm&r=g\",\"caption\":\"griddb-admin\"},\"url\":\"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/author\/griddb-admin\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Exploratory Data Analysis and Visualization using the BRFSS Dataset in R | GridDB: Open Source Time Series Database for IoT","description":"About the Dataset BRFSS stands for Behavioural Risk Factor Surveillance system. The objective of the BRFSS is to assess behavioural risk factors for","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/","og_locale":"en_US","og_type":"article","og_title":"Exploratory Data Analysis and Visualization using the BRFSS Dataset in R | GridDB: Open Source Time Series Database for IoT","og_description":"About the Dataset BRFSS stands for Behavioural Risk Factor Surveillance system. The objective of the BRFSS is to assess behavioural risk factors for","og_url":"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/","og_site_name":"GridDB: Open Source Time Series Database for IoT","article_publisher":"https:\/\/www.facebook.com\/griddbcommunity\/","article_published_time":"2021-01-15T08:00:00+00:00","article_modified_time":"2025-11-13T20:55:09+00:00","og_image":[{"width":739,"height":457,"url":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/wp-content\/uploads\/2020\/03\/chart.png","type":"image\/png"}],"author":"griddb-admin","twitter_card":"summary_large_image","twitter_creator":"@GridDBCommunity","twitter_site":"@GridDBCommunity","twitter_misc":{"Written by":"griddb-admin","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/#article","isPartOf":{"@id":"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/"},"author":{"name":"griddb-admin","@id":"https:\/\/griddb.net\/en\/#\/schema\/person\/4fe914ca9576878e82f5e8dd3ba52233"},"headline":"Exploratory Data Analysis and Visualization using the BRFSS Dataset in R","datePublished":"2021-01-15T08:00:00+00:00","dateModified":"2025-11-13T20:55:09+00:00","mainEntityOfPage":{"@id":"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/"},"wordCount":882,"commentCount":0,"publisher":{"@id":"https:\/\/griddb.net\/en\/#organization"},"image":{"@id":"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/#primaryimage"},"thumbnailUrl":"\/wp-content\/uploads\/2020\/03\/chart.png","articleSection":["Blog"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/","url":"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/","name":"Exploratory Data Analysis and Visualization using the BRFSS Dataset in R | GridDB: Open Source Time Series Database for IoT","isPartOf":{"@id":"https:\/\/griddb.net\/en\/#website"},"primaryImageOfPage":{"@id":"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/#primaryimage"},"image":{"@id":"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/#primaryimage"},"thumbnailUrl":"\/wp-content\/uploads\/2020\/03\/chart.png","datePublished":"2021-01-15T08:00:00+00:00","dateModified":"2025-11-13T20:55:09+00:00","description":"About the Dataset BRFSS stands for Behavioural Risk Factor Surveillance system. The objective of the BRFSS is to assess behavioural risk factors for","inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/griddb.net\/en\/blog\/exploratory-data-analysis-and-visualization-using-the-brfss-dataset-in-r\/#primaryimage","url":"\/wp-content\/uploads\/2020\/03\/chart.png","contentUrl":"\/wp-content\/uploads\/2020\/03\/chart.png","width":739,"height":457,"caption":"GridDB Performance using Google Cloud Platform"},{"@type":"WebSite","@id":"https:\/\/griddb.net\/en\/#website","url":"https:\/\/griddb.net\/en\/","name":"GridDB: Open Source Time Series Database for IoT","description":"GridDB is an open source time-series database with the performance of NoSQL and convenience of SQL","publisher":{"@id":"https:\/\/griddb.net\/en\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/griddb.net\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/griddb.net\/en\/#organization","name":"Fixstars","url":"https:\/\/griddb.net\/en\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/griddb.net\/en\/#\/schema\/logo\/image\/","url":"https:\/\/griddb.net\/wp-content\/uploads\/2019\/04\/fixstars_logo_web_tagline.png","contentUrl":"https:\/\/griddb.net\/wp-content\/uploads\/2019\/04\/fixstars_logo_web_tagline.png","width":200,"height":83,"caption":"Fixstars"},"image":{"@id":"https:\/\/griddb.net\/en\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/griddbcommunity\/","https:\/\/x.com\/GridDBCommunity","https:\/\/www.linkedin.com\/company\/griddb-by-toshiba"]},{"@type":"Person","@id":"https:\/\/griddb.net\/en\/#\/schema\/person\/4fe914ca9576878e82f5e8dd3ba52233","name":"griddb-admin","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/griddb.net\/en\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/5bceca1cafc06886a7ba873e2f0a28011a1176c4dea59709f735b63ae30d0342?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/5bceca1cafc06886a7ba873e2f0a28011a1176c4dea59709f735b63ae30d0342?s=96&d=mm&r=g","caption":"griddb-admin"},"url":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/author\/griddb-admin\/"}]}},"_links":{"self":[{"href":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/wp-json\/wp\/v2\/posts\/46628","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/wp-json\/wp\/v2\/users\/41"}],"replies":[{"embeddable":true,"href":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/wp-json\/wp\/v2\/comments?post=46628"}],"version-history":[{"count":1,"href":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/wp-json\/wp\/v2\/posts\/46628\/revisions"}],"predecessor-version":[{"id":51304,"href":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/wp-json\/wp\/v2\/posts\/46628\/revisions\/51304"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/wp-json\/wp\/v2\/media\/26421"}],"wp:attachment":[{"href":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/wp-json\/wp\/v2\/media?parent=46628"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/wp-json\/wp\/v2\/categories?post=46628"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/griddb-linux-hte8hndjf8cka8ht.westus-01.azurewebsites.net\/en\/wp-json\/wp\/v2\/tags?post=46628"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}