Text mining for finding the best place to pursue medical research

Text mining for finding the best place to pursue medical research

- 1 min

Screenshot

Text mining for finding the best place pursuing medical research

This is an open-end project that I worked with a team in a Statistic class at Rice University.

Where is the best place for medical research?

We explore where the best places for health research and innovation are, and how this relates to the rates of specific illnesses.

“Where” is meant to be general and refers not only to specific locations, such as cities/states, but also specific diseases, such as obesity, diabetes, and other characteristics. We then rank the 11 top cities for health research and innovation.

In order to figure out where the best places are, we first analyze funding on three dimensions: type of disease, time, and location.

Based on this high-level analysis, we find location is the most important factor. We deep dive into location, specifically city-level analysis.

We breakdown the importance into three categories, which we call: access to talent, access to resources, and access to funding. Based on these three dimensions, we rank the top 11 cities. Based on our ranking system. New York is the top city, while Seattle is the bottom of the 11 cities.

The results are presented beautifully via ggplot and wordcloud using R!


Check it out here. If you need some help, just tell me.

rss facebook twitter github gitlab youtube mail spotify lastfm instagram linkedin google google-plus pinterest medium vimeo stackoverflow reddit quora quora