Any views expressed within media held on this service are those of the contributors, should not be taken as approved or endorsed by the University, and do not necessarily reflect the views of the University in respect of any particular issue.

Week 7-8

This week I mainly focus on my Insights through Data’s final group project work. Firstly, we find a dataset related to vote rate. We try to find the correlation between GDP and vote rate. However, one this is that different countries will have different time to hold the election, which is difficult for us to compare the different in the same year. Besides, we also want to compare the differences among countries. Thus, we need more data to support us to do the data analysis while some data is hard to find through a reliable source. Finally, we decided to change to another dataset named WHO. We pay attention to the relationships among variables like status (Nominal), life_expectancy (Ratio), percentage_expenditure (Ratio) and so on. This dataset is big and reliable, so we directly use this one to do the analyses. The first step is to clean the data, which means we need to remove or delete the missing data and outliers. We need to consider the following parts:
1. Detect the outliers
• Boxplots/histograms
• Tukey’s Method
2. Deal with outliers
• Drop outliers?
• Limit/Winsorize outliers?
• Transform the data using log/inverse/square root/etc?

Since each variable has a unique amount of outliers and also has outliers on different sides of the data, the best route to take is probably winsorizing (limiting) the values for each variable on its own until no outliers remain. After this step, we can do the data exploration.

Through group work, I figure out a lot of problems. I try to use code and programming to analyze realistic problem.

For another course——Clonality of Data, we use a new form to present our ideas. We choose poster which is totally a new method to solve the problem. Muslin is a absolutely new area knowledge for me. I struggled for a period. Thanks to my team member Emily, she gave me a lot of explanations and advice of the topic. I start to find it is not so hard to understand.

I changed my project’s outline. I think it’s hard to find inequality in social media and data analysis. One of the more interesting arguments I found in the course inside through data is about data bias and algorithmic bias. I think I would prefer to look at algorithmic bias. I want to find a specific example. In-depth analysis of this example will help me solve the algorithm bias research. I want to start from its causes, combined with social factors and technical dilemmas to elaborate on the algorithm bias. And then some discussion about his influence. I am currently concerned about. Gender bias in the workplace and racial inequality in the health care system. This example is quite interesting, but I still need some time to make clear which example to refer to for analysis

1 reply to “Week 7-8”

  1. Hanyu Wang says:

    Thank you for updating your thoughts on your final project. You seem to have further narrowed down your topic, which is really good. I wish you could find a specific case to discuss very soon:). Have a nice week!

Leave a reply

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

css.php

Report this page

To report inappropriate content on this page, please use the form below. Upon receiving your report, we will be in touch as per the Take Down Policy of the service.

Please note that personal data collected through this form is used and stored for the purposes of processing this report and communication with you.

If you are unable to report a concern about content via this form please contact the Service Owner.

Please enter an email address you wish to be contacted on. Please describe the unacceptable content in sufficient detail to allow us to locate it, and why you consider it to be unacceptable.
By submitting this report, you accept that it is accurate and that fraudulent or nuisance complaints may result in action by the University.

  Cancel