The aims of this assignment are to put into practice the concepts covered in lectures and apply these to a real dataset of your own choosing, and to demonstrate your ability to use R to carry out data analytics tasks.
The main steps you need to carry out and report on are:
Selecting a dataset suitable for analysis and identifying the problem you wish to explore.
Summarising and visualising your data set.
Preparing your dataset for analysis (data cleansing, choosing appropriate features to model, etc.).
Choosing an appropriate model or models for your chosen problem and building, running and evaluating these.
Presenting and interpreting the results of your models and relating these to your initial problem.
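As a rough illustration only, the steps above might be sketched in R as follows. This is a minimal sketch under stated assumptions, not a template to copy: the built-in mtcars data stands in for a dataset of your own, and the chosen variables, the 70/30 train/test split and the linear regression model are all illustrative choices rather than requirements of the assignment.

```r
# Illustrative workflow only: mtcars is a stand-in for your own dataset,
# and the variables, split ratio and model are assumptions for this sketch.
set.seed(42)                       # make the random train/test split reproducible

data(mtcars)
cars <- mtcars

# 1. Summarise and visualise the data
str(cars)                          # variable types and first few values
print(summary(cars$mpg))           # five-number summary plus mean of the target
pdf(NULL)                          # discard plots when run non-interactively
hist(cars$mpg, main = "Fuel economy", xlab = "Miles per gallon")
plot(cars$wt, cars$mpg, xlab = "Weight (1000 lbs)", ylab = "Miles per gallon")
invisible(dev.off())

# 2. Prepare the data: check for missing values, recode, select features
n_missing <- sum(is.na(cars))
cars$am <- factor(cars$am, levels = c(0, 1), labels = c("automatic", "manual"))
features <- cars[, c("mpg", "wt", "hp", "am")]

# 3. Build a model on a training set and evaluate it on held-out data
train_idx <- sample(nrow(features), size = floor(0.7 * nrow(features)))
train <- features[train_idx, ]
test <- features[-train_idx, ]
fit <- lm(mpg ~ wt + hp + am, data = train)
pred <- predict(fit, newdata = test)
rmse <- sqrt(mean((test$mpg - pred)^2))

# 4. Present the results so they can be related back to the original question
print(summary(fit)$coefficients)
cat("Held-out RMSE:", round(rmse, 2), "\n")
```

Evaluating on held-out rows rather than on the training data gives a fairer picture of how the model would behave on unseen cases, which is the kind of process evidence the report should discuss.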
The choice of data and problem to work on is entirely up to you. There are lots of public data sets available (with some suggestions at the end of this page), but you should check with me first that the data you have chosen is appropriate. The emphasis is very much on the process: if the techniques you have chosen don’t work very well or fail to produce particularly interesting results, this is not a problem, provided you followed the appropriate steps to understand and prepare the data and to select appropriate models.
Your assignment needs to be presented as a report of at most 10 pages in PDF format, and you should also submit the R code you developed and used as an appendix to this report.
This assignment is worth 50% of the marks for the class and will be marked according to the following scheme:
Selecting a dataset suitable for analysis and identifying the problem you wish to explore. (5%)
Summarising and visualising your data set. (10%)
Preparing your dataset for analysis (data cleansing, choice of features, etc.). (10%)
Choice of model, evaluation and validation. (15%)
Interpretation and explanation of the results of your models and implications of these for your initial problem. (10%)
Your report should be submitted by midnight on Sunday 26th February 2017.
Some hints on choosing data sets
There are huge numbers of public data sets available. Here are just a few suggestions:
http://archive.ics.uci.edu/ml/ currently has 360 data sets maintained for the machine learning community, many of which have been studied, analysed and used in research papers.
http://finance.yahoo.com/market-overview/?bypass=true is a huge source of financial data (if that’s a route that appeals).
https://data.glasgow.gov.uk/ offers various open data sets (of varying quality) relating to Glasgow. Other cities offer similar services.
Alternatively, just search for something that you are interested in.
There is no need to make things too hard for yourselves, but at the same time try to avoid data sets that have already been extensively studied.