Category: Data Analytics

  • Title: Multiple Regression Analysis on the Factors Affecting Wrinkle Resistance of Cotton Cloth

    Complete all the writing parts of this assignment:
    The data you will use for this week’s homework is hypothetical research data on the wrinkle resistance of cotton cloth. In this case, a research chemist wants to understand how several predictors are associated with the wrinkle resistance of cotton cloth. The chemist examines 32 pieces of cotton cellulose produced at different settings of curing time, curing temperature, formaldehyde concentration, and catalyst ratio. The durable press rating, which is used as a measure of wrinkle resistance, is recorded for each piece of cotton.
    Instructions
    Import the data in WrinkleResistance.xlsx file into SPSS
    Create variable labels for each variable using the variable descriptions below
    Variable   Description
    Conc       The setting of formaldehyde concentration
    Ratio      The catalyst ratio
    Temp       The temperature that the sample was exposed to
    Time       The amount of time that the sample was exposed to test conditions
    Rating     The rating of wrinkle resistance
    Save the file as WrinkleResistance.sav
    Estimate a multiple regression model that could be used to predict the wrinkle resistance rating of cotton cloth given data on the four predictor variables. (This means write out a general model using symbols and variable names.)
    Create a scatterplot matrix for all the variables.
    Conduct a multiple regression analysis (starts with step 3 on page 159). Use the “Forward” method of selection.  
    Write out the equation for your final model (look about half-way down the first column on page 162).
    Using adjusted R², calculate the effect size using Cohen’s equation on the bottom of page 156. (Does SPSS do this automatically now?)
    Conduct a residual analysis (bottom of page 162).
    The write-up needs to include:
    An introduction: What is the objective of this study? What is your dependent variable? Independent variables? Who hired you? 
    Analysis section: What analysis did you do? What is your equation of the model? What is R² and its interpretation? Cohen’s? Your scatterplot matrix, residual scatterplot, and table similar to 5.3 or 5.4 would be a part of this section. You also need to include the test statistics and p-values for all of the predictors included in the final model. 
    Conclusion: This should be non-statistical. Anyone should be able to read this section and understand your conclusions. Also can include what you would recommend for future studies and any problems you encountered.
    Deliverables: 
    One Word doc. Include all syntax at the end of the document. 
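As a hedged illustration of the "general model" and effect-size steps in the instructions above, the sketch below writes a four-predictor model using the variable names from the variable list, followed by the standard form of Cohen's f². It assumes the textbook equation on page 156 is that standard form, which the assignment text does not confirm; the β coefficient symbols are notation chosen here.

```latex
\[
\text{Rating} = \beta_0 + \beta_1\,\text{Conc} + \beta_2\,\text{Ratio}
              + \beta_3\,\text{Temp} + \beta_4\,\text{Time} + \varepsilon
\]
\[
f^2 = \frac{R^2_{\text{adj}}}{1 - R^2_{\text{adj}}}
\]
```

Cohen's conventional benchmarks treat f² near 0.02 as small, 0.15 as medium, and 0.35 as large. With the Forward method, the final model may retain only a subset of the four predictors, so the fitted equation can have fewer terms than the general one.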

  • Lucida Inc Employee Dataset Title: “Analyzing Employee Data at Lucida Inc: A Case Study” 1. Introduction Lucida Inc is a multinational company that specializes in technology and software development. The company has been in operation for over

    The Final Project consists of Lucida Inc company dataset. You are to READ the case context and use the Lucida_Employee_Dataset presented to answer the questions that follow.
    There are eleven (11) instructions and questions. Please complete all of them and present your answers as separate worksheets and in text boxes (for text answers) as they may apply. Use as many worksheets as possible. Highlight your answers as much as possible. Please, maintain the numbering system for easy follow up.
    You are permitted to use Google Scholar to support your work with references (as applicable).

  • “Regression Analysis and Predictions for Employment Sources and Flight Bookings” Predicting Bookings and Resident Ratings in Apartment Complexes “Predicting Resident Ratings: The Effects of Multiple Dogs and Year of Facility on Apartment Complexes” “Examining the Impact of Dogs and Facility Age on Resident Ratings: A Multiple Regression Analysis”

    first row of questions (please use document below for the data)
    1.
    2. Based on the above simple regression analysis, how much should the hiring managers expect to have for Newspaper/Magazine in the following January (Month 13)? Round to two decimal places, do not include the dollar sign.
    3. The hiring managers want to compare three different employment sources: Newspaper/Magazine, CareerBuilder, and Monster.com. Perform simple regressions as you did above predicting allocations for month 13 for those three employment sources. Which of the following statements would be the best conclusion from your analysis?
    Group of answer choices
    The model R-square for Monster.com is the highest R-square value of the three platforms. Therefore, management will likely provide the most funding for that platform, and we should focus on that above the other two.
    The models for Monster.com and Newspaper/Magazine are both significant, which means we should focus on these two platforms over CareerBuilder because management will likely provide us the most funding for those two platforms.
    The predicted monthly amount for CareerBuilder is lower than the predicted amount for Newspaper/Magazine. Therefore, they should expect the lowest amount of funding for CareerBuilder out of all of the three, and should concentrate on their efforts on the other two employment resources.
    The monthly amounts for Monster.com have a negative slope, compared to the other two, which have a positive slope. Therefore, they should expect lower funding for Monster.com than in the past, and should concentrate their efforts on the other two employment resources.
    All of the above are correct.
    None of the above are correct.
    4. For ease of performing the next task, move the variable “Social Networks – Facebook, Twitter, etc” to column A on the spreadsheet. (i.e. Make sure it is the first column in the spreadsheet.)
    The hiring managers recognize that the funding for one employment source depends on the funding for the other sources. Therefore, they want you to perform a multiple regression predicting Social Network funding from Months 1-12, while controlling for the funding of Billboard, Careerbuilder, Company Intranet – Partner, and Diversity Job Fair. Assume that we know the funding for the controlled variables for Month 13. They are:
    Billboard: 520
    Careerbuilder: 800
    Company Intranet: 0
    Diversity Job Fair: 1000
    For this model and these known values, what is the predicted value for Social Network funding for Month 13?
    5. Compare the performance of the model above with a model that just controls for the funding of Billboard. In other words, create a multiple regression predicting Social Network funding from Months 1-12, while controlling for the funding of Billboard. Compare the Regression Statistics. Which model does a better job of explaining the variance in Social Network funding?
    Group of answer choices
    The model that just controls for Billboard
    The model that controls for Billboard, Careerbuilder, Company Intranet, and Diversity Job Fair
    second row
    1.
    xi   yi
    1    3
    2    7
    3    5
    4    11
    5    14
    Which of the following scatter diagrams accurately represents the data above?
    Group of answer choices
    None of the diagrams
    Diagram 1
    Diagram 2
    Diagram 3
    2. Find the slope (b1) for the regression equation for the following values. Round to 3 decimal places.
    Define Variables
    xi   yi
    33   180
    25   170
    50   200
    65   155
    57   160
    27   165
    3. Try to approximate the relationship between x and y by drawing a straight line through the data. Which of the following scatter diagrams accurately represents the data?
    Group of answer choices
    Diagram 1
    Diagram 2
    Diagram 3
    None of the diagrams
    4. Find the intercept (b0) for the regression equation for the following values. Round to 3 decimal places. Attention: The numbers may be different from the previous question.
    Intercept
    xi   yi
    33   180
    25   170
    50   200
    65   192
    57   160
    27   165
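The slope and intercept questions above both rest on the least-squares formulas b1 = Sxy / Sxx and b0 = ȳ − b1·x̄. The following is a minimal sketch of that arithmetic, assuming the flattened tables read as (x, y) pairs as reconstructed from the listing; the function name `fit_line` is hypothetical.

```python
def fit_line(xs, ys):
    """Return (b0, b1) for the least-squares line y = b0 + b1 * x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Sum of cross-deviations and sum of squared x-deviations
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    return b0, b1


xi = [33, 25, 50, 65, 57, 27]
yi_slope_question = [180, 170, 200, 155, 160, 165]      # data for the slope question
yi_intercept_question = [180, 170, 200, 192, 160, 165]  # data for the intercept question

_, slope = fit_line(xi, yi_slope_question)
intercept, _ = fit_line(xi, yi_intercept_question)
print(round(slope, 3), round(intercept, 3))
```

Note that the two questions use different y values for x = 65, so each needs its own fit.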
    5. Flight bookings on the Orbitz travel site fluctuate throughout the year. In the month of December, the Orbitz team knows that bookings increase throughout the month. The team is trying to predict number of bookings in a given day throughout the month. They have sampled data from a few dates out of last December and would like to predict the upcoming December bookings. The data are below. In this scenario’s regression equation, x is __________.
    Orbitz Travel
    Dates in December   Bookings in the thousands
    1                   3
    4                   7
    7                   4
    10                  5
    13                  6
    16                  5
    19                  8
    22                  10
    25                  13
    27                  14
    30                  16
    Group of answer choices
    Date in December
    Date in January
    16 thousand
    Bookings in the thousands
    6. Flight bookings on the Orbitz travel site fluctuate throughout the year. In the month of December, the Orbitz team knows that bookings increase throughout the month. The team is trying to predict number of bookings in a given day throughout the month. They have sampled data from a few dates out of last December and would like to predict the upcoming December bookings. The data are below.
    Orbitz Travel Site
    Dates in December   Bookings in the thousands
    1                   3
    4                   7
    7                   4
    10                  5
    13                  6
    16                  5
    19                  8
    22                  10
    25                  13
    27                  14
    30                  16
    If the intercept for the above data is 1.82 and the slope is 0.41, what is the complete regression equation?
    Group of answer choices
    x = .41y + 1.82
    y = .41x + 1.82
    y = 1.82(.41) + x
    y = 1.82x + .41
    7. A casino is interested in the relationship between the amount of alcohol people buy and the amount of money they spend at the slot machines. They have sampled 10 people and measured the amount of alcohol they drank and how much they spend that day. Based on the regression line, they would like to predict how much money people will spend based on the amount that they drink. The data are below.
    Alcohol by the Ounce
    Ounces purchased   Dollars spent on slots
    .5                 35
    1                  64
    2                  100
    1                  73
    2.5                110
    3                  150
    1.5                130
    5                  300
    3                  130
    2.5                105
    If the regression line is: y = 50.87x + 7.78, how much money do we predict that a person who drinks 2.94 ounces of alcohol spends? Round to 2 decimal places. Do not put a dollar sign in your answer.
    8. An apartment management company wants to explore the consequences of allowing residents to have multiple dogs. They would like to find out whether the number of dogs predicts resident ratings. They would also like to control for the year the apartment complex was built because they know that might also affect the resident rating. They have collected data on several of their existing complexes. For each complex, they have counted the number of dogs currently living in the complex, the year the complex was built, and the average rating for that particular complex. They would like to perform a multiple regression on these variables to predict resident ratings. See data below:
    Multiple Dog Consideration
    Number of dogs   Year of facility   Rating (out of 5)
    54               1975               2
    31               1964               3.5
    0                2015               4.8
    11               2011               3.8
    73               1964               2.3
    23               2016               3.7
    0                2015               4.7
    49               1989               2.7
    In this scenario, the y (dependent variable) is:
    Group of answer choices
    Rating (out of 5)
    Number of Dogs
    Year of Facility
    Number of Residents
    9. An apartment management company wants to explore the consequences of allowing residents to have multiple dogs. They would like to find out whether the number of dogs predicts resident ratings. They would also like to control for the year the apartment complex was built because that might also affect the resident rating. They have collected data on several of their existing complexes. For each complex, they have counted the number of dogs currently living in the complex, the year the complex was built, and the average rating for that particular complex. They would like to perform a multiple regression on these variables to predict resident ratings. See data below. These data are the same as the previous question.
    Multiple Dog Consequences
    Number of dogs   Year of facility   Rating (out of 5)
    54               1975               2
    31               1964               3.5
    0                2015               4.8
    11               2011               3.8
    73               1964               2.3
    23               2016               3.7
    0                2015               4.7
    49               1989               2.7
    What is the b coefficient for Number of Dogs? Round to 3 decimal places.
    10. An apartment management company wants to explore the consequences of allowing residents to have multiple dogs. They would like to find out whether the number of dogs predicts resident ratings. They would also like to control for the year the apartment complex was built because that might also affect the resident rating. They have collected data on several of their existing complexes. For each complex, they have counted the number of dogs currently living in the complex, the year the complex was built, and the average rating for that particular complex. They would like to perform a multiple regression on these variables to predict resident ratings. See data below. These data are the same as the previous question.
    Consequences of Multiple Dogs
    Number of dogs   Year of facility   Rating (out of 5)
    54               1975               2
    31               1964               3.5
    0                2015               4.8
    11               2011               3.8
    73               1964               2.3
    23               2016               3.7
    0                2015               4.7
    49               1989               2.7
    What is the p-value for Number of Dogs? Round to 3 decimal places.
    11. An apartment management company wants to explore the consequences of allowing residents to have multiple dogs. They would like to find out whether the number of dogs predicts resident ratings. They would also like to control for the year the apartment complex was built because that might also affect the resident rating. They have collected data on several of their existing complexes. For each complex, they have counted the number of dogs currently living in the complex, the year the complex was built, and the average rating for that particular complex. They would like to perform a multiple regression on these variables to predict resident ratings. See data below. These data are the same as the previous question.
    Number of Dogs and Ratings
    Number of dogs   Year of facility   Rating (out of 5)
    54               1975               2
    31               1964               3.5
    0                2015               4.8
    11               2011               3.8
    73               1964               2.3
    23               2016               3.7
    0                2015               4.7
    49               1989               2.7
    Fill in the blanks using the dropdown menus.
    The effect of Number of Dogs has a p-value [ Select ] [“more than”, “equal to”, “less than”] .05, which means it is [ Select ] [“significant”, “not significant”] . It is [ Select ] [“likely”, “unlikely”] that the effect of Number of Dogs on Resident Ratings is due to chance (i.e. not a real effect).
    The effect of Year of Facility has a p-value [ Select ] [“less than”, “equal to”, “more than”] .05, which means it is [ Select ] [“not significant”, “significant”] . It is [ Select ] [“unlikely”, “likely”] that the effect of Year of Facility on Resident Ratings is due to chance (i.e. not a real effect).
    12. An apartment management company wants to explore the consequences of allowing residents to have multiple dogs. They would like to find out whether the number of dogs predicts resident ratings. They would also like to control for the year the apartment complex was built because that might also affect the resident rating. They have collected data on several of their existing complexes. For each complex, they have counted the number of dogs currently living in the complex, the year the complex was built, and the average rating for that particular complex. They would like to perform a multiple regression on these variables to predict resident ratings. See data below. These data are the same as the previous question.
    Multiple Dog Ratings
    Number of dogs   Year of facility   Rating (out of 5)
    54               1975               2
    31               1964               3.5
    0                2015               4.8
    11               2011               3.8
    73               1964               2.3
    23               2016               3.7
    0                2015               4.7
    49               1989               2.7
    What is the R Square value for this model? Round to 3 decimal places.
    13. An apartment management company wants to explore the consequences of allowing residents to have multiple dogs. They would like to find out whether the number of dogs predicts resident ratings. They would also like to control for the year the apartment complex was built because that might also affect the resident rating. They have collected data on several of their existing complexes. For each complex, they have counted the number of dogs currently living in the complex, the year the complex was built, and the average rating for that particular complex. They would like to perform a multiple regression on these variables to predict resident ratings. See data below. These data are the same as the previous question.
    Positive and Negative Dog Ratings
    Number of dogs   Year of facility   Rating (out of 5)
    54               1975               2
    31               1964               3.5
    0                2015               4.8
    11               2011               3.8
    73               1964               2.3
    23               2016               3.7
    0                2015               4.7
    49               1989               2.7
    What can we conclude about this multiple regression analysis? Fill in the blanks.
    There is a [ Select ] [“negative”, “positive”] effect of Number of Dogs on Resident Ratings. There is a [ Select ] [“positive”, “negative”] effect of Year of Facility on Resident Ratings. As Number of Dogs increases, Resident Ratings [ Select ] [“decrease”, “stay the same”, “increase”] . Therefore, we recommend that the management [ Select ] [“does not make any changes to dog limits”, “allows more dogs”, “allows fewer dogs”] in their future complexes.
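The apartment questions above all draw on one two-predictor regression. The sketch below shows how the b coefficients and R Square could be computed by hand from the listed data, using the closed-form solution of the centered normal equations. It is an illustration under the assumption that the table reads as reconstructed from the listing; `fit_two_predictors` is a hypothetical name, and p-values are not computed here because they need the t distribution, which Excel or SPSS regression output reports directly.

```python
def fit_two_predictors(x1, x2, y):
    """Least-squares fit of y = b0 + b1*x1 + b2*x2.

    Returns (b0, b1, b2, r_squared).
    """
    n = len(y)
    m1, m2, my = sum(x1) / n, sum(x2) / n, sum(y) / n
    # Center each variable to keep the arithmetic well conditioned
    d1 = [v - m1 for v in x1]
    d2 = [v - m2 for v in x2]
    dy = [v - my for v in y]
    s11 = sum(a * a for a in d1)
    s22 = sum(b * b for b in d2)
    s12 = sum(a * b for a, b in zip(d1, d2))
    s1y = sum(a * c for a, c in zip(d1, dy))
    s2y = sum(b * c for b, c in zip(d2, dy))
    syy = sum(c * c for c in dy)
    det = s11 * s22 - s12 ** 2
    b1 = (s22 * s1y - s12 * s2y) / det
    b2 = (s11 * s2y - s12 * s1y) / det
    b0 = my - b1 * m1 - b2 * m2
    # Explained variation divided by total variation
    r_squared = (b1 * s1y + b2 * s2y) / syy
    return b0, b1, b2, r_squared


dogs = [54, 31, 0, 11, 73, 23, 0, 49]
year = [1975, 1964, 2015, 2011, 1964, 2016, 2015, 1989]
rating = [2, 3.5, 4.8, 3.8, 2.3, 3.7, 4.7, 2.7]

b0, b_dogs, b_year, r_squared = fit_two_predictors(dogs, year, rating)
print(round(b_dogs, 3), round(b_year, 3), round(r_squared, 3))
```

The sign of each coefficient gives the direction of the effect while the other predictor is held constant. Number of dogs and year of facility are strongly correlated in these data, so the individual coefficients should be read with care.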

  • “Data Analysis and Visualization using MS Excel”

    You can do it manually or may use MS Excel to solve the questions. You need to save it as a PDF file.

  • “Creating a Frequency Distribution with 13 Classes”

    Frequency distribution homework
    Constructing a frequency distribution with 13 classes
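One possible way to build a 13-class frequency distribution is sketched below. It assumes equal-width classes spanning the minimum to the maximum value, which is a common convention but not one the assignment states; textbooks often round the class width up to a convenient number instead. The function name and the sample values are hypothetical placeholders, since the assignment data are not included in the listing.

```python
def frequency_distribution(values, num_classes=13):
    """Return a list of (lower, upper, count) for equal-width classes."""
    low, high = min(values), max(values)
    if high == low:
        raise ValueError("all values are identical; class width would be zero")
    width = (high - low) / num_classes
    counts = [0] * num_classes
    for v in values:
        # Clamp so the maximum value falls in the last class
        idx = min(int((v - low) / width), num_classes - 1)
        counts[idx] += 1
    return [(low + i * width, low + (i + 1) * width, counts[i])
            for i in range(num_classes)]


# Placeholder values for illustration only, not the assignment data
sample_values = [(i * 7) % 53 for i in range(40)]
table = frequency_distribution(sample_values)
for lower, upper, count in table:
    print(f"{lower:6.2f} to {upper:6.2f}: {count}")
```

In Excel, the same counts can be obtained with the Histogram tool in the Data Analysis ToolPak by supplying the 13 upper class limits as the bin range.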

  • “Interpreting Data for Business Decision Making: A Presentation to the Organization” “Graphical Representations of Data in Business: A Comparative Analysis of Two Articles”

    Introduction
    Business administrators and managers are often called upon to interpret data that analysts have provided to them. This requires an understanding of the data sources (when, where, and how data is collected; formatted or stored; and used), as well as what that data looks like and how it can be summarized. In this first assessment, you are asked to locate any report or periodical article used in a business context of interest to you that contains at least two different graphical representations of data. You will interpret the graphical data representations and present your findings in a brief PowerPoint deck, as if you were presenting during a company meeting.
    In this assessment you will learn about the collection, formatting, and use of raw data, as well as graphical and tabular methods for summarizing it. You will also get started with the technology that you will use in this course: Microsoft Excel (including the Data Analysis ToolPak add-in).
    Scenario
    You have been invited to present at a departmental meeting with employees from all levels within the organization. You have been allotted 6-10 minutes to speak.
    Your role
    The purpose of your speech is to explain the business context as well as two charts or tables that you have evaluated as a business analyst of the organization.
    Your business report to the group will be a slide presentation with speaker notes and appropriate citations and references.
    Instructions
    Complete the following:
    Article Identification. Use one of the articles listed under the Article Options subheading below, or find an article in Forbes or another business journal, or an annual report from a publicly traded company, that includes at least two data graphs or tables. The graphs should depict or represent data using pie charts, bar charts, tables, scatter plots, trend lines, et cetera.
    Read the article and identify the business context. Business context includes organizational history, mission, product and services, environment, competitive advantage, competition, et cetera. You can also determine business context from additional sources (and you should). The company or organizational background information should help explain why the data are relevant. This will be the introductory information for your business report, presentation, or assessment.
    Interpret your chosen data representations in the context of the business situation. The following are typical questions an analyst would use to interpret the data:
    What is being measured (the variables)?
    What are the relationships among the variables?
    What are the trends in the data?
    How can the data be applied in the business context?
    Create an effective 6-10 slide PowerPoint deck with detailed presenter’s notes (including citations and reference slides) elaborating on each point that will be presented at a departmental meeting. For example:
    Organization/business context.
    Relevance/importance of information.
    Source of data set and any limitations?
    Graphic of data 1 – with interpretations of graph.
    Graph of data 2 – with interpretations of graph.
    Importance of data analysis in terms of business context.
    Summary.
    Reference slides.
    An effective PowerPoint presentation for this purpose typically includes:
    One title slide, APA formatted.
    1-2 introduction slides explaining the business context.
    Several slides. You should copy and paste (insert) the graphs or tables and include an appropriate citation. Each slide should include detailed speaker notes.
    Several slides. Include your interpretation of each graphical data representation.
    Conclusion slides in which you explain how the data may affect the business context or how it could be applied in your business context to inform decision making.
    Slide with at least four APA-formatted references, including the source of each graph.
    Article Options
    For this assessment, you will need to choose among business articles from periodicals, annual reports of publicly traded companies, or published business reports to find two graphical representations of data.
    A list of appropriate articles has been compiled for this assessment. You may select one of the articles from the following list or find your own suitable business article containing two graphical data representations.
    Huang, N. S. (2019). Investing: The best funds for your 401K. Kiplinger’s Personal Finance, 73(12), 18-29.
    Reacting to market volatility. (2020). Kiplinger’s Personal Finance, 74(2), 54-55.
    Taxpayers weigh in on the new tax law. (2020). Kiplinger’s Personal Finance, 74(3), 50-51.
    Woodley, K. (2020). A holistic bet on housing. Kiplinger’s Personal Finance, 74(2), 31.
    Decarlo, S., Elam, D. G., Smyth, K., Agus, S., Austin, C., Hackett, R., Kowitt, B., Lashinsky, A., Lev-ram, M., Nusca, A., O’keefe, B., Roberts, J. J., & Wieczner, J. (2017). 100 fastest-growing companies. Fortune, 176(4), 157-163.
    Growing, growing… gone! (2016). Forbes, 197(5), 28.
    Lim, P. J. (2018). The 50 best mutual funds and 50 best ETFs. Money, 47(1), 86-91.
    Meet the world’s richest. (2016). Forbes, 197(4), 26-27.
    Salisbury, I. (2018). How we got here. Money, 47(1), 52-57.
    Sorvino, C. (2016). Dollar days. Forbes, 197(4), 28.
    Additional instructions
    Your written communication should be free of errors that detract from the overall message, meet APA standards, and be unbiased with documented facts rather than opinion.
    Remember to use and include at least four sources of information for your presentation.
    Evaluation
    By successfully completing this assessment, you will demonstrate your proficiency in the following course competencies through corresponding scoring guide criteria:
    Competency 1: Explain how data management techniques and tools are used to support business decisions.
    Introduce the business context.
    Explain how the data can be applied to the business context.
    Competency 4: Present the results of data analysis in clear and meaningful ways to multiple stakeholders.
    Interpret, or explain the meaning of, the two different graphical representations of data.
    Correctly format citations and references using current APA style.
    Present content clearly, professionally, and logically for the identified audience.

  • “Exploring R for Data Science: Discussion Questions”

    Questions based on the textbook Please mark the order of each question when answering the questions.
    https://r4ds.hadley.nz/

  • “McDonald’s Global Expansion: A Data-Driven Analysis of Strategic Leadership”

    The investigation covers McDonald’s from 2008 to 2023: how many stores it opened, its scale, why it has grown so large, why it chooses to open in particular locations, and its annual reports. The core task is to use data to analyze how the company reached the scale it has today.
    Assessment tasks
    McDonald’s is the world’s largest restaurant chain by revenue, serving more than 69 million customers daily at 37,855 stores in more than 100 countries as of 2018. While McDonald’s is best known for its hamburgers, cheeseburgers, and French fries, it also features chicken products, breakfast items, soft drinks, milkshakes, wraps, and desserts. You are a consultant at Data Solution Limited, and your line manager has asked you to conduct a detailed investigation of McDonald’s expansion around the world for a local newspaper.
    Using the government-based data set provided, you will design a 2000-word report that provides insights into McDonald’s world expansion.
    You will use secondary data retrieved from ONS for analysis. Your data will be analyzed using appropriate software packages (i.e. SPSS/NVIVO/Tableau/Excel) and visualized for easy interpretation. You will need to support your data set with insights from other selected sources of information and present it in a format suitable for an expert rather than a specialist audience.
    Your report should cover the following points
    1. A brief review of the literature on the topic
    2. Review data sets and their impact on strategic leadership
    3. Data analysis and visualization of datasets to provide insights using appropriate statistical packages (i.e. SPSS/NVIVO/Excel/Tableau)
    4. Evidence of training completed on LinkedIn Learning (SPSS, Tableau and NVIVO certificates provided in Appendix screenshot)
    The report should be in a paged report format and include the following:
    1. Title page
    2. Abstract
    3. Introduction
    4. Findings and analysis
    5. Conclusion
    6. Recommendations
    7. Quotes
    8. Appendix

  • “Uncovering Insights: Data Analysis and Visualization for Stakeholder Presentation”

    Building on the work you completed in your Assessment 2 Case Study analysis, within which you cleaned a data set and constructed a pivot table for the organization of your choice, Assessment 3 requires you to expand on this analysis and deliver a formal presentation to your invested stakeholders. You are to choose from three presentation methods and deliver a 5-10 minute review and analysis of the data set you have chosen that offers real-world insights for the organization that has employed you.
    1. Shows an understanding of how to ‘read’ data and uncover insights for invested stakeholders
    2. Displays a capacity to develop pivot tables, charts, and/or graphs using Microsoft Excel
    3. Shows a capacity to develop presentation materials and visual aids that convey clear data insights
    4. Shows an ability to present clearly and persuasively to an invested stakeholder
    Complement the pivot table you created in Assessment 2 with at least three more visualizations using Microsoft Excel that help to tell a data-driven story for your stakeholder.
    Prepare your presentation. Your submission should contain graphs/tables/charts, summary points, and other visual aids that help to propel a data-driven story.
    Referring to assignment 2, this assignment 3 is where I need to create PowerPoint slides. Please ask if u have any questions!
    For the Assignment topic can change to a better one please thank you. 

  • “Improving Data Analysis Skills: Rectifying Errors in SPSS Analysis”

    As per my teacher, you have to rectify the errors in my document by redoing the SPSS analysis. The teacher’s comments are in the attached document.