I want to do a bootcamp as a data analyst, but I have a totally different background: I am a lawyer who wants to work in legal tech. I applied for a bootcamp and was accepted, but I have to take a test first.
I have tried to solve the test, and you can find my answers below. Maybe someone here is willing to review my answers and correct me where I'm wrong. I have no experience in data analytics and don't know anyone in this field. I really want to take the bootcamp so I can advance my skills and get better job offers. I would really appreciate it if someone could even Skype/Zoom! Thanks a lot! <3
Download the assessment_data.zip file.
Extract the archive. It contains three datasets from the faux company Northwind.
The datasets are called products, suppliers and categories.
Open a new Google Spreadsheet Document (https://www.google.com/sheets/about/)
Import each dataset into a sheet of the document. --> For this step I just saved each file as a CSV file in a folder on my desktop and later uploaded it to Google Sheets. Is this correct? Excel tells me that some data might get lost if I save it as CSV.
Answer the following questions in a new sheet, named insights. Use a formula to answer each question:
What is the average unit price of all products listed? --> For this question I wrote the formula =AVERAGE(C2:C78) below column C, which is named unitprice. It is part of the products dataset.
What's the overall amount of products in stock? --> There is a column named quantity per unit and a column named units in stock. To me it made sense to multiply the quantity per unit by the units in stock. For example, the quantity per unit for chai is "10 boxes x 20 bags" and 39 of them are in stock. If I try to multiply these with the formula =G2*E2, I get the error that parameter one is a text value and not a number value. If I try to convert the data into a number, it still doesn't work. Or do I only have to add up all the units in stock, disregarding the quantity per unit, to answer the question? Then I would go to the units in stock column and use the formula =SUM(F2:F78).
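The two readings of this question can be compared outside the spreadsheet. The following is only a minimal Python sketch under stated assumptions: the first sample row is the chai example from above, the second row is made up, and items_per_unit is a hypothetical helper that only understands texts shaped like "10 boxes x 20 bags". It also shows why the multiplication fails in the sheet: the quantity per unit cell is text and has to be parsed before it can be used as a number. Which reading the assessment intends is not stated.

```python
import re

# Hypothetical sample rows: (quantity per unit, units in stock).
# Only the first row is taken from the question; the second is made up.
rows = [
    ("10 boxes x 20 bags", 39),
    ("5 boxes x 4 bags", 10),
]

def items_per_unit(text):
    """Return the product of the two counts in a text such as '10 boxes x 20 bags'."""
    match = re.match(r"(\d+)\s+\w+\s+x\s+(\d+)", text)
    if match is None:
        # Other text formats would need their own handling.
        raise ValueError(f"unrecognised quantity text: {text!r}")
    return int(match.group(1)) * int(match.group(2))

# Reading 1: add up the units in stock column as it is.
total_units = sum(units for _, units in rows)

# Reading 2: multiply each stock count by the items contained in one unit.
total_items = sum(items_per_unit(text) * units for text, units in rows)

print(total_units)  # 49
print(total_items)  # 8000
```

Reading 1 corresponds to a plain sum over the units in stock column; reading 2 needs the text column split into numbers first.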
Which category has the most products listed? --> For this I looked at the products dataset as well as the categories dataset. In total there are 8 categories of products; that is what I got from the categories dataset. In the products dataset, the products have category IDs from 1 to 8. To answer the question, I would have to count which number occurs most often in the category ID column. I can do that with the COUNTIF formula, right? However, the COUNTIF formula doesn't let me count the number of times a product is listed. I think I'm doing it wrong.
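The counting idea described here (one count per category ID, then pick the largest) can be sketched in Python. This is only an illustration, assuming a made-up list of category IDs in place of the real category ID column. In Sheets, COUNTIF takes a range and a criterion and returns how many cells in the range match, so one COUNTIF per category ID would give the same per-category counts.

```python
from collections import Counter

# Hypothetical category IDs, one per product row (not the real Northwind data).
category_ids = [1, 2, 2, 3, 1, 2, 8]

# Count how many products fall into each category ID.
counts = Counter(category_ids)

# Pick the category ID with the highest count.
top_category, top_count = counts.most_common(1)[0]

print(counts[1])     # 2 products in category 1
print(top_category)  # 2
print(top_count)     # 3
```

The winning ID would then be looked up in the categories dataset to get the category name.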
What proportion of products are discontinued? --> Here I had no idea how to work out the proportion of discontinued products. The discontinued column in the products dataset contains only 0 or 1.
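A proportion is the number of matching rows divided by the total number of rows. Below is a minimal Python sketch of that calculation, assuming made-up sample values and a hypothetical helper named proportion; it is not the real data. In Sheets, the same idea could be written as something like =COUNTIF(range, 1)/COUNT(range) over the discontinued column, where range is a placeholder for the actual cells.

```python
def proportion(values, target):
    """Return the share of entries in values that are equal to target."""
    matches = sum(1 for value in values if value == target)
    return matches / len(values)

# Hypothetical discontinued flags: 1 = discontinued, 0 = still sold.
discontinued = [0, 1, 0, 0, 1, 0, 0, 0]
print(proportion(discontinued, 1))  # 0.25

# The same helper works for a text column, e.g. hypothetical supplier countries.
countries = ["Germany", "UK", "Germany", "Japan", "USA"]
print(proportion(countries, "Germany"))  # 0.4
```

Because the flags are only 0 or 1, the average of the column gives the same number as the proportion of discontinued products.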
What proportion of suppliers are from Germany? --> No idea :(
7. Create a sharable link of the sheet