Data Analysis

Read Complete Research Material

Data Analysis

Data Analysis



Data Analysis

Part 1:

The above table shows the descriptive statistics for the given data. It can be seen that purple has the highest mean and it can be observed that it also has the highest standard deviation which indicates this fact that the given data set is too much scattered and there is also a chance of having outlier.

It lead us to this fact that mean is not a good tool to find out about the central tendency and the recommended tool is median for this data set.

Skewness:

Skewness is a parameter that explains asymmetry in a random variable's probability distribution. From the table it can be find out that all the given distribution are positively skewed.



Tb: Descriptive of type A data

In 1977, John Tukey published an well-organized method for displaying a five-number data synopsis.

The graph is called a boxplot (also known as a box as well as whisker plot) and summarizes

the following statistical measures:

•median

•upper and lower quartiles

•minimum and maximum data values

The boxplot is understood as follows:

The carton itself comprises the middle 50% of the data. The top for demonstration (hinge) of the carton shows the 75th percentile of the facts and numbers set, and the smaller hinge shows the 25th percentile. The variety of the middle two quartiles is renowned as the inter-quartile range.

The line in the carton shows the median worth of the data.

If the median line inside the carton is not equidistant from the hinges, then the facts and numbers is skewed.

The finishes of the upright lines or "whiskers" show the smallest and greatest facts and numbers standards, except outliers are present in which case the whiskers continue to a greatest of 1.5 times the inter-quartile range.

The points out-of-doors the finishes of the whiskers are outliers or supposed outliers.

Boxplot Enhancements

Beyond the rudimentary data, boxplots occasionally are increased to express added information:

The signify and its self-assurance gap can be shown utilising a precious gem form in the box.

The anticipated variety of the median can be shown utilising notches in the box.

The breadth of the carton can be diverse in percentage to the log of the experiment size.

Advantages of Boxplots

Boxplots have the next strengths:

Graphically brandish a variable's position and disperse at a glance.

Provide some suggestion of the data's symmetry and skewness.

Unlike numerous other procedures of facts and numbers brandish, boxplots display outliers.

By utilising a boxplot for each categorical variable side-by-side on the identical graph, one rapidly can contrast facts and numbers sets.

One drawback of boxplots is that they are inclined to focus the follows of a circulation, which are the smallest certain points in the facts and numbers set. They furthermore conceal numerous of the minutia of the distribution. Displaying a histogram in conjunction with the boxplot assists in this consider, and both are significant devices for exploratory facts and numbers analysis.

From the graph it can be seen that median is not equidistant from the hinges so it can be concluded that data is ...
Related Ads