Reading 2: Organizing, Visualizing, and Describing Data
50 questions available
Key Points
- Numerical data: Discrete vs. Continuous.
- Categorical data: Nominal vs. Ordinal.
- Time series data track one variable over time; cross-sectional data observe a variable across multiple units (companies, countries, individuals) at one point in time.
- Panel data combines time series and cross-sectional dimensions.
Key Points
- Histograms visualize frequency distributions of numerical data.
- Scatter plots reveal relationships between two numerical variables.
- Heat maps use color intensity to show frequency or correlation.
- Tree maps visualize relative sizes of categories.
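A histogram is built on a frequency distribution, so it can help to see one constructed by hand. The following is a minimal sketch, assuming equal-width intervals and that every observation falls inside the covered range; the function name `frequency_distribution` and the return figures are hypothetical and used only for illustration.

```python
from collections import Counter

def frequency_distribution(values, low, width, n_bins):
    """Return (lower, upper, absolute, relative) for each equal-width interval.

    Intervals are [lower, upper), except that an observation equal to the
    overall upper bound is placed in the last interval.
    Assumes every value lies between low and low + width * n_bins.
    """
    counts = Counter()
    for v in values:
        idx = int((v - low) // width)
        idx = min(idx, n_bins - 1)  # keep the maximum value in the last interval
        counts[idx] += 1

    total = len(values)
    table = []
    for i in range(n_bins):
        lower = low + i * width
        upper = lower + width
        absolute = counts[i]
        # Relative frequency = absolute frequency / total number of observations
        table.append((lower, upper, absolute, absolute / total))
    return table

# Hypothetical returns in percent, for illustration only
returns = [-4.0, -1.5, 0.5, 1.0, 2.5, 3.0, 3.5, 6.0, 7.5, 9.0]
table = frequency_distribution(returns, low=-5.0, width=5.0, n_bins=3)

for lower, upper, absolute, relative in table:
    print(f"{lower:>5.1f} to {upper:>5.1f}: {absolute} obs, relative frequency {relative:.2f}")
```

The bar heights of a histogram would be the absolute frequencies in each row; the relative frequencies sum to 1 across all intervals.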
Key Points
- Arithmetic mean is sensitive to outliers; Median is not.
- Geometric mean is used for compound returns over time.
- Harmonic mean is used for average price per share when an equal amount of money is invested each period (cost averaging).
- For positive values that are not all equal: Harmonic Mean < Geometric Mean < Arithmetic Mean.
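The relationships among the three means can be checked numerically. This is a small sketch using hypothetical prices and returns (none of these figures come from the questions); the helper names are illustrative, and the standard library's `statistics` module offers comparable functions.

```python
import math

def arithmetic_mean(values):
    return sum(values) / len(values)

def geometric_mean(values):
    # n-th root of the product; values must be positive
    return math.prod(values) ** (1 / len(values))

def harmonic_mean(values):
    # n divided by the sum of reciprocals; values must be positive
    return len(values) / sum(1 / v for v in values)

def geometric_mean_return(returns):
    # Compound rate: geometric mean of (1 + r), minus 1
    return geometric_mean([1 + r for r in returns]) - 1

# Hypothetical share prices paid in three successive months
prices = [8.0, 10.0, 12.5]
am = arithmetic_mean(prices)
gm = geometric_mean(prices)
hm = harmonic_mean(prices)
print(f"harmonic {hm:.4f} < geometric {gm:.4f} < arithmetic {am:.4f}")

# Cost averaging: investing the same amount at each price
amount = 1000.0
shares_bought = sum(amount / p for p in prices)
average_cost = amount * len(prices) / shares_bought
print(f"average cost per share {average_cost:.4f} (matches the harmonic mean)")

# Hypothetical annual returns: the compound rate is below the simple average
annual_returns = [0.05, -0.02, 0.08]
compound_rate = geometric_mean_return(annual_returns)
print(f"compound annual return {compound_rate:.4%}")
```

The cost-averaging lines show why the harmonic mean is the right tool there: total money spent divided by total shares bought reduces algebraically to the harmonic mean of the prices.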
Key Points
- Sample variance uses n-1 divisor to be an unbiased estimator.
- Coefficient of Variation (CV) = Standard Deviation / Mean.
- Positive Skew: Mean > Median > Mode.
- Leptokurtic distributions have excess kurtosis > 0 (fat tails).
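The dispersion and shape measures above can be sketched in a few lines. The sample is hypothetical, and the skewness and excess kurtosis formulas shown are one common large-sample approximation (dividing by n and using the sample standard deviation), which is an assumption here; exact small-sample versions apply additional correction factors.

```python
import math
import statistics

def sample_variance(values):
    mean = sum(values) / len(values)
    # Divide by n - 1, not n, for the unbiased sample estimator
    return sum((x - mean) ** 2 for x in values) / (len(values) - 1)

def coefficient_of_variation(values):
    mean = sum(values) / len(values)
    return math.sqrt(sample_variance(values)) / mean

def skewness(values):
    # Approximation: average cubed deviation scaled by s^3
    n = len(values)
    mean = sum(values) / n
    s = math.sqrt(sample_variance(values))
    return sum((x - mean) ** 3 for x in values) / (n * s ** 3)

def excess_kurtosis(values):
    # Approximation: average fourth-power deviation scaled by s^4, minus 3
    n = len(values)
    mean = sum(values) / n
    s = math.sqrt(sample_variance(values))
    return sum((x - mean) ** 4 for x in values) / (n * s ** 4) - 3

# Hypothetical sample with one large observation in the right tail
sample = [1, 2, 2, 3, 3, 3, 4, 12]

print("mean    ", statistics.mean(sample))
print("median  ", statistics.median(sample))
print("variance", round(sample_variance(sample), 4))
print("CV      ", round(coefficient_of_variation(sample), 4))
print("skewness", round(skewness(sample), 4))
print("excess kurtosis", round(excess_kurtosis(sample), 4))
```

In this sample the single large value pulls the mean above the median, and both the skewness and the excess kurtosis come out positive, consistent with a right-skewed, fat-tailed shape.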
Questions
Which of the following best describes discrete numerical data?
A dataset contains the daily closing prices of a specific stock over the past year. This is best classified as:
In a frequency distribution, the relative frequency of an interval is calculated as:
A contingency table is primarily used to analyze:
Which visualization tool is most appropriate for identifying whether a nonlinear relationship exists between two numerical variables?
The sum of the deviations of observations from their arithmetic mean is always:
Calculate the weighted mean return of a portfolio consisting of 60 percent Asset A (return 10 percent) and 40 percent Asset B (return 5 percent).
Which measure of central tendency is least affected by outliers?
Calculate the geometric mean return for three years with returns of 10 percent, 20 percent, and -10 percent.
An investor purchases 1,000 USD of stock each month. The share prices paid were 10, 15, and 20. The average cost per share is best calculated using the:
If a distribution is positively skewed, the relationship between the mean, median, and mode is typically:
The third quartile (Q3) of a dataset represents the value below which what percentage of observations lie?
What is the Mean Absolute Deviation (MAD) of the returns 2 percent, 5 percent, and -1 percent?
The sample variance is calculated using a denominator of:
Calculate the coefficient of variation (CV) for a stock with a mean return of 8 percent and a standard deviation of 12 percent.
A distribution with excess kurtosis of 2.0 is best described as:
The correlation coefficient between two variables ranges from:
Which chart type is best for displaying the joint frequency of two categorical variables?
If the harmonic mean, geometric mean, and arithmetic mean are calculated for a dataset with variable positive values, which inequality is correct?
A winsorized mean is calculated by:
Unstructured data is best described as:
For a dataset with 9 observations, the position of the 3rd quartile is calculated using the formula (n+1)y/100. What is the position?
A box and whisker plot specifically highlights which measure of dispersion?
Calculate the sample variance for a dataset: 2, 4, 6. (Mean is 4).
Target downside deviation differs from standard deviation because it:
Which word cloud feature indicates the frequency of a word in a text dataset?
Ordinal data allows for which of the following operations?
A confusion matrix is a type of:
Which chart is best suited for comparing categories by size where area represents value?
The harmonic mean of 2 and 8 is:
In a unimodal distribution, if the Mode < Median < Mean, the distribution is:
Spurious correlation refers to:
What is the joint frequency of 'Monday' and 'Front Street' if the marginal frequency for Monday is 19 and Front Street is 25, given the cell value is 7?
A histogram is essentially a bar chart of:
If the standard deviation of a dataset is 5 and the mean is 20, the coefficient of variation is:
Excess kurtosis is calculated as:
Which measure is calculated as the sum of squared deviations from the mean divided by (n-1)?
Given returns of 10 percent, 10 percent, 40 percent, 10 percent. Which principle allows us to sum the present values of these cash flows?
A bubble line chart adds a third dimension to a line chart by modifying the:
Which of the following is true for a normal distribution?
If covariance is 0.0058, standard deviation of A is 0.0529, and standard deviation of B is 0.1114, what is the correlation coefficient?
Panel data typically combines which two data types?
In a box and whisker plot, the vertical line within the box represents the:
To calculate a compound annual rate of return over three years, one should use the:
Which of the following is true regarding the arithmetic mean and geometric mean for variable data?
Covariance measures:
A heat map uses which visual element to display data frequency?
Which of the following is NOT a property of the arithmetic mean?
To construct a frequency polygon, one plots points using:
Given a sample of returns: 30 percent, 12 percent, 25 percent, 20 percent, 23 percent. What is the median?