**Course Name :- Machine Learning – Dimensionality Reduction**

**Module 1:- Data Series**

**Question 1 : Which of the following techniques can be used to reduce the dimensions of the population?**

- Exploratory Data Analysis
- Principal Component Analysis
- Exploratory Factor Analysis
**Cluster Analysis**

**Question 2: Cluster Analysis partitions the columns of the data, whereas principal component and exploratory factor analyses partition the rows of the data. True or false?**

**False**- True

**Question 3: Which of the following options are true? Select all that apply.**

- PCA explains the total variance
- EFA explains the common variance
- EFA identifies measures that are sufficiently similar to each other to justify combination
- PCA captures latent constructs that are assumed to cause variance

**Module 2 : -Data Refinement**

**Question 1 : Which of the following options is true?**

**A matrix of correlations describes all possible pairwise relationships**- Eigenvalues are the principal components
- Correlation does not explain the covariation between two vectors
- Eigenvectors are a measure of total variance, as explained by the principal components

**Question 2: PCA is a method to reduce your data to the fewest ‘principal components’ while maximizing the variance explained. True or false?**

- False
**True**

**Question 3 : Which of the following techniques was NOT covered in this lesson?**

**Parallel analysis**- Percentage of Common Variance
- Scree Test
- Kaiser-Guttman Rule

**Module 3 :- Exploring Data**

**Question 1: EFA is commonly used in which of the following applications? Select all that apply.**

**Customer satisfaction surveys****Personality tests****Performance evaluations**- Image analysis

**Question 2 : Which of the following options is an example of an Oblique Rotation?**

- Regmax
- Varimax
- Softmax
**Promax**

**Question 3 : An Orthogonal Rotation assumes that factors are correlated with each other. True or false?**

**False**- True

**Machine Learning – Dimensionality Reduction Cognitive class final Exam Answers:-**

**Question 1: Why might you use cluster analysis as an analytic strategy?**

- To identify higher-order dimensions
- To identify outliers
- To reduce the number of variables
**To segment the market**- None of the above

**Question 2: Suppose you have 100,000 individuals in a dataset, and each individual varies along 60 dimensions. On average, the dimensions are correlated at r = .45. You want to group the** **variables together, so you decide to run principle component analysis. How many meaningful, higher-order components can you extract?**

- 60
- 3
**20**- 24

**Question 3 : What technique should you use to identify the dimensions that hang together?**

- Principal axis factoring
- Confirmatory factor analysis
**Exploratory factor analysis**- Two of the above
- None of the above

**Question 4 : What are loadings?**

- Covariance between the two factors
- Correlations between each variable and its factor
- Correlations between each variable and its component
**Two of the above**- None of the above

**Question 5 : When would you use PCA over EFA?**

- When you want to use an orthogonal rotation
**When you are interested in explaining the total variance in a variance-covariance matrix**- When you have too many variables
- When you are interested in a latent construct
- None of the above

**Question 6 : What is uniqueness?**

- A measure of replicability of the factor
**The amount of variance not explained by the factor structure**- The amount of variance explained by the factor structure
- The amount of variance explained by the factor
- None of the above

**Question 7 : Suppose you are looking to extract the major dimensions of a parrot’s personality. Which technique would you use?**

- Maximum likelihood
- Principal component analysis
- Cluster analysis
**Factor analysis**- None of the above

**Question 8 : Suppose you have 60 variables in a dataset, and you know that 2 components explain the data very well. How many components can you extract?**

- 45
- 5
**60**- 2
- None of the above

**Question 9 : When would you use an orthogonal rotation?**

- When correlations between the variables are large
- When you observe small correlations between the variables in the dataset
**When you think that the factors are uncorrelated**- All of the above
- None of the above

**Question 10 : When would you use confirmatory factor analysis?**

**When you want t**o validate the factor solution- When you want to explain the variance in the matrix accounting for the measurement error
- When you want to identify the factors
- Two of the above
- None of the above

**Question 11 : Which of the following is NOT a rule when deciding on the number of factors?**

**Newman-Frank Test**- Percentage of common variance explained
- Scree test
- Kaiser-Guttman
- None of the above

**Question 12 : What is one assumption of factor analysis?**

- A number of factors can be determined via the Scree test
- Factor analysis will extract only unique factors
**A latent variable causes the variance in observed variables**- There is no measurement error
- None of the above

**Question 13 : What is an eigenvector?**

- The proportion of the variance explained in the matrix
- A higher-order dimension that subsumes all of the lower-order errors
**A higher-order dimension that subsumes similar lower-order dimensions**- A higher-order dimension that subsumes all lower-order dimensions
- None of the above

**Question 14 : What is a promax rotation?**

- A rotation method that minimizes the square loadings on each factor
**A rotation method that maximizes the variance explained**- A rotation method that maximizes the square loadings on each factor
- A rotation method that minimizes the variance explained
- None of the above

**Question 15 : What is the cut-off point for the Common Variance Explained rule?**

- 80% of variance explained
**50% of variance explained**- 3 variables
- 1 unit
- None of the above

**Question 16 : Why would you try to reduce dimensions?**

- Individuals need to be placed into groups
- Variables are highly-correlated
**Many variables are likely assessing the same thing**- Two of the above
- All of the above

**Question 17: If you have 20 variables in a dataset, how many dimensions are there?**

**At most 20**- At least 20
- As many as the number of factors you can extract
- Not enough information
- None of the above

**Question 18 : What term describes the amount of variance of each variable explained by the factor structure?**

- Eigenvector
- Commonality
- Similarity
**Communality**- None of the above

**Question 19: What package contains the necessary functions to perform PCA and EFA?**

- ggplot2
- FA
**psych**- factAnalis
- None of the above

**Question 20 : What is the best method for identifying the number of factors to extract?**

**Parallel Analysis**- Scree test
- Newman-Frank Test
- Percentage of common variance explained
- All of the above