CertNexus Certified Artificial Intelligence Practitioner (CAIP) Questions and Answers
Which of the following pieces of AI technology provides the ability to create fake videos?
You create a prediction model with 96% accuracy. While the model's true positive rate (TPR) is performing well at 99%, the true negative rate (TNR) is only 50%. Your supervisor tells you that the TNR needs to be higher, even if it decreases the TPR. Upon further inspection, you notice that the vast majority of your data is truly positive.
What method could help address your issue?
The graph is an elbow plot showing the inertia or within-cluster sum of squares on the y-axis and number of clusters (also called K) on the x-axis, denoting the change in inertia as the clusters change using k-means algorithm.
What would be an optimal value of K to ensure a good number of clusters?
Which type of regression represents the following formula: y = c + b*x, where y = estimated dependent variable score, c = constant, b = regression coefficient, and x = score on the independent variable?
Which of the following items should be included in a handover to the end user to enable them to use and run a trained model on their own system? (Select three.)
Which of the following unsupervised learning models can a bank use for fraud detection?
For each of the last 10 years, your team has been collecting data from a group of subjects, including their age and numerous biomarkers collected from blood samples. You are tasked with creating a prediction model of age using the biomarkers as input. You start by performing a linear regression using all of the data over the 10-year period, with age as the dependent variable and the biomarkers as predictors.
Which assumption of linear regression is being violated?
Which of the following can benefit from deploying a deep learning model as an embedded model on edge devices?
Which of the following tests should be performed at the production level before deploying a newly retrained model?
We are using the k-nearest neighbors algorithm to classify the new data points. The features are on different scales.
Which method can help us to solve this problem?
Which of the following is NOT a valid cross-validation method?
Which of the following is TRUE about SVM models?
Which of the following regressions will help when there is the existence of near-linear relationships among the independent variables (collinearity)?
Which of the following models are text vectorization methods? (Select two.)
R-squared is a statistical measure that:
When working with textual data and trying to classify text into different languages, which approach to representing features makes the most sense?
A dataset can contain a range of values that depict a certain characteristic, such as grades on tests in a class during the semester. A specific student has so far received the following grades: 76,81, 78, 87, 75, and 72. There is one final test in the semester. What minimum grade would the student need to achieve on the last test to get an 80% average?
What is the open framework designed to help detect, respond to, and remediate threats in ML systems?
Which of the following describes a benefit of machine learning for solving business problems?
You and your team need to process large datasets of images as fast as possible for a machine learning task. The project will also use a modular framework with extensible code and an active developer community. Which of the following would BEST meet your needs?
What is Word2vec?
In a self-driving car company, ML engineers want to develop a model for dynamic pathing. Which of following approaches would be optimal for this task?
Below are three tables: Employees, Departments, and Directors.
Employee_Table
Department_Table
Director_Table
ID
Firstname
Lastname
Age
Salary
DeptJD
4566
Joey
Morin
62
$ 122,000
1
1230
Sam
Clarck
43
$ 95,670
2
9077
Lola
Russell
54
$ 165,700
3
1346
Lily
Cotton
46
$ 156,000
4
2088
Beckett
Good
52
$ 165,000
5
Which SQL query provides the Directors' Firstname, Lastname, the name of their departments, and the average employee's salary?
In general, models that perform their tasks:
Which of the following best describes distributed artificial intelligence?
You are building a prediction model to develop a tool that can diagnose a particular disease so that individuals with the disease can receive treatment. The treatment is cheap and has no side effects. Patients with the disease who don't receive treatment have a high risk of mortality.
It is of primary importance that your diagnostic tool has which of the following?
Which of the following metrics is being captured when performing principal component analysis?