Winter Special Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: dumps65

Snowflake DSA-C02 Dumps

Page: 1 / 7
Total 65 questions

SnowPro Advanced: Data Scientist Certification Exam Questions and Answers

Question 1

Secure Data Sharing do not let you share which of the following selected objects in a database in your account with other Snowflake accounts?

Options:

A.

Sequences

B.

Tables

C.

External tables

D.

Secure UDFs

Question 2

Mark the incorrect statement regarding Python UDF?

Options:

A.

Python UDFs can contain both new code and calls to existing packages

B.

For each row passed to a UDF, the UDF returns either a scalar (i.e. single) value or, if defined as a table function, a set of rows.

C.

A UDF also gives you a way to encapsulate functionality so that you can call it repeatedly from multiple places in code

D.

A scalar function (UDF) returns a tabular value for each input row

Question 3

You previously trained a model using a training dataset. You want to detect any data drift in the new data collected since the model was trained.

What should you do?

Options:

A.

Create a new dataset using the new data and a timestamp column and create a data drift monitor that uses the training dataset as a baseline and the new dataset as a target.

B.

Create a new version of the dataset using only the new data and retrain the model.

C.

Add the new data to the existing dataset and enable Application Insights for the service where the model is deployed.

D.

Retrained your training dataset after correcting data outliers & no need to introduce new data.

Question 4

All Snowpark ML modeling and preprocessing classes are in the ________ namespace?

Options:

A.

snowpark.ml.modeling

B.

snowflake.sklearn.modeling

C.

snowflake.scikit.modeling

D.

snowflake.ml.modeling

Question 5

What Can Snowflake Data Scientist do in the Snowflake Marketplace as Provider?

Options:

A.

Publish listings for free-to-use datasets to generate interest and new opportunities among the Snowflake customer base.

B.

Publish listings for datasets that can be customized for the consumer.

C.

Share live datasets securely and in real-time without creating copies of the data or im-posing data integration tasks on the consumer.

D.

Eliminate the costs of building and maintaining APIs and data pipelines to deliver data to customers.

Question 6

Which of the following metrics are used to evaluate classification models?

Options:

A.

Area under the ROC curve

B.

F1 score

C.

Confusion matrix

D.

All of the above

Question 7

Which of the Following is not type of Windows function in Snowflake?

Options:

A.

Rank-related functions.

B.

Window frame functions.

C.

Aggregation window functions.

D.

Association functions.

Question 8

What is the formula for measuring skewness in a dataset?

Options:

A.

MEAN - MEDIAN

B.

MODE - MEDIAN

C.

(3(MEAN - MEDIAN))/ STANDARD DEVIATION

D.

(MEAN - MODE)/ STANDARD DEVIATION

Question 9

Which Python method can be used to Remove duplicates by Data scientist?

Options:

A.

remove_duplicates()

B.

duplicates()

C.

drop_duplicates()

D.

clean_duplicates()

Question 10

To return the contents of a DataFrame as a Pandas DataFrame, Which of the following method can be used in SnowPark API?

Options:

A.

REPLACE_TO_PANDAS

B.

SNOWPARK_TO_PANDAS

C.

CONVERT_TO_PANDAS

D.

TO_PANDAS

Question 11

Which ones are the type of visualization used for Data exploration in Data Science?

Options:

A.

Heat Maps

B.

Newton AI

C.

Feature Distribution by Class

D.

2D-Density Plots

E.

Sand Visualization

Question 12

There are a couple of different types of classification tasks in machine learning, Choose the Correct Classification which best categorized the below Application Tasks in Machine learning?

· To detect whether email is spam or not

· To determine whether or not a patient has a certain disease in medicine.

· To determine whether or not quality specifications were met when it comes to QA (Quality Assurance).

Options:

A.

Multi-Label Classification

B.

Multi-Class Classification

C.

Binary Classification

D.

Logistic Regression

Question 13

Which type of Python UDFs let you define Python functions that receive batches of input rows as Pandas DataFrames and return batches of results as Pandas arrays or Series?

Options:

A.

MPP Python UDFs

B.

Scaler Python UDFs

C.

Vectorized Python UDFs

D.

Hybrid Python UDFs

Question 14

Which type of Machine learning Data Scientist generally used for solving classification and regression problems?

Options:

A.

Supervised

B.

Unsupervised

C.

Reinforcement Learning

D.

Instructor Learning

E.

Regression Learning

Question 15

Which of the following is a Python-based web application framework for visualizing data and analyzing results in a more efficient and flexible way?

Options:

A.

StreamBI

B.

Streamlit

C.

Streamsets

D.

Rapter

Question 16

Consider a data frame df with columns ['A', 'B', 'C', 'D'] and rows ['r1', 'r2', 'r3']. What does the ex-pression df[lambda x : x.index.str.endswith('3')] do?

Options:

A.

Returns the row name r3

B.

Results in Error

C.

Returns the third column

D.

Filters the row labelled r3

Question 17

Which are the following additional Metadata columns Stream contains that could be used for creating Efficient Data science Pipelines & helps in transforming only the New/Modified data only?

Options:

A.

METADATA$ACTION

B.

METADATA$FILE_ID

C.

METADATA$ISUPDATE

D.

METADATA$DELETE

E.

METADATA$ROW_ID

Question 18

Which of the learning methodology applies conditional probability of all the variables with respec-tive the dependent variable?

Options:

A.

Reinforcement learning

B.

Unsupervised learning

C.

Artificial learning

D.

Supervised learning

Question 19

Which one is not Types of Feature Scaling?

Options:

A.

Economy Scaling

B.

Min-Max Scaling

C.

Standard Scaling

D.

Robust Scaling

Page: 1 / 7
Total 65 questions