Oracle Cloud Infrastructure Data Science 2023 Professional Questions and Answers
Which feature of the Oracle Cloud Infrastructure (OCI) Vision service helps you generate in-dexing tags for a collection of marketing photographs?
Which two statements are true about published conda environments?
You are preparing a configuration object necessary to create a Data Flow application. Which THREE parameter values should you provide?
While reviewing your data, you discover that your data set has a class imbalance. You are aware that the Accelerated Data Science (ADS) SDK provides multiple built-in automatic transformation tools for data set transformation. Which would be the right tool to correct any imbalance between the classes?
You have trained three different models on your data set using Oracle AutoML. You want to
visualize the behavior of each of the models, including the baseline model, on the test set. Which
class should be used from the Accelerated Data Science (ADS) SDK to visually compare the models?
Which of the following TWO non-open source JupyterLab extensions has Oracle Cloud In-frastructure (OCI) Data Science developed and added to the notebook session experience?
You are building a model and need input that represents data as morning, afternoon, or evening. However, the data contains a time stamp. What part of the Data Science life cycle would you be in when creating the new variable?
You are a data scientist trying to load data into your notebook session. You understand that Accelerated Data Science (ADS) SDK supports loading various data formats. Which of the following THREE are ADS supported data formats?
You have an embarrassingly parallel or distributed batch job on a large amount of data that you
consider running using Data Science Jobs. What would be the best approach to run the workload?
As a data scientist, you are working on a global health data set that has data from more than 50
countries. You want to encode three features such as 'countries', 'race' and 'body organ' as
categories.
Which option would you use to encode the categorical feature?
Which Oracle Accelerated Data Science (ADS) classes can be used for easy access to data sets from
reference libraries and index websites such as scikit-learn?
You want to write a Python script to create a collection of different projects for your data science
team. Which Oracle Cloud Infrastructure (OCI) Data Science interface would you use?
You are a data scientist leveraging the Oracle Cloud Infrastructure (OCI) Language AI service for
various types of text analyses. Which TWO capabilities can you utilize with this tool?
You have created a Data Science project in a compartment called Development and shared it
with a group of collaborators. You now need to move the project to a different compartment called
Production after completing the current development iteration.
Which statement is correct?
You want to make your model more parsimonious to reduce the cost of collecting and processing data. You plan to do this by removing features that are highly correlated. You would like to create a heat map that displays the correlation so that you can identify candidate features to remove. Which Accelerated Data Science (ADS) SDK method would be appropriate to display the correlation between Continuous and Categorical features?
You loaded data into Oracle Cloud Infrastructure (OCI) Data Science. To transform the data, you
want to use the Accelerated Data Science (ADS) SDK. When you applied the get_recommendations ()
tool to the ADSDataset object, it showed you user-detected issues with all the recommended
changes to apply to the dataset. Which option should you use to apply all the recommended
transformations at once?
You are asked to prepare data for a custom-built model that requires transcribing Spanish video
recordings into a readable text format with profane words identified.
Which Oracle Cloud service would you use?
You are working as a data scientist for a healthcare company. They decide to analyze the data to
find patterns in a large volume of electronic medical records. You are asked to build a PySpark
solution to analyze these records in a JupyterLab notebook. What is the order of recommended
steps to develop a PySpark application in Oracle Cloud Infrastructure (OCI) Data Science?
Which Oracle Cloud Infrastructure (OCI) service should you use to create and run Spark
applications using ADS?
Select two reasons why it is important to rotate encryption keys when using Oracle Cloud
Infrastructure (OCI) Vault to store credentials or other secrets.
You have just completed analyzing a set of images by using Oracle Cloud Infrastructure (OCI) Data
Labelling, and you want to export the annotated data. Which two formats are supported?
You want to ensure that all stdout and stderr from your code are automatically collected and
logged, without implementing additional logging in your code. How would you achieve this with Data
Science Jobs?
You want to evaluate the relationship between feature values and target variables. You have a
large number of observations having a near uniform distribution and the features are highly
correlated.
Which model explanation technique should you choose?
Youare a data scientist working for a manufacturing company. You have developed a forecasting
model to predict the sales demand in the upcoming months. You created a model artifact that
contained custom logic requiring third party libraries. When you deployed the model, it failed to run
because you did not include all the third party dependencies in the model artifact. What file should
be modified to include the missing libraries?