You are creating an Oracle Cloud Infrastructure (OCI) Data Science job that will run on a recurring
basis in a production environment. This job will pick up sensitive data from an Object Storage
bucket, train a model, and save it to the model catalog.
How would you design the authentication mechanism for the job?
The feature type TechJob has the following registered validators:
TechJob.validator.register(name='is_tech_job', handler=is_tech_job_default_handler)
TechJob.validator.register(name='is_tech_job', handler=is_tech_job_open_handler, condition=('job_family',))
TechJob.validator.register(name='is_tech_job', handler=is_tech_job_closed_handler, condition={'job_family': 'IT'})
When you run is_tech_job(job_family='Engineering'), what does the feature type validator system do?
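The dispatch behaviour this question asks about can be explored with a toy registry. This is a minimal sketch, not the ADS implementation: the `ToyValidator` class and the empty handler bodies are hypothetical, and the lookup order (exact key/value match, then key-only match, then the default) is an assumption modelled on how the ADS documentation describes closed, open, and default conditions.

```python
class ToyValidator:
    """Hypothetical stand-in for a feature type validator registry."""

    def __init__(self):
        self._default = None  # handler registered without a condition
        self._open = {}       # frozenset of argument names -> handler
        self._closed = {}     # frozenset of (name, value) pairs -> handler

    def register(self, handler, condition=None):
        if condition is None:
            self._default = handler
        elif isinstance(condition, dict):
            # Closed condition: argument names and values must both match.
            self._closed[frozenset(condition.items())] = handler
        else:
            # Open condition: only the argument names must match.
            self._open[frozenset(condition)] = handler

    def dispatch(self, **kwargs):
        # Assumed order: closed match first, then open match, then default.
        closed_key = frozenset(kwargs.items())
        if closed_key in self._closed:
            return self._closed[closed_key]
        open_key = frozenset(kwargs)
        if open_key in self._open:
            return self._open[open_key]
        return self._default


def is_tech_job_default_handler():
    pass


def is_tech_job_open_handler():
    pass


def is_tech_job_closed_handler():
    pass


validator = ToyValidator()
validator.register(is_tech_job_default_handler)
validator.register(is_tech_job_open_handler, condition=("job_family",))
validator.register(is_tech_job_closed_handler, condition={"job_family": "IT"})

# Which handler would be chosen for each call pattern in this toy model.
print(validator.dispatch(job_family="Engineering").__name__)
print(validator.dispatch(job_family="IT").__name__)
print(validator.dispatch().__name__)
```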
Where do calls to stdout and stderr from score.py go in a model deployment?
For your next data science project, you need access to public geospatial images.
Which Oracle Cloud service provides free access to those images?
You are a data scientist leveraging the Oracle Cloud Infrastructure (OCI) Language AI service for
various types of text analyses. Which TWO capabilities can you utilize with this tool?
You are a data scientist working for a utilities company. You have developed an algorithm that
detects anomalies from a utility reader in the grid. The size of the model artifact is about 2 GB, and
you are trying to store it in the model catalog. Which three interfaces could you use to save the
model artifact into the model catalog?
When preparing your model artifact to save it to the Oracle Cloud Infrastructure (OCI) Data Science model catalog, you create a score.py file. What is the purpose of the score.py file?
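For orientation, a score.py conventionally defines two functions, `load_model()` and `predict()`. The sketch below shows that general shape only. The doubling "model" is a hypothetical stand-in so the example runs on its own; a real artifact would deserialize a saved model file inside `load_model()`.

```python
import json


def load_model():
    """Return the model object used for scoring."""
    # Hypothetical stand-in: a "model" that doubles every input value.
    # A real score.py would load a serialized model from the artifact directory.
    return lambda rows: [2 * x for x in rows]


def predict(data, model=load_model()):
    """Score the request payload and return a JSON-serializable result."""
    # Accept either a JSON string or an already-parsed list.
    rows = json.loads(data) if isinstance(data, str) else data
    return {"prediction": model(rows)}


print(predict([1, 2, 3]))
```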
As you are working in your notebook session, you find that your notebook session does not have
enough compute CPU and memory for your workload.
How would you scale up your notebook session without losing your work?
You are a data scientist designing an air traffic control model, and you choose to leverage Oracle
AutoML. You understand that the Oracle AutoML pipeline consists of multiple stages and
automatically operates in a certain sequence. What is the correct sequence for the Oracle AutoML
pipeline?
You are attempting to save a model from a notebook session to the model catalog by using the
Accelerated Data Science (ADS) SDK, with resource principal as the authentication signer, and you
get a 404 authentication error. Which two things should you check to ensure permissions are set up
correctly?
What preparation steps are required to access an Oracle AI service SDK from a Data Science notebook session?
You are building a model and need input that represents data as morning, afternoon, or evening. However, the data contains a timestamp. Which part of the Data Science life cycle would you be in when creating the new variable?
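The kind of derived variable described here can be sketched as follows. The function name `part_of_day` and the cut-off hours (12:00 and 18:00) are assumptions for illustration; the question does not define them.

```python
from datetime import datetime


def part_of_day(ts: datetime) -> str:
    """Map a timestamp to 'morning', 'afternoon', or 'evening'."""
    # Assumed cut-offs: before 12:00 is morning, before 18:00 is afternoon.
    if ts.hour < 12:
        return "morning"
    if ts.hour < 18:
        return "afternoon"
    return "evening"


timestamps = [
    datetime(2023, 5, 1, 8, 30),
    datetime(2023, 5, 1, 14, 0),
    datetime(2023, 5, 1, 21, 45),
]
labels = [part_of_day(ts) for ts in timestamps]
print(labels)
```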
You have created a conda environment in your notebook session. This is the first time you are
working with published conda environments. You have also created an Object Storage bucket with
permission to manage the bucket.
Which two commands are required to publish the conda environment?
You train a model to predict housing prices for your city. Which two metrics from the
Accelerated Data Science (ADS) ADSEvaluator class can you use to evaluate the regression model?
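As background, several standard regression metrics can be computed by hand. This sketch uses the textbook definitions and made-up sample values; it does not call ADS, and the function name `regression_metrics` is hypothetical.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return common regression metrics using their textbook definitions."""
    n = len(y_true)
    residuals = [t - p for t, p in zip(y_true, y_pred)]
    mse = sum(r * r for r in residuals) / n
    mae = sum(abs(r) for r in residuals) / n
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    ss_res = sum(r * r for r in residuals)
    return {
        "mse": mse,
        "rmse": math.sqrt(mse),
        "mae": mae,
        "r2": 1 - ss_res / ss_tot,
    }


# Made-up prices, for illustration only.
metrics = regression_metrics([3.0, 5.0, 7.0], [2.0, 5.0, 9.0])
print(metrics)
```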
You want to ensure that all stdout and stderr from your code are automatically collected and
logged, without implementing additional logging in your code. How would you achieve this with Data
Science Jobs?
As a data scientist, you create models for cancer prediction based on mammographic images.
Correct identification is crucial in this case. After evaluating two models, you arrive at the
following metrics:
• Model 1 has a test accuracy of 80% and a recall of 70%.
• Model 2 has a test accuracy of 75% and a recall of 85%.
Which model would you prefer and why?
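To see how accuracy and recall relate to the underlying confusion-matrix counts, the sketch below uses hypothetical counts chosen to reproduce the stated figures. It assumes a test set of 200 images with 100 actual positives; the question gives only the percentages, not these counts.

```python
def accuracy(tp, fp, fn, tn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + fp + fn + tn)


def recall(tp, fn):
    """Fraction of actual positives that the model detects."""
    return tp / (tp + fn)


# Hypothetical counts: 100 actual positives and 100 actual negatives per model.
models = {
    "Model 1": {"tp": 70, "fp": 10, "fn": 30, "tn": 90},
    "Model 2": {"tp": 85, "fp": 35, "fn": 15, "tn": 65},
}

for name, c in models.items():
    acc = accuracy(c["tp"], c["fp"], c["fn"], c["tn"])
    rec = recall(c["tp"], c["fn"])
    # "fn" is the number of cancer cases the model would miss.
    print(f"{name}: accuracy={acc:.2f}, recall={rec:.2f}, missed={c['fn']}")
```

With these assumed counts, the lower-recall model misses 30 of 100 positive cases and the higher-recall model misses 15.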
The Accelerated Data Science (ADS) model evaluation classes support different types of machine
learning modeling techniques. Which three types of modeling techniques are supported by ADS
Evaluators?
While reviewing your data, you discover that your data set has a class imbalance. You are aware
that the Accelerated Data Science (ADS) SDK provides multiple built-in automatic transformation
tools for data set transformation. Which would be the right tool to correct any imbalance between
the classes?
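The general idea of correcting a class imbalance can be sketched with random oversampling of the minority class. This is a plain-Python illustration with made-up data and a hypothetical `oversample` function; it does not call ADS, and the ADS tooling may use a different resampling strategy.

```python
import random
from collections import Counter


def oversample(rows, labels, seed=0):
    """Duplicate randomly chosen minority-class rows until all classes match the largest."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        # Rows belonging to this class, used as the sampling pool.
        pool = [r for r, y in zip(rows, labels) if y == cls]
        extra = [rng.choice(pool) for _ in range(target - n)]
        out_rows.extend(extra)
        out_labels.extend([cls] * len(extra))
    return out_rows, out_labels


# Made-up data set: 8 rows of class 0 and 2 rows of class 1.
rows = list(range(10))
labels = [0] * 8 + [1] * 2
balanced_rows, balanced_labels = oversample(rows, labels)
print(Counter(balanced_labels))
```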