Data professionals with varying titles use the Databricks SQL service as the primary touchpoint with the Databricks Lakehouse Platform. However, some users will use other services like Databricks Machine Learning or Databricks Data Science and Engineering.
Which of the following roles uses Databricks SQL as a secondary service while primarily using one of the other services?
A. Business analyst
B. SQL analyst
C. Data engineer
D. Business intelligence analyst
E. Data analyst
A data analyst has set up a SQL query to run every four hours on a SQL endpoint, but the SQL endpoint is taking too long to start up with each run.
Which of the following changes can the data analyst make to reduce the start-up time for the endpoint while managing costs?
A. Reduce the SQL endpoint cluster size
B. Increase the SQL endpoint cluster size
C. Turn off the Auto stop feature
D. Increase the minimum scaling value
E. Use a Serverless SQL endpoint
A data analyst needs to use the Databricks Lakehouse Platform to quickly create SQL queries and data visualizations. It is a requirement that the compute resources in the platform can be made serverless, and it is expected that data visualizations can be placed within a dashboard.
Which of the following Databricks Lakehouse Platform services/capabilities meets all of these requirements?
A. Delta Lake
B. Databricks Notebooks
C. Tableau
D. Databricks Machine Learning
E. Databricks SQL
Which of the following should data analysts consider when working with personally identifiable information (PII) data?
A. Organization-specific best practices for PII data
B. Legal requirements for the area in which the data was collected
C. None of these considerations
D. Legal requirements for the area in which the analysis is being performed
E. All of these considerations
A data analyst runs the following command:
SELECT age, country FROM my_table WHERE age >= 75 AND country = 'canada';
Which of the following tables represents the output of the above command?
A. Option A
B. Option B
C. Option C
D. Option D
E. Option E
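The answer tables for this question are not reproduced here. As a rough illustration of how the filter in the command behaves, the following Python sketch runs the same SELECT against an in-memory SQLite table. SQLite is only a stand-in for Databricks SQL, and the sample rows are invented for this example. It shows that both conditions must hold for a row to be returned, and that the comparison with 'canada' is case-sensitive under SQLite's default collation (Databricks SQL is assumed to behave the same way by default).

```python
import sqlite3

# Hypothetical sample rows; the original answer tables are not shown in this document.
rows = [
    (80, "canada"),
    (75, "canada"),   # boundary value: 75 satisfies age >= 75
    (74, "canada"),   # fails the age condition
    (90, "Canada"),   # fails the country condition (different capitalization)
    (82, "mexico"),   # fails the country condition
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE my_table (age INTEGER, country TEXT)")
conn.executemany("INSERT INTO my_table VALUES (?, ?)", rows)

# Same command as in the question, run unchanged.
result = conn.execute(
    "SELECT age, country FROM my_table WHERE age >= 75 AND country = 'canada'"
).fetchall()

# Sort in Python so the printed order does not depend on the engine.
sorted_result = sorted(result)
print(sorted_result)
```

Only rows that pass both predicates appear in the output, and the output contains exactly the two selected columns.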
They are using the following incomplete command:
Which of the following lines of code can they use to fill in the blank in the above code block so that it successfully completes the task?
A. array_distinct(products)
B. explode(products)
C. reduce(products)
D. array(products)
E. flatten(products)
A data analyst has created a user-defined function using the following line of code:
CREATE FUNCTION price(spend DOUBLE, units DOUBLE)
RETURNS DOUBLE
RETURN spend / units;
Which of the following code blocks can be used to apply this function to the customer_spend and customer_units columns of the table customer_summary to create column customer_price?
A. SELECT PRICE customer_spend, customer_units AS customer_price FROM customer_summary
B. SELECT price FROM customer_summary
C. SELECT function(price(customer_spend, customer_units)) AS customer_price FROM customer_summary
D. SELECT double(price(customer_spend, customer_units)) AS customer_price FROM customer_summary
E. SELECT price(customer_spend, customer_units) AS customer_price FROM customer_summary
A data analyst has been asked to count the number of customers in each region and has written the following query:
If there is a mistake in the query, which of the following describes the mistake?
A. The query is using count(*), which will count all the customers in the customers table, no matter the region.
B. The query is missing a GROUP BY region clause.
C. The query is using ORDER BY, which is not allowed in an aggregation.
D. There are no mistakes in the query.
E. The query is selecting region, but region should only occur in the ORDER BY clause.
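The query for this question is not reproduced here, so the sketch below does not evaluate it. It only illustrates the general task of counting customers per region, using Python's sqlite3 module as a stand-in for Databricks SQL. The customers table, its columns, and its rows are all hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical table and data, invented for this illustration.
conn.execute("CREATE TABLE customers (customer_id INTEGER, region TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?)",
    [(1, "east"), (2, "west"), (3, "east"), (4, "north"), (5, "east")],
)

# An aggregate alongside a plain column needs a grouping on that column
# to produce one count per region; ORDER BY only sorts the grouped rows.
counts = conn.execute(
    "SELECT region, count(*) AS customer_count "
    "FROM customers "
    "GROUP BY region "
    "ORDER BY region"
).fetchall()
print(counts)
```

Each output row pairs a region with the number of customers in it, and the counts sum to the total number of rows in the table.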
A data analyst has created a Query in Databricks SQL, and now they want to create two data visualizations from that Query and add both of those data visualizations to the same Databricks SQL Dashboard. Which of the following steps will they need to take when creating and adding both data visualizations to the Databricks SQL Dashboard?
A. They will need to alter the Query to return two separate sets of results.
B. They will need to add two separate visualizations to the dashboard based on the same Query.
C. They will need to create two separate dashboards.
D. They will need to decide on a single data visualization to add to the dashboard.
E. They will need to copy the Query and create one data visualization per query.
A data analyst has been asked to provide a list of options on how to share a dashboard with a client. It is a security requirement that the client does not gain access to any other information, resources, or artifacts in the database. Which of the following approaches cannot be used to share the dashboard and meet the security requirement?
A. Download the Dashboard as a PDF and share it with the client.
B. Set a refresh schedule for the dashboard and enter the client's email address in the "Subscribers" box.
C. Take a screenshot of the dashboard and share it with the client.
D. Generate a Personal Access Token that is good for 1 day and share it with the client.
E. Download a PNG file of the visualizations in the dashboard and share them with the client.