Fundamentals of the Databricks Lakehouse Platform Accreditation

Multiple Choice 1) Which of the following is a benefit of the Databricks Lakehouse Platform being designed to support all data and artificial intelligence (AI) workloads? Select four responses.

Data workloads can be automatically scaled when needed.
Data teams can all utilize secure data from a single source to deliver reliable, consistent results across workloads at scale.
There is increased need for multiple, specialist platform administrators to maintain each component of the unified platform.Data analysts, data engineers, and data scientists can easily collaborate within a single platform.
Analysts can easily integrate their favorite business intelligence (BI) tools for further analysis.

Multiple Choice 2) Which of the following describes what challenges a data organization would likely face when migrating from a data warehouse to a data lake? Select two responses.

There are increased performance speeds in a data lake.
There are increased security and privacy concerns in a data lake.
There are increased data quality guarantees in a data lake
There are increased data reliability issues in a data lake.There are increased cloud storage costs in a data lake.

Multiple Choice 3) Data organizations need specialized environments designed specifically for machine learning workloads. Which of the following is made available by Databricks as part of Databricks Machine Learning to support machine learning workloads? Select four responses.

Built-in automated machine learning development
Built-in real-time model serving
Support for distributed model training on big data
Lakehouse-specific deep learning frameworks
Optimized and preconfigured machine learning frameworks

Single choice 4) One of the foundational technologies provided by the Databricks Lakehouse Platform is an open-source, file-based storage format that provides a number of benefits. These benefits include ACID transaction guarantees, scalable data and metadata handling, audit history and time travel, table schema enforcement and schema evolution, support for deletes/updates/merges, and unified streaming and batch data processing. Which of the following technologies is being described in the above statement? Select one response.

Single choice 5) Which of the following lists the relational entities in order from largest (most coarse) to smallest (most granular) within their hierarchy? Select one response.

Schema (Database) → Metastore → Catalog → Table
Catalog → Metastore → Schema (Database) → Table
Metastore → Catalog → Table → Schema (Database)
Schema (Database) → Catalog → Table → Metastore
Metastore → Catalog → Schema (Database) → Table

Single choice 6) In the past, a lot of data engineering resources needed to be contributed to the development of tooling and other mechanisms for creating and managing data workloads. In response, Databricks developed and released a declarative ETL framework so data engineers can focus on helping their organizations get value from their data. Which of the following technologies is being described above? Select one response.

Multiple Choice 7) Which of the following architecture benefits is provided directly by the Databricks Lakehouse Platform? Select three responses.

Available on and across multiple cloudsBuilt on open source and open standards
Scalable, redundant cloud-based data storage
Efficient on-premises optimized hardwareUnified security and governance approach for all data assets

Multiple Choice 8) Many organizations use a variety of open-source and proprietary tools for data orchestration, but these tools often have their own limitations. To address the orchestration needs of these organizations, Databricks developed Databricks Workflows. Which of the following is a benefit of using Databricks Workflows for orchestration purposes? Select two responses.

Databricks Workflows supports tasks for data ingestion, data engineering, machine learning, and business intelligence (BI)
Databricks Workflows provides Git-backed version control capabilities to notebooks
Databricks Workflows supports workloads across multiple cloud service providers and tools
Databricks Workflows supports automating workloads as long as they are not in notebooksDatabricks Workflows provides multiple-task workflow functionality only for Delta Live Tables workloads

Single choice 9) While the Databricks Lakehouse Platform provides support for many types of data, analytics, and machine learning workloads, some organizations prefer to continue using other preferred vendors for use cases like data ingestion, data transformation, business intelligence, and machine learning.

Databricks can be used on-premises to allow for secure, in-house integrations.
Databricks can be used locally to allow developers to manually integrate with other systems.
Databricks cannot be used alongside other big data tools and platforms.
Databricks can use cloud service provider capabilities to efficiently share data with other data tools and platforms.
Databricks can be integrated directly with a large number of Databricks partners.

Multiple Choice 10) Which of the following correctly describes how a specific capability of the Databricks Lakehouse Platform supports a data streaming pattern? Select three responses.

Databricks Workflows automatically passes data from task to task in regular microbatches.
Auto Loader continuously and incrementally ingests streaming data.
Structured Streaming enables stream-based machine learning inference.
Delta Live Tables processes ETL pipelines on streaming data with advanced monitoring mechanisms.
MLflow ingests its automatic experiment tracking data into a stream for continuous monitoring.

Multiple Choice 11) Which of the following is a common problem within a data lake architecture that can be easily solved by using the Databricks Lakehouse Platform? Select three responses.

Inability to use open-source data formats
Too many small filesLack of cloud service integrationsLack of ACID transaction support
Ineffective partitioning

Single choice 12) Unity Catalog offers improved Lakehouse data object governance and organization capabilities for data segregation. Which of the following is a consequence of using Unity Catalog to manage, organize and segregate data objects? Select one response.

Single choice 13) It can be challenging for a data lakehouse to provide both performance and scalability for all of its query-based workloads to the standards of a data warehouse and a data lake. As a result, Databricks has introduced a technology built atop Apache Spark to further speed up and scale these varied workloads. Which of the following technologies is being described in the above statement? Select one response.

Multiple Choice 14) In which of the following ways do serverless compute resources differ from classic compute resources within the Databricks Lakehouse Platform? Select two responses.

They are always running and reserved for a single, specific customer when needed
They result in lower costs by not overprovisioningThey are located within the cloud
They exist within the Databricks cloud accountThey exist within the customer cloud account

Multiple Choice 15)The Databricks Lakehouse Platform architecture consists of a control plane and a data plane. Which of the following resources exists within the Databricks control plane? Select two responses.

Multiple Choice 16) Maintaining and improving data quality is a major goal of modern data engineering. Which of the following contributes directly to high levels of data quality within the Databricks Lakehouse Platform? Select two responses.

Data expectations enforcement
Apache Spark’s data format flexibility
Table schema evolution
Simplified machine learning model serving
Business intelligence (BI) tool integrations

Single choice 17) Data sharing has traditionally been performed by proprietary vendor solutions, SSH File Transfer Protocol (SFTP), or cloud-specific solutions. However, each of these sharing tools and solutions comes with its own set of limitations. As a result, Databricks helped to develop the solution, Delta Sharing. Which of the following describes Delta Sharing as a solution for data sharing? Select one response.

Delta Sharing is a multicloud, proprietary solution for efficiently copying and transferring data from the lakehouse to any external system.
Delta Sharing is a multicloud, proprietary solution to securely and efficiently share data while maintaining control of the source data.
Delta Sharing is a multicloud, open-source solution for distributing data across a number of compute resources for efficient data shuffling.
Delta Sharing is a multicloud, open-source solution to securely and efficiently share live data from the lakehouse to any external system.
Delta Sharing is a multicloud, open-source solution to share data between Databricks workspaces within a single Databricks account.

Single choice 18) Which of the following Databricks Lakehouse Platform services or capabilities provides a data warehousing experience to its users? Select one response.

Multiple Choice 19) Which of the following data engineering capabilities simplifies the work of data engineers on the Databricks Lakehouse Platform? Select three responses.

SQL and Python development compatibility
End-to-end data pipeline visibility
Automatic deployment and data operations
Serverless cluster startup timesFlexible machine learning development solutions

Multiple Choice 20) Which of the following is a security feature made available in the Databricks Lakehouse Platform by Unity Catalog? Select two responses.

Single-source-of-truth identity management
Databricks SQL warehouse access control
Fine-grained access control on data objects
Workspace-specific identity management
Workspace-specific data metastores

Single choice 21) Which of the following do Databricks SQL users experience when using serverless Databricks SQL warehouses rather than classic Databricks SQL warehouses? Select one response.

Expedited environment startup
Availability of automatic scaling
Performance degradation on long-running queries
Availability of Photon
Increased total cost of use

Multiple Choice 22) Which of the following compute resources is available in the Databricks Lakehouse Platform? Select two responses.

Multiple Choice 23) A data architect is evaluating data warehousing solutions for their organization to use. As a part of this, the architect is considering the Databricks Lakehouse Platform. Which of the following is a benefit of using the Databricks Lakehouse Platform for warehousing? Select four responses.

Built-in governance for single-source-of-truth data
A rich ecosystem of business intelligence (BI) integrations
Local development software to integrate with other capabilitiesEngineering capabilities supporting warehouse source dataBest available price/performance

Single choice 24) Which of the following describes the motivation for the creation of the data lakehouse? Select one response.

Organizations needed a single, flexible, high-performance system to support data, analytics, and machine learning workloads.
Organizations needed to reduce the costs of storing their open-format data files in the cloud.
Organizations needed to be able to develop increasingly complex machine learning workloads using a simple, SQL-based solution.
Organizations needed a way to scale their data lake workloads without investing in additional on-premises hardware.
Organizations needed a reliable data management system with transactional guarantees for their structured data.

Single choice 25) Which of the following describes how the Databricks Lakehouse Platform makes data governance simpler? Select one response.

Unity Catalog provides a different governance solution for each major Databricks Lakehouse Platform Service.
Unity Catalog provides a different governance solution for each cloud.
Unity Catalog provides a single governance solution across workload types and clouds.
Unity Catalog provides a different governance solution for each workload.
Unity Catalog provides a single governance solution fully managed by the Databricks team.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md
answers.md		answers.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Fundamentals of the Databricks Lakehouse Platform Accreditation

About

Uh oh!

Releases

Packages

harrydevforlife/DLP-accreditation

Folders and files

Latest commit

History

Repository files navigation

Fundamentals of the Databricks Lakehouse Platform Accreditation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages