8 questions across 2 exams
You are designing an Azure Cosmos DB database for a multi-tenant IoT application. The application receives telemetry from millions of devices across 5,000 different corporate customers (tenants). You need to select a partition key for the telemetry container.
Requirements:
- Queries frequently filter by TenantId and DeviceId.
- Data must be evenly distributed across logical partitions to avoid hot partitions.
- Some large tenants generate more than 20 GB of telemetry data per month.
Which partition key strategy should you recommend?
A logistics company wants to analyze GPS tracking data from its fleet of 100,000 trucks. The trucks send location and engine telemetry every 5 seconds. The data engineering team needs to perform complex, ad-hoc time-series analysis on this massive volume of append-only log data to identify route inefficiencies. The query language must support advanced analytics, windowing, and text search. Which Azure data service is best suited for this workload?
A legal firm has 5 million scanned PDF documents stored in Azure Blob Storage. They are building an internal search portal. The solution must meet the following requirements:
- Extract printed text (OCR) from the scanned PDFs.
- Identify key entities (people, organizations, locations) within the text.
- Provide a searchable index that users can query via a web application.
Which TWO Azure services should you combine to build this solution? (Select TWO)
You are designing a simple key-value store for a web application. The application requires storing user session state data. The data volume is small (under 50 GB), the query patterns are extremely simple (lookup by PartitionKey and RowKey), and the primary business requirement is to minimize costs. Global distribution is not required. Which storage solution should you recommend?
You are designing a globally distributed social media application using Azure Cosmos DB. Users frequently post updates and immediately refresh their feeds. A user must always see their own posts immediately after writing them. However, it is acceptable if other users across the globe see the post with a slight delay. You need to select a consistency level that meets these requirements while maximizing performance and availability. Which consistency level should you recommend?
A manufacturing company generates 10 TB of telemetry data daily from sensors on its assembly lines. The data is time-series in nature. Engineers need to run complex, ad-hoc analytical queries over billions of rows of this data to identify anomalies and optimize machine performance. The queries must return results in seconds. Which Azure data service is best optimized for this scenario?
A legal firm has millions of scanned PDF documents stored in Azure Blob Storage. Lawyers need to search through these documents using natural language queries. The solution must automatically extract text from the scanned images (OCR), identify key entities (people, organizations, locations), and make the data searchable via a web application. Which Azure service should you recommend as the core search engine?
You are migrating a legacy application that uses Azure Table Storage. The application has grown significantly. It now requires global distribution, single-digit millisecond latency for reads and writes, and automatic secondary indexes. You decide to migrate to Azure Cosmos DB for Table. Which TWO benefits does Cosmos DB for Table provide over standard Azure Table Storage? (Select TWO)
Full answers, grading, and explanations of why each answer is correct.