16 questions across 4 exams
You are designing a globally distributed application using Azure Cosmos DB. The database is replicated across 4 regions. The application is a social media feed. When a user posts an update, it is critical that the user immediately sees their own post. However, it is acceptable if users in other regions see the post with a slight delay. You want to maximize read throughput and minimize latency. Which Cosmos DB consistency level should you recommend?
You are designing an Azure Cosmos DB database for an IoT application that tracks vehicle telemetry. The database will store millions of documents per day. You need to select a partition key. The most common query will retrieve all telemetry for a specific vehicle on a specific date. Which TWO characteristics describe a good partition key for this workload? (Select TWO)
Your company generates 10 TB of application log data daily. You need to design a solution to ingest, store, and analyze this data. The analytics team requires the ability to run highly complex, ad-hoc queries over billions of rows of time-series data with sub-second response times. They are proficient in Kusto Query Language (KQL). Which TWO Azure services are best suited for this architecture? (Select TWO)
You are architecting an e-commerce web application hosted on Azure App Service. During flash sales, the application experiences massive spikes in traffic, causing the backend Azure SQL Database to become overwhelmed with read requests for the product catalog. The product catalog data changes infrequently. Which service should you introduce to the architecture to improve performance and reduce the load on the database?
A social media startup is designing a feature to recommend friends to users based on mutual connections, shared interests, and geographic proximity. The data model requires traversing complex relationships with multiple degrees of separation (e.g., 'Find friends of friends who like hiking'). The queries must return results in milliseconds. Which Azure Cosmos DB API should you recommend?
A manufacturing company has 5,000 factory machines equipped with sensors. The sensors generate 10 TB of time-series telemetry data per day. Data analysts need to perform interactive, ad-hoc queries over the last 6 months of data to identify anomaly patterns. The queries often involve complex aggregations, time-windowing, and joining massive datasets. The solution must provide sub-second query performance. Which TWO Azure services should you combine to ingest and analyze this data? (Select TWO)
A legal firm has 2 million scanned PDF documents stored in Azure Blob Storage. These documents contain contracts, court filings, and evidence. Lawyers need a web portal to search for specific clauses, names, and organizations across all documents. Because the documents are scanned images, the text must be extracted automatically during the indexing process. Additionally, the solution must automatically identify and tag the names of people and organizations found in the text. Which Azure service should you recommend?
A small application needs to store 50 GB of simple user profile data (key-value pairs). The application is hosted in a single Azure region. The data structure is flat, and queries will only ever look up a user by their unique UserID. The primary architectural driver is to minimize monthly costs. Which data storage solution should you recommend?
You are designing an Azure Cosmos DB (NoSQL API) database for an IoT fleet management system. The system receives telemetry from 100,000 vehicles. Each vehicle sends a JSON document every minute. The most common query run by the application is: `SELECT * FROM c WHERE c.VehicleId = 'V-12345' AND c.Timestamp > '2023-01-01'`. You need to select a partition key that ensures even distribution of storage and throughput, and minimizes cross-partition queries. Which property should you choose as the partition key?
Your company is building a centralized logging and analytics platform for a fleet of autonomous drones. The drones generate 5 TB of time-series log data daily. Data scientists need to run complex, ad-hoc analytical queries over billions of rows of this telemetry data to identify flight anomalies. The queries must return results in seconds. Which TWO Azure services are best suited for storing and querying this massive volume of time-series log data? (Select TWO)
A legal firm has 10 million scanned PDF documents stored in Azure Blob Storage. You are designing a search solution using Azure Cognitive Search. Users must be able to search for specific keywords contained within the scanned images of the PDFs. Additionally, the solution must automatically identify and extract the names of people and organizations mentioned in the text. Which feature of Azure Cognitive Search must you configure?
You are designing a cloud-native application that requires a NoSQL key-value store. The application will store user session state data. The data volume is expected to be small (under 50 GB), and the application will only be deployed in a single Azure region. Cost optimization is the primary architectural driver, as the startup has a very limited budget. The application code uses the standard Azure Table Storage SDK. Which data storage solution should you recommend?
You are designing a fraud detection system that analyzes relationships between users, devices, IP addresses, and transactions. The system needs to traverse these relationships rapidly to identify complex fraud rings. Which Azure Cosmos DB API should you recommend?
A manufacturing company collects 50 TB of telemetry data daily from factory sensors. They need a data exploration service that can ingest this massive volume of time-series data with low latency and allow engineers to run ad-hoc analytical queries using a SQL-like language. Which service should you recommend?
You are designing a document search solution for a legal firm. They have millions of scanned PDF contracts stored in Azure Blob Storage. You need to extract text from the images within the PDFs, identify key entities (like names and organizations), and make the documents searchable via a web portal. Which TWO Azure services are required for this solution? (Select TWO)
A developer needs to store simple key-value data. The data volume is small, and the application requires a highly cost-effective solution. There is no requirement for global distribution, secondary indexes, or guaranteed sub-millisecond latency. Which storage solution should you recommend?
Full answers, grading, and explanations on why each answer is correct.