Learn how to build a fast and scalable data system on Azure Hyperscale (Citus) and Cosmos DB

While consumers demand organizations to be extremely responsive, agile and data-driven, enterprises need applications deployed in data centers close to their users. Therefore, it is critical that applications respond in real time, store exponentially growing pools of data, and present data to users in a fraction of a second.

As data volume and data consumption requirements increase, the database server must be scalable in terms of compute and storage. Fully managed data services can power these workloads within milliseconds of response time and ensure scalability and speed.

To understand the potential and high scalability of the most advanced open source databases, Microsoft and YourStory jointly hosted a webinar titled “Building High Performance Applications with Unlimited Scale on a Hyperscale Database” with panelists Brian McKerrTechnical specialist for open source database, Microsoft Azure and Srikant SridharSenior Specialist – Azure Cosmos DB.

Improve application performance with Postgres on Azure DB

“We have three options for deploying Postgres on Azure: a single-node database with advanced security and availability, a flexible server (a single-node Postgres database) taking into account the features and performance of the enterprise, and Hyperscale Citus (a scalable distributed Postgres database) to scale thousands of nodes and petabytes of disk storage,” Brian said.

Discussing the key features of Citus, Brian indexed managing huge infrastructure with security backups, high availability, disaster recovery, integration, scaling, and performance.

It virtually demonstrated Citus’ work on how high-performance, large-scale applications can deploy capacity and resources with connection pooling to store terabytes to petabytes. He talked about Azure Data Ecosystem including Azure Data Factory, Azure Functions and Azure App Service for easy deployment and automated scripting options using SQL queries.

“The solutions are offers where the entire database is distributed. And, depending on your application, we give you the opportunity to choose an appropriate model. We can rebalance live data non-disruptively to handle shard movements from nodes to new nodes, allowing us to distribute data evenly. Additionally, deploying a tenant isolation feature allows you to dedicate isolated resource capacity to specific tenants,” added Brian.

Hyperscale (Citus) on Azure Database

According to Brian, Hyperscale (Citus) is a built-in option in Azure Database for PostgreSQL managed services that focuses on application performance rather than spending time on database management.

“One of the features of Citus 10 for analytics workloads is columnar compression to store data in column-based tables. Another feature is Read Replicas, asynchronous replication of your database to offload analytical workloads by copying data,” he added.

Among many features, Brian highlighted a management pgBouncer to scale massive connections, an ability to configure custom maintenance windows to provide full control when performing platform maintenance, easier fragmented key run command, and MX (distributed metadata with linear performance improvements) allowing applications and users to connect to any node in a cluster than approach through the coordinator.

He showcased some scalability demanding use cases and projects that have proven to simplify system architecture, break down skill barriers, and meet business expectations.

How to select the right API in Azure Cosmos DB

“Cosmos DB is the fastest NoSQL database available on the market. It is a multimodal NoSQL database and can support multiple open APIs for your workload needs. Cosmos DB can be used as a document-based database, column-based database, graph database, or key-value pair database as needed,” Srikant noted.

Cosmos DB offers sub-10 millisecond latency, unlimited scale as storage to build scalable, high-performance applications with availability in many regions. For application development and the technical community, Cosmos SDKs support major programming languages ​​such as .NET, Java, Python, PHP, and Nodejs.

“It’s highly scalable to handle big data workloads. With high availability and geo-replication functionality, you can replicate data to any part of the world seamlessly, attract users from different geographies, and have turnkey global replication capability,” he added.

According to him, Cosmos DB has many applications in several sectors; for example, e-commerce, supply chain, financial sector, retail, telecommunications, gaming and many more.

Key differentiators such as flexibility, scalability, real-time analysis of data stored on Cosmos DB at minimal cost, predictive maintenance, consistency, availability, etc. make Cosmos DB an efficient product to meet the requirements of creating any scalable application.


Comments are closed.