Author: Siddon Tang (Chief Architect at PingCAP)
Transcreator: Caitin Chen; Editor: Tom Dewan
Today, I'm proud to announce that TiDB 4.0 has reached general availability (GA). TiDB is an elastic, real-time Hybrid Transactional/Analytical Processing (HTAP) database, and, best of all, it's now ready for the cloud.
The first half of 2020 has seen a world of uncertainty and unpredictability, and it's high time for IT technology to keep up, especially for databases because it's the foundation of every business in the world. In a previous post, our CTO Ed Huang shared his thoughts about the future of databases. We believe that the future of the database is about unification, adaptiveness, and intelligence. As you may have seen in our TiDB 4.0 preview, our new release is a big step closer to achieving that vision.
TiDB 4.0 has made great progress in its stability, ease of use, performance, and cloud-native services. New features in version 4.0 also let TiDB support more application scenarios and many of our users have been testing and adopting the 4.0 version in their production environments. Now we believe the TiDB 4.0 GA version has become the right database solution to prepare you for the unpredictable world.
TiDB 4.0 is a real-time HTAP, truly elastic, cloud-native database, which meets your application requirements in various scenarios. In fact, TiDB 4.0 can do the work of three different databases:
TiDB 4.0 is a one-stop solution for both Online Transaction Processing (OLTP) and Online Analytical Processing (OLAP) applications to process HTAP workloads. Even if you are not sure whether your application is OLTP or OLAP, you can efficiently run your application on TiDB.
TiFlash is an extended analytical engine and columnar store for TiDB. A TiDB database that incorporates TiFlash lets you perform real-time HTAP analytics.
TiFlash offers you:
Serverless computing is a very important concept in the field of cloud services. In TiDB 4.0, we not only support real-time HTAP analytics in the cloud, but also implement the first version of an elastic scheduling mechanism. This turns TiDB into a truly serverless database in the cloud.
Now, you only need to deploy your TiDB cluster in the cloud (or your own Kubernetes cluster) with the minimum cluster topology and to configure rules. For example, one rule could be that when TiDB CPU usage exceeds 50%, a TiDB node is automatically added. Then, based on your own application load, TiDB will automatically:
This feature is new in TiDB 4.0. We believe it will be the cornerstone of many product possibilities in the future.
Compared with version 3.0, TiDB 4.0 has achieved a significant performance boost. For OLTP scenarios, the Sysbench and TPC-C benchmarks have improved by 30% to 50%. For OLAP scenarios, the TPC-H performance has increased by about 100% on average over a set of 22 queries. In addition, TiFlash greatly enhances TiDB's real-time analytics capabilities without affecting OLTP tasks.
TiDB versions to compare:
TiDB 3.0.13 vs. TiDB 4.0.0
Hardware configuration:
Component | EC2 type | Instance count |
Placement Driver (PD) | AWS m5.xlarge | 3 |
TiKV | AWS i3.4xlarge | 3 |
TiDB | AWS c5.4xlarge | 3 |
We ran Sysbench tests on 16 tables, each with 10 million rows of data. For more detailed information on the system configurations and the tests we ran, see the Sysbench Performance Test Report.
The tests showed that compared with version 3.0, TiDB 4.0's Point Select performance increased by about 14%, and read-write performance increased by 31%.
In the TPC-C test, we found that TiDB 4.0 performed about 50% better. For test details, see the TiDB TPC-C Performance Test Report.
We ran TPC-H queries on TiDB 3.0.13 and TiDB 4.0 to compare their OLAP capabilities. For test details, see the TiDB TPC-H Performance Test Report – v4.0 vs. v3.0.
Because TiDB 4.0 introduces TiFlash to strengthen TiDB's HTAP capabilities, our test objects were:
The test results showed that TiDB's query performance notably improved—about 100% on average.
TiDB 4.0 also has many new features and improvements, in terms of security, ecosystem, and features.
TiDB 4.0 introduces TiUP, a component manager that streamlines installing and configuring a TiDB cluster into a few easy commands. With TiUP, you can get your cluster up in just one minute! For details, see Deploy a TiDB Cluster Using TiUP.
It's difficult to troubleshoot issues in a distributed database because system information is scattered among different machines. TiDB 4.0 offers TiDB Dashboard, a graphical interface with various built-in widgets that lets you easily diagnose, monitor, and manage your clusters. You can read these blog posts to learn more:
As users store more and more data in TiDB, quickly backing up and restoring data becomes a big challenge. In TiDB 4.0, we provide Backup & Restore (BR), a distributed backup and restore tool that offers high backup and restore speeds—1 GB/s or more for 10 TB of data. BR already supports backup to AWS S3 and will soon support backup to Google Cloud Storage. To get more details, see our BR documentation and blog post.
In TiDB 4.0, we introduce a new TiDB change data capture framework, TiCDC. This open-source feature replicates TiDB's incremental changes to downstream platforms. For more details, see TiCDC: Replication Latency in Milliseconds for 100+ TB Clusters.
This post shows only a few of the highlights in TiDB 4.0. For a full list of features and improvements, check out the TiDB 4.0 GA Release Notes. If you're running an earlier version of TiDB and want to try 4.0, read Upgrade TiDB Using TiUP. You're also welcome to join our community on Slack and send us your feedback.
This release is a big step forward for TiDB on its way to becoming “the database of the future.” We'd like to thank all our contributors and TiDB users who have helped us shape TiDB into what it is today.