Compare the Top Data Deduplication Software in China as of July 2025

What is Data Deduplication Software in China?

Data deduplication software enables organizations to eliminate duplicate data from a data set in order to reduce the amount of redundant data in a dataset and reduce storage costs and utilization, as well as improve data quality. Compare and read user reviews of the best Data Deduplication software in China currently available using the table below. This list is updated regularly.

  • 1
    D&B Connect

    D&B Connect

    Dun & Bradstreet

    Realize the true potential of your first-party data. D&B Connect is a customizable, self-service master data management solution built to scale. Eliminate data silos across the organization and bring all your data together using the D&B Connect family of products. Benchmark, cleanse, and enrich your data using our database of hundreds of millions of records. The result is an interconnected, single source of truth that empowers your teams to make more confident business decisions. Drive growth and reduce risk with data you can trust. With a clean, complete data foundation, your sales and marketing teams can align territories with a full view of account relationships. Reduce internal conflict and confusion over incomplete or bad data. Strengthen segmentation and targeting. Increase personalization and the quality/quantity of marketing-sourced leads. Improve accuracy of reporting and ROI analysis.
    View Software
    Visit Website
  • 2
    ArchiverFS

    ArchiverFS

    MLtek Limited

    The file archiving solution for servers and network storage systems that lets you use any device as second tier storage. Featuring a tiny footprint on the host system along with full support for cloud, DFS, replication, de-duplication, and compression ArchiverFS lets you use any NAS, SAN or cloud platform as storage for your old unstructured files. If you can share it to the network with a UNC path and format it with NTFS then you can use it as second line storage. At no point do we use a database to store files, pointers to files or file meta data. ArchiverFS uses pure NTFS from start to finish. ArchiverFS lets you move your old unused files on-mass from you primary first tier storage to secondary storage whilst persisting all file attributes, permissions and directory structures. A selection of links can be left behind in place of old files that have been moved including completely seamless symbolic links that look and behave just like the original file.
    Starting Price: $1590.00/year
  • 3
    WinPure Clean & Match
    WinPure Clean & Match is WinPure’s award-winning data cleansing and data matching software suite, specially designed to increase the accuracy of business or consumer data. This software suite is ideal for cleaning, correcting and deduplicating mailing lists, databases, spreadsheets and CRMs. WinPure™ Clean & Match will help save your business time and money. * Increase the accuracy of virtually ANY list, spreadsheet, database, CRM, etc. * Locally installed Windows software so no need to worry about security as all processing is done on your own systems * Save hours of valuable time cleaning and removing duplicated records from your lists or databases using built-in sophisticated fuzzy and phonetic match algorithms. * Affordable licences available with World Class Support & Training. * Free Demo with Live Online Training available.
    Starting Price: $999
  • 4
    Druva

    Druva

    Druva

    Druva Data Security Cloud is a leading SaaS-first platform that delivers comprehensive data protection and rapid recovery across cloud, hybrid, and endpoint environments. It offers zero-trust security, AI-powered threat detection, and automated ransomware recovery to safeguard critical business data. Designed for modern enterprises, Druva enables scalable, cost-efficient backup and compliance with industry standards such as SOC2, HIPAA, and FedRAMP—all managed from a single secure cloud platform.
    Starting Price: $4 per user per month
  • 5
    Narrative

    Narrative

    Narrative

    Create new streams of revenue using the data you already collect with your own branded data shop. Narrative is focused on the fundamental principles that make buying and selling data easier, safer, and more strategic. Ensure that the data you access meets your standards, whatever they may be. Know exactly who you’re working with and how the data was collected. Easily access new supply and demand for a more agile and accessible data strategy. Own your data strategy entirely with end-to-end control of inputs and outputs. Our platform simplifies and automates the most time- and labor-intensive aspects of data acquisition, so you can access new data sources in days, not months. With filters, budget controls, and automatic deduplication, you’ll only ever pay for the data you need, and nothing that you don’t.
    Starting Price: $0
  • 6
    Match2Lists

    Match2Lists

    Match2Lists

    Match2Lists is the fastest, easiest and most accurate way to Match, Merge and De-duplicate your data. With Our Match2D&B option, you can enrich your data with Dun & Bradstreet information on-demand. In just minutes, you can cleanse your data of duplicates and blend raw data from different sources into powerful information. Our first objective is maximum match results for our customers. Prior to creating Match2Lists, we ran analytics and data visualisation companies and used most "fuzzy" matching software on the market. Unsatisfied by their low match results, we spent 10 years developing the most advanced data matching logic. Our second objective is time: enable our customers to spend less time matching and cleansing data and more time analysing and executing. So we implemented our advanced matching logic on the fast in-memory cloud computing architecture we could find, capable of matching 200 million records in 30 seconds.
    Starting Price: $95 per month
  • 7
    Duplicate Search and Merge
    Duplicate Search and Merge is a native deduplication application built for Salesforce. It is an easy to use deduplication tool which cleanses the duplicate records using a simple yet powerful 5 step wizard-based approach to search duplicates on standard and custom objects.
    Starting Price: $99
  • 8
    Flowcore

    Flowcore

    Flowcore

    The Flowcore platform provides you with event streaming and event sourcing in a single, easy-to-use service. Data flow and replayable storage, designed for developers at data-driven startups and enterprises that aim to stay at the forefront of innovation and growth. All your data operations are efficiently persisted, ensuring no valuable data is ever lost. Immediate transformations and reclassifications of your data, loading it seamlessly to any required destination. Break free from rigid data structures. Flowcore's scalable architecture adapts to your growth, handling increasing volumes of data with ease. By simplifying and streamlining backend data processes, your engineering teams can focus on what they do best, creating innovative products. Integrate AI technologies more effectively, enriching your products with smart, data-driven solutions. Flowcore is built with developers in mind, but its benefits extend beyond the dev team.
    Starting Price: $10/month
  • 9
    Nucleus

    Nucleus

    Nucleus

    Nucleus is a data management platform designed to streamline and automate the handling of customer and operational data across various systems. It enables users to connect and link similar records through smart matching, utilizing exact and fuzzy matching techniques with customizable auto-match thresholds. It allows for the definition of trigger-based rules to automatically address data conflicts, duplications, and the emergence of new or missing records, ensuring consistent and reliable data across integrations. Nucleus supports the development of automations that update or send notifications based on detailed contact and revenue criteria, aiding in the maintenance of a comprehensive data strategy. It also facilitates the management of data loading and large-scale updates, aligning with multiple integration sources.
    Starting Price: $160 per month
  • 10
    Barracuda Backup

    Barracuda Backup

    Barracuda Networks

    Don't let criminals hold your data hostage. With Barracuda, recovering your data is as simple as eliminating the malware, deleting the criminally encrypted files, and restoring a good copy of your valuable data. Get your systems restored and running quickly from physical appliances, virtual servers, offsite locations, or the cloud. Today's IT environments combine physical servers, virtual servers and public cloud data which all need full protection. Important data also resides in mail servers which may have limited retention policies. Barracuda protects your data no matter where it is located. Today's complex infrastructures and targeted cyber-attacks require a complete backup strategy that protects data wherever it resides— on‑premises or in the cloud. Simple to configure and manage, Barracuda Backup is truly a "set it and forget it" solution for total peace of mind.
    Starting Price: $999 one-time payment
  • 11
    Dedup-Manager
    Clean your data en masse and automatically, avoid duplicate records and duplicate work. ZaapIT enables CRM admins and power-users to clean any kind of duplicated data (same-object and cross-objects) en masse and automatically. All you need to do is to setup a set of rules and let the app process the data for you.
    Starting Price: $328/user/year
  • 12
    HybriStor

    HybriStor

    Neverfail

    HybriStor delivers deduplication across sites, replication to multiple sites and WAN optimization between sites. This groundbreaking secondary storage globally dedupes data by rates up to 30:1 - moving backup, archive and recovery data off expensive primary storage and onto high-performance, low-cost secondary storage. Solving your data storage growth problems just got easier, enabling you to meet blazing fast recovery requirements on-premise, across sites, and even into the cloud while reducing storage costs.
  • 13
    Unitrends MSP
    Attack the downtime problem without the hassle and anxiety of legacy backup. Switch to a solution built on 30 years of innovation with no upfront cost – making the promise of cloud economics achievable for every MSP. The Unitrends MSP Portal is built to give you complete visibility into your entire backup universe so you can monitor and manage everything from one place. Who has time to manage backups all day? The Unitrends MSP Portal is tightly focused on helping you address problems so you can get in, get out, and get on with your day. BackupIQTM uses artificial intelligence to surface the most important issues so you can feel confident that your technicians are working on the right things all the time. Automatically send beautiful reports every week, month, or quarter so your customers rest easy knowing they’ve got a stellar team and world class technology keeping their business up and running.
  • 14
    DataGroomr

    DataGroomr

    DataGroomr

    Deduplicate Salesforce the Easy Way. DataGroomr leverages Machine Learning to detect duplicate Salesforce records automatically. Duplicate records are loaded into a queue for users to compare records side-by-side, select which values to retain, append new values and merge. DataGroomr has everything you need to find, merge and get rid of dupes for good. No need to set up complex rules, DataGroomr's Machine Learning algorithms do the work for you. Conveniently merge duplicate records as-you-go or merge en masse, all directly from within the app. Select field values for master record or use inline editing to define new values as you deduplicate. Don't want to review org-wide duplicates? Define your own dataset by region, industry or any Salesforce field. Leverage the import wizard to deduplicate, merge and append records while importing to Salesforce. Set up automated duplication reports and mass merge tasks at a frequency that fits your schedule.
    Starting Price: $99 per user per year
  • 15
    LeadAngel

    LeadAngel

    LeadAngel

    LeadAngel smart-matches incoming leads with existing accounts and distributes leads among your sales team using the most powerful & flexible lead routing and lead matching algorithm available. We as a team helps your business to drive sales with the automated lead management. The application offers data standardization, fuzzy matching, lead segmentation, Contact Routing and Account Routing and lead to account matching in a user-friendly interface with smart drag or drop options. The solutions are built with API's to help you leverage everything our platform has to offer. Eliminating duplicate leads, merge with existing contacts, and removing redundant accounts with LeadAngel’s powerful data cleanup engine and track the entire procedure with LeadAngel's reporting where each and every step is visible. Further optimize your sales funnel with tools such as auto conversion of leads into contacts if a matching account is found.
  • 16
    LinkageWiz

    LinkageWiz

    LinkageWiz

    Powerful Probabilistic Data Matching algorithms are used, using common identifiers such as name, date of birth, sex, address, SSN, business name and many others. Data can be imported from a wide range of desktop and corporate database systems. Data matching software will enable the detection of up to 99% or higher of all potential matches. For business this can represent considerable extra potential revenue or cost savings, increased fraud detection and, for medical research can mean the difference between a successful research project and one that failed to report any significant findings. LinkageWiz is fast, user friendly and represents outstanding value as it bundles many of the features provided by many other separate products into a single stand-alone package.
    Starting Price: $199 one-time payment
  • 17
    Plauti

    Plauti

    Plauti

    A complete data management platform native to Salesforce and Microsoft Dynamics. Verify, deduplicate, and unify siloed data. Execute smart single-click actions and intelligently assign any record, all within your CRM. Plauti is a Salesforce-native data management platform designed to ensure your customer data is accurate, complete, and actionable. It offers a seamless integration with Salesforce to verify, deduplicate, manipulate, and assign records automatically, empowering your teams to make faster, smarter decisions. Plauti’s end-to-end data orchestration ensures that your records are validated and routed correctly, enabling businesses to trust their CRM data at every stage of the record’s lifecycle. With Plauti, you can automate processes, maintain data integrity, and deliver better results without relying on external tools.
  • 18
    StarDQ

    StarDQ

    Starcom Information Technology

    A powerful, real time enterprise solution for Cleansing, De-duping, and enriching the data. By integrating StarDQ Data Validation Solution, organizations can cleanse, match and unify data across multiple data sources and data domains, to create a strategic, trustworthy, valuable asset that enhances decision making power, reduce expenses and ensure seamless customer interaction. StarDQ Self-Service Data Quality Empowers business users to quickly prepare data sets with a visual, interactive interface that is designed for ease of use and suggests one-click fixes for inaccurate, incomplete, and duplicate data. Give business users, data stewards, and IT business analysts quick access to a set of easy-to-use data integration, Reusable Cleansing & De-duplication rules to improve the value of data efficiently.
  • 19
    Quantum DXi
    High-performance, scalable backup appliances for data protection, cyber and disaster recovery. The requirements for protecting data across the Enterprise continue to get more complex. Our customers are managing massive data growth across databases, virtual environments, and unstructured data sets. They need to meet or exceed service level agreements (SLAs) to the business, both recovery time objective (RTO) and recovery point objective (RPO), with budgets that aren’t growing nearly as fast as storage requirements. And data protection itself has become more demanding, with requirements to protect against operational issues, protect data across sites, provide solutions for disaster recovery and against ransomware and other forms of cyber attacks. The DXi® series backup appliances provide a uniquely powerful solution for meeting your backup needs, SLA requirements, and cyber recovery efforts.
  • 20
    DataMatch

    DataMatch

    Data Ladder

    DataMatch Enterprise™ solution is a highly visual data cleansing application specifically designed to resolve customer and contact data quality issues. The platform leverages multiple proprietary and standard algorithms to identify phonetic, fuzzy, miskeyed, abbreviated, and domain-specific variations. Build scalable configurations for deduplication & record linkage, suppression, enhancement, extraction, and standardization of business and customer data and create a Single Source of Truth to maximize the impact of your data across the enterprise.
  • 21
    Cloudingo

    Cloudingo

    Symphonic Source

    From deduping to importing and even migrating data, Cloudingo makes it super easy to manage your customer data. Salesforce is great for managing customers. But it misses the mark when it comes to data quality. Customer data that doesn’t make sense, duplicate records, reports that are a little… off. Sound familiar? Merging dupes one-by-one, native solutions, custom code, and spreadsheets can only go so far. You shouldn’t have to think twice about the quality of your customer data. Or spend lots of time cleaning and managing Salesforce. You’ve spent too long risking relationships, losing opportunities, and dealing with clutter. It’s time to fix it. Imagine a tool, just one, that turns your dirty, confusing, unreliable Salesforce data into an efficient, lead-nurturing, sales-producing machine.
    Starting Price: $1096 per year
  • 22
    Veritas NetBackup

    Veritas NetBackup

    Veritas Technologies

    Optimized for the multicloud, extensive workload support, and ensured operational resiliency. Ensure data integrity, monitor your environment, and recover at scale to optimize your resilience. Resiliency. Migration. Snapshot orchestration. Disaster recovery. Unified, end-to-end deduplication. One solution manages it all. The most VMs protected, recovered, and moved to the cloud. Protect VMware, Microsoft Hyper-V, Nutanix AHV, Red Hat Virtualization, AzureStack and OpenStack with automated protection and instant access to VM data via flexible recovery. At-scale disaster recovery with near-zero RPO and RTO. Protect your data with 60+ public cloud storage targets, an automated, SLA-driven resiliency platform, and a new supported integration with NetBackup. Get scale-out protection for petabyte-scale workloads with hundreds of data nodes. Use NetBackup Parallel Streaming, a modern parallel streaming agentless architecture.
  • 23
    DemandTools

    DemandTools

    Validity

    The #1 global data quality tool thousands of Salesforce administrators trust. Improve overall productivity in managing large data sets. Identify and deduplicate data within any database table. Perform multi-table mass manipulation and standardization of Salesforce objects. Bolster Lead conversion with a robust, customizable toolset. With its feature-rich data quality toolset, you can use DemandTools to cleanse, standardize, compare records, and more. With Validity Connect, you will have access to the EmailConnect module to verify email addresses on Contacts and Leads in bulk. Manage all aspects of your data in bulk with repeatable processes instead of record by record or need by need. Dedupe, standardize, and assign records automatically as they come in from spreadsheets, end user entry, and integrations. Get clean data to improve the performance of sales, marketing, and support, as well as the revenue and retention they generate.
  • 24
    Dell EMC Avamar
    Dell EMC Avamar enables fast, efficient backup and recovery through its integrated variable-length deduplication technology. Avamar is optimized for fast, daily full backups of physical and virtual environments, NAS servers, enterprise applications, remote offices and desktops/laptops. Avamar is available as a virtual edition or as a component of Dell EMC Data Protection Suite, which offers you a complete suite of data protection software options. Backup and recovery optimized for virtual environments. Enables application-consistent recovery of enterprise applications. Uses variable-length deduplication for high performance and lower cost. Provides intuitive centralized management and encryption for data security. Dell Technologies On Demand delivers the industry's broadest end-to-end portfolio of consumption-based and as-a-service solutions ideally suited for the way on-premises infrastructure and services are consumed in the on-demand economy.
  • 25
    FalconStor

    FalconStor

    FalconStor Software

    FalconStor is a trusted data protection innovator with over an Exabyte of data under management, enabling the world’s most demanding enterprises to modernize their data backup and archival operations across data centers and public clouds. The company delivers increased data security and provides the fast recovery from ransomware attack while driving down costs by up to 90 percent. FalconStor is trusted by 1,000 customers and a network of partners around the world.
  • 26
    tye.io

    tye.io

    tye GmbH

    tye is a Software-as-a-Service (SaaS) personal assistant that helps companies keep the contact information of their customers up-to-date.
  • 27
    Binary Demand

    Binary Demand

    Binary Demand

    Data is the fuel to any successful sales and marketing strategy. Data deteriorates by 2% every month. The relevance of your data collated via email marketing naturally degrade by about 22.5% every year. The absence of accurate data can make or break a business’s marketing strategy. Therefore, the need of an accurate live database becomes indispensable. Binary Demands’ global contact database can help you overhaul your marketing campaigns and strategies. Your collated data deteriorates over a period of time. Binary Demand provides custom solutions to prevent wastage of your data by making up for its natural degradation. Our customised data solutions include standardisation, de-duping, cleansing, verification etc. This helps in creating a list of probable customers based of criterias such as geography, company size, job titles, industry, etc. Our high accuracy and low cost model makes us the best ROI generating list partner in the marketplace.
  • 28
    Dell EMC PowerProtect Data Manager
    Protect data and deliver governance control for modern cloud workloads across your evolving physical, virtual and cloud environments. Address ever-changing growth and IT complexity by leveraging Dell EMC’s software defined data protection platform. PowerProtect Data Manager delivers next generation data protection that enables faster IT transformation, while giving you the assurance that you can easily safeguard and quickly unlock your data’s value. Dell EMC PowerProtect Data Manager provides software defined data protection, automated discovery, deduplication, operational agility, self-service and IT governance for physical, virtual and cloud environments. PowerProtect Data Manager offers efficient data protection capabilities leveraging the latest evolution of Dell EMC trusted protection storage architecture.
  • 29
    Datactics

    Datactics

    Datactics

    Profile, cleanse, match and deduplicate data in drag-and-drop rules studio. Lo-code UI means no programming skill required, putting power in the hands of subject matter experts. Add AI & machine learning to your existing data management processes In order to reduce manual effort and increase accuracy, providing full transparency on machine-led decisions with human-in-the-loop. Offering award-winning data quality and matching capabilities across multiple industries, our self-service solutions are rapidly configured within weeks with specialist assistance available from Datactics data engineers. With Datactics you can easily measure data to regulatory & industry standards, fix breaches in bulk and push into reporting tools, with full visibility and audit trail for Chief Risk Officers. Augment data matching into Legal Entity Masters for Client Lifecycle Management.
  • 30
    KLDiscovery

    KLDiscovery

    KLDiscovery

    KLDiscovery uses a proprietary processing application that is fast, robust and propels your processing to new levels. And because we can simultaneously deploy multiple instances of our application, we can process massive amounts of data in a fraction of the time required with other applications. We commonly process several terabytes of data in a single week. KLDiscovery can significantly reduce the overall data size by utilizing our integrated deduplication engine. This powerful tool can sweep away redundant documents by comparing custom hash values, calculated from the metadata contained within any number of up to fourteen separate fields. Because all deduplication activity gets captured within comprehensive reporting features built-in to our application, this defensible process is always tracked, recoverable and reproducible. The ability to process large volumes of data is only half the story.
  • Previous
  • You're on page 1
  • 2
  • Next