Sinha et al., 2021 - Google Patents
Introduction to data deduplication approachesSinha et al., 2021
- Document ID
- 12304011278264722146
- Author
- Sinha G
- Thwel T
- Mohdiwale S
- Shrivastava D
- Publication year
- Publication venue
- Data deduplication approaches
External Links
Snippet
In the field of computing, data deduplication methods are used to reduce duplicate copies and repeated data. The data storage and its utilization are improved by deduplication in addition to better network transfer. Due to reduced data storage requirement, the data …
- 238000000034 method 0 abstract description 20
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30067—File systems; File servers
- G06F17/30129—Details of further file system functionalities
- G06F17/3015—Redundancy elimination performed by the file system
- G06F17/30156—De-duplication implemented within the file system, e.g. based on file segments
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30067—File systems; File servers
- G06F17/30129—Details of further file system functionalities
- G06F17/3015—Redundancy elimination performed by the file system
- G06F17/30153—Redundancy elimination performed by the file system using compression, e.g. sparse files
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/06—Digital input from or digital output to record carriers, e.g. RAID, emulated record carriers, networked record carriers
- G06F3/0601—Dedicated interfaces to storage systems
- G06F3/0628—Dedicated interfaces to storage systems making use of a particular technique
- G06F3/0638—Organizing or formatting or addressing of data
- G06F3/064—Management of blocks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F21/00—Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Lillis et al. | Current challenges and future research areas for digital forensic investigation | |
| Mohebi et al. | Iterative big data clustering algorithms: a review | |
| US9626373B2 (en) | Optimizing data block size for deduplication | |
| US8898120B1 (en) | Systems and methods for distributed data deduplication | |
| WO2020228182A1 (en) | Big data-based data deduplication method and apparatus, device, and storage medium | |
| WO2022105497A1 (en) | Text screening method and apparatus, device, and storage medium | |
| Siddiqui et al. | Pseudo-cache-based IoT small files management framework in HDFS cluster | |
| Malhotra et al. | A survey and comparative study of data deduplication techniques | |
| Bhalerao et al. | A survey: On data deduplication for efficiently utilizing cloud storage for big data backups | |
| US20140304268A1 (en) | Optimized pre-fetch ordering using de-duplication information to enhance network performance | |
| Gharaibeh et al. | A gpu accelerated storage system | |
| Wang et al. | Chunk2vec: A novel resemblance detection scheme based on Sentence‐BERT for post‐deduplication delta compression in network transmission | |
| Sinha et al. | Introduction to data deduplication approaches | |
| Hare et al. | Practical scalable image analysis and indexing using Hadoop | |
| Jun et al. | In-storage embedded accelerator for sparse pattern processing | |
| Tan et al. | SAFE: a source deduplication framework for efficient cloud backup services | |
| Yin et al. | Content‐Based Image Retrial Based on Hadoop | |
| Wu et al. | A feature-based intelligent deduplication compression system with extreme resemblance detection | |
| CN118171298A (en) | A file deduplication method and device for encrypted images in container warehouses | |
| Tao et al. | Version-vector based video data online cloud backup in smart campus | |
| Kumar et al. | Differential Evolution based bucket indexed data deduplication for big data storage | |
| Vikraman et al. | A study on various data de-duplication systems | |
| Keskin et al. | Examining the importance of artificial intelligence in the singularization of big data with the development of cloud computing | |
| Agrawal et al. | Clustered outband deduplication on primary data | |
| Ammar et al. | Improved FTWeightedHashT apriori algorithm for Big Data using Hadoop-MapReduce model |