Sinha et al., 2021 - Google Patents

Introduction to data deduplication approaches

Document ID
12304011278264722146
Author
Sinha G
Thwel T
Mohdiwale S
Shrivastava D
Publication year
2021
Publication venue
Data deduplication approaches

Snippet

In computing, data deduplication methods reduce duplicate copies and repeated data. Deduplication improves storage utilization and also makes network transfer more efficient. Due to reduced data storage requirement, the data …
Continue reading at www.sciencedirect.com (other versions)
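
As a rough illustration of the idea in the snippet above, the following is a minimal sketch of fixed-size chunk deduplication in Python. It is not taken from Sinha et al.; the function names `deduplicate` and `restore`, the 4-byte chunk size, and the choice of SHA-256 fingerprints are all assumptions made for this example.

```python
import hashlib


def deduplicate(data: bytes, chunk_size: int = 4):
    """Split data into fixed-size chunks and keep each distinct chunk once.

    Hypothetical helper for illustration only; real systems often use
    variable-size (content-defined) chunking instead.
    """
    store = {}   # fingerprint -> chunk bytes (each unique chunk stored once)
    recipe = []  # ordered fingerprints needed to rebuild the original data
    for start in range(0, len(data), chunk_size):
        chunk = data[start:start + chunk_size]
        digest = hashlib.sha256(chunk).hexdigest()
        store.setdefault(digest, chunk)  # store only the first copy
        recipe.append(digest)
    return store, recipe


def restore(store, recipe) -> bytes:
    """Rebuild the original byte string from the chunk store and recipe."""
    return b"".join(store[digest] for digest in recipe)


data = b"ABCDABCDABCDWXYZ"
store, recipe = deduplicate(data)
stored_bytes = sum(len(chunk) for chunk in store.values())
```

Here the three repeated `ABCD` chunks are stored once, so the chunk store holds fewer bytes than the input while `restore(store, recipe)` still returns the original data. The same fingerprints could in principle be exchanged before a network transfer so that only missing chunks are sent.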

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30067 File systems; File servers
    • G06F17/30129 Details of further file system functionalities
    • G06F17/3015 Redundancy elimination performed by the file system
    • G06F17/30156 De-duplication implemented within the file system, e.g. based on file segments
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30067 File systems; File servers
    • G06F17/30129 Details of further file system functionalities
    • G06F17/3015 Redundancy elimination performed by the file system
    • G06F17/30153 Redundancy elimination performed by the file system using compression, e.g. sparse files
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286 Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30386 Retrieval requests
    • G06F17/30424 Query processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30861 Retrieval from the Internet, e.g. browsers
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/3061 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Error detection; Error correction; Monitoring responding to the occurrence of a fault, e.g. fault tolerance
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for programme control, e.g. control unit
    • G06F9/06 Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06 Digital input from or digital output to record carriers, e.g. RAID, emulated record carriers, networked record carriers
    • G06F3/0601 Dedicated interfaces to storage systems
    • G06F3/0628 Dedicated interfaces to storage systems making use of a particular technique
    • G06F3/0638 Organizing or formatting or addressing of data
    • G06F3/064 Management of blocks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F21/00 Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F7/00 Methods or arrangements for processing data by operating upon the order or content of the data handled
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints

Similar Documents

Publication Publication Date Title
Lillis et al. Current challenges and future research areas for digital forensic investigation
Mohebi et al. Iterative big data clustering algorithms: a review
US9626373B2 (en) Optimizing data block size for deduplication
US8898120B1 (en) Systems and methods for distributed data deduplication
WO2020228182A1 (en) Big data-based data deduplication method and apparatus, device, and storage medium
WO2022105497A1 (en) Text screening method and apparatus, device, and storage medium
Siddiqui et al. Pseudo-cache-based IoT small files management framework in HDFS cluster
Malhotra et al. A survey and comparative study of data deduplication techniques
Bhalerao et al. A survey: On data deduplication for efficiently utilizing cloud storage for big data backups
US20140304268A1 (en) Optimized pre-fetch ordering using de-duplication information to enhance network performance
Gharaibeh et al. A gpu accelerated storage system
Wang et al. Chunk2vec: A novel resemblance detection scheme based on Sentence‐BERT for post‐deduplication delta compression in network transmission
Sinha et al. Introduction to data deduplication approaches
Hare et al. Practical scalable image analysis and indexing using Hadoop
Jun et al. In-storage embedded accelerator for sparse pattern processing
Tan et al. SAFE: a source deduplication framework for efficient cloud backup services
Yin et al. Content‐Based Image Retrieval Based on Hadoop
Wu et al. A feature-based intelligent deduplication compression system with extreme resemblance detection
CN118171298A (en) A file deduplication method and device for encrypted images in container warehouses
Tao et al. Version-vector based video data online cloud backup in smart campus
Kumar et al. Differential Evolution based bucket indexed data deduplication for big data storage
Vikraman et al. A study on various data de-duplication systems
Keskin et al. Examining the importance of artificial intelligence in the singularization of big data with the development of cloud computing
Agrawal et al. Clustered outband deduplication on primary data
Ammar et al. Improved FTWeightedHashT apriori algorithm for Big Data using Hadoop-MapReduce model