Geetha et al., 2019 - Google Patents
Implementation and performance comparison of partitioning techniques in apache sparkGeetha et al., 2019
- Document ID
- 15320554277208345232
- Author
- Geetha J
- Harshit N
- Publication year
- Publication venue
- 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT)
External Links
Snippet
Apache spark is one of the most demanded frameworks for High performance computing of Big Data. Data is growing day by day to such a large extent that the power of existing analytical tool is not sufficient. The degree of parallelism achieved directly impacts the …
- 238000000638 solvent extraction 0 title abstract description 64
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30312—Storage and indexing structures; Management thereof
- G06F17/30321—Indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
- G06F17/30533—Other types of queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30386—Retrieval requests
- G06F17/30424—Query processing
- G06F17/30442—Query optimisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30575—Replication, distribution or synchronisation of data between databases or within a distributed database; Distributed database system architectures therefor
- G06F17/30584—Details of data partitioning, e.g. horizontal or vertical partitioning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30289—Database design, administration or maintenance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30943—Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type
- G06F17/30946—Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30067—File systems; File servers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Gu et al. | Memory or time: Performance evaluation for iterative operation on hadoop and spark | |
Xie et al. | Distributed power-law graph computing: Theoretical and empirical analysis | |
Li et al. | The strategy of mining association rule based on cloud computing | |
Roy et al. | Chaos: Scale-out graph processing from secondary storage | |
EP3314477B1 (en) | Systems and methods for parallelizing hash-based operators in smp databases | |
Lee et al. | Efficient and customizable data partitioning framework for distributed big RDF data processing in the cloud | |
Li et al. | A convergence of key‐value storage systems from clouds to supercomputers | |
Silberstein et al. | Efficient bulk insertion into a distributed ordered table | |
Dev et al. | HAR+: Archive and metadata distribution! Why not both? | |
Cai et al. | Memepic: Towards a unified in-memory big data management system | |
Premchaiswadi et al. | Optimizing and tuning MapReduce jobs to improve the large‐scale data analysis process | |
Gabert et al. | Elga: elastic and scalable dynamic graph analysis | |
Geetha et al. | Implementation and performance comparison of partitioning techniques in apache spark | |
Simsiri et al. | Work‐efficient parallel union‐find | |
Zhang et al. | GraphA: Efficient partitioning and storage for distributed graph computation | |
Li et al. | Accurate Counting Bloom Filters for Large‐Scale Data Processing | |
KR101451280B1 (en) | Distributed database management system and method | |
Abughofa et al. | Towards online graph processing with spark streaming | |
Kurt et al. | A fault-tolerant environment for large-scale query processing | |
Zhao et al. | A data locality optimization algorithm for large-scale data processing in Hadoop | |
Al Nuaimi et al. | A Novel Approach for Dual-Direction Load Balancing and Storage Optimization in Cloud Services | |
EL-SAYED et al. | Impact of small files on hadoop performance: literature survey and open points | |
Al Nuaimi et al. | Dual direction load balancing and partial replication storage of cloud DaaS | |
Bin et al. | An efficient distributed B-tree index method in cloud computing | |
Kumar et al. | Virtualization of large-scale data storage system to achieve dynamicity and scalability in grid computing |