+

Geetha et al., 2019 - Google Patents

Implementation and performance comparison of partitioning techniques in apache spark

Geetha et al., 2019

Document ID
15320554277208345232
Author
Geetha J
Harshit N
Publication year
Publication venue
2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT)

External Links

Snippet

Apache spark is one of the most demanded frameworks for High performance computing of Big Data. Data is growing day by day to such a large extent that the power of existing analytical tool is not sufficient. The degree of parallelism achieved directly impacts the …
Continue reading at ieeexplore.ieee.org (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30312Storage and indexing structures; Management thereof
    • G06F17/30321Indexing structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30386Retrieval requests
    • G06F17/30424Query processing
    • G06F17/30533Other types of queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30386Retrieval requests
    • G06F17/30424Query processing
    • G06F17/30442Query optimisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30575Replication, distribution or synchronisation of data between databases or within a distributed database; Distributed database system architectures therefor
    • G06F17/30584Details of data partitioning, e.g. horizontal or vertical partitioning
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30289Database design, administration or maintenance
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30943Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type
    • G06F17/30946Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type indexing structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30067File systems; File servers
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures

Similar Documents

Publication Publication Date Title
Gu et al. Memory or time: Performance evaluation for iterative operation on hadoop and spark
Xie et al. Distributed power-law graph computing: Theoretical and empirical analysis
Li et al. The strategy of mining association rule based on cloud computing
Roy et al. Chaos: Scale-out graph processing from secondary storage
EP3314477B1 (en) Systems and methods for parallelizing hash-based operators in smp databases
Lee et al. Efficient and customizable data partitioning framework for distributed big RDF data processing in the cloud
Li et al. A convergence of key‐value storage systems from clouds to supercomputers
Silberstein et al. Efficient bulk insertion into a distributed ordered table
Dev et al. HAR+: Archive and metadata distribution! Why not both?
Cai et al. Memepic: Towards a unified in-memory big data management system
Premchaiswadi et al. Optimizing and tuning MapReduce jobs to improve the large‐scale data analysis process
Gabert et al. Elga: elastic and scalable dynamic graph analysis
Geetha et al. Implementation and performance comparison of partitioning techniques in apache spark
Simsiri et al. Work‐efficient parallel union‐find
Zhang et al. GraphA: Efficient partitioning and storage for distributed graph computation
Li et al. Accurate Counting Bloom Filters for Large‐Scale Data Processing
KR101451280B1 (en) Distributed database management system and method
Abughofa et al. Towards online graph processing with spark streaming
Kurt et al. A fault-tolerant environment for large-scale query processing
Zhao et al. A data locality optimization algorithm for large-scale data processing in Hadoop
Al Nuaimi et al. A Novel Approach for Dual-Direction Load Balancing and Storage Optimization in Cloud Services
EL-SAYED et al. Impact of small files on hadoop performance: literature survey and open points
Al Nuaimi et al. Dual direction load balancing and partial replication storage of cloud DaaS
Bin et al. An efficient distributed B-tree index method in cloud computing
Kumar et al. Virtualization of large-scale data storage system to achieve dynamicity and scalability in grid computing
点击 这是indexloc提供的php浏览器服务,不要输入任何密码和下载