Schelter et al., 2013 - Google Patents
Distributed matrix factorization with mapreduce using a series of broadcast-joinsSchelter et al., 2013
View PDF- Document ID
 - 9561692235355724094
 - Author
 - Schelter S
 - Boden C
 - Schenck M
 - Alexandrov A
 - Markl V
 - Publication year
 - Publication venue
 - Proceedings of the 7th ACM Conference on Recommender Systems
 
External Links
Snippet
The efficient, distributed factorization of large matrices on clusters of commodity machines is  crucial to applying latent factor models in industrial-scale recommender systems. We  propose an efficient, data-parallel low-rank matrix factorization with Alternating Least … 
    - 239000011159 matrix material 0 title abstract description 25
 
Classifications
- 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
 - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
 - G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
 - G06F17/30386—Retrieval requests
 - G06F17/30424—Query processing
 - G06F17/30533—Other types of queries
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F9/00—Arrangements for programme control, e.g. control unit
 - G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
 - G06F9/46—Multiprogramming arrangements
 - G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
 - G06F9/5061—Partitioning or combining of resources
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
 - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
 - G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
 - G06F17/30386—Retrieval requests
 - G06F17/30424—Query processing
 - G06F17/30477—Query execution
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F9/00—Arrangements for programme control, e.g. control unit
 - G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
 - G06F9/46—Multiprogramming arrangements
 - G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
 - G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
 - G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
 - G06F9/5038—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the execution order of a plurality of tasks, e.g. taking priority or time dependency constraints into consideration
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
 - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
 - G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
 - G06F17/30289—Database design, administration or maintenance
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
 - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
 - G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
 - G06F17/30575—Replication, distribution or synchronisation of data between databases or within a distributed database; Distributed database system architectures therefor
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
 - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
 - G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
 - G06F17/30312—Storage and indexing structures; Management thereof
 - G06F17/30321—Indexing structures
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
 - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
 - G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
 - G06F17/30587—Details of specialised database models
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F9/00—Arrangements for programme control, e.g. control unit
 - G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
 - G06F9/46—Multiprogramming arrangements
 - G06F9/48—Programme initiating; Programme switching, e.g. by interrupt
 - G06F9/4806—Task transfer initiation or dispatching
 - G06F9/4843—Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
 - G06F9/4881—Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F8/00—Arrangements for software engineering
 - G06F8/70—Software maintenance or management
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F8/00—Arrangements for software engineering
 - G06F8/60—Software deployment
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F8/00—Arrangements for software engineering
 - G06F8/40—Transformations of program code
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F11/00—Error detection; Error correction; Monitoring
 - G06F11/07—Error detection; Error correction; Monitoring responding to the occurence of a fault, e.g. fault tolerance
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
 - G06Q10/00—Administration; Management
 - G06Q10/06—Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06F—ELECTRICAL DIGITAL DATA PROCESSING
 - G06F2209/00—Indexing scheme relating to G06F9/00
 
 - 
        
- G—PHYSICS
 - G06—COMPUTING; CALCULATING; COUNTING
 - G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
 - G06N99/00—Subject matter not provided for in other groups of this subclass
 
 
Similar Documents
| Publication | Publication Date | Title | 
|---|---|---|
| Schelter et al. | Distributed matrix factorization with mapreduce using a series of broadcast-joins | |
| Sparks et al. | Automating model search for large scale machine learning | |
| Dobre et al. | Parallel programming paradigms and frameworks in big data era | |
| García-Gil et al. | A comparison on scalability for batch big data processing on Apache Spark and Apache Flink | |
| Landset et al. | A survey of open source tools for machine learning with big data in the Hadoop ecosystem | |
| Parsian | Data algorithms: recipes for scaling up with Hadoop and Spark | |
| Gunarathne et al. | Scalable parallel computing on clouds using Twister4Azure iterative MapReduce | |
| Kumar et al. | Verification and validation of mapreduce program model for parallel k-means algorithm on hadoop cluster | |
| Schelter et al. | Scalable similarity-based neighborhood methods with mapreduce | |
| Spangenberg et al. | Evaluating new approaches of big data analytics frameworks | |
| Machireddy | Data quality management and performance optimization for enterprise-scale etl pipelines in modern analytical ecosystems | |
| Ekanayake et al. | Dryadlinq for scientific analyses | |
| US20190220532A1 (en) | Data processing with nullable schema information | |
| Gunarathne et al. | Portable parallel programming on cloud and hpc: Scientific applications of twister4azure | |
| Zhang et al. | Applying twister to scientific applications | |
| Singh et al. | Data processing framework using apache and spark technologies in big data | |
| Alemi et al. | CCFinder: using Spark to find clustering coefficient in big graphs | |
| Płaza et al. | Analysis of parallel computational models for clustering | |
| Cohen et al. | High-performance statistical modeling | |
| Sarkar et al. | MapReduce: A comprehensive study on applications, scope and challenges | |
| Ruan et al. | Hymr: a hybrid mapreduce workflow system | |
| Kang et al. | Data/feature distributed stochastic coordinate descent for logistic regression | |
| Shahverdi et al. | Comparative evaluation for the performance of big stream processing systems | |
| Zou et al. | Lachesis: Automated generation of persistent partitionings for big data applications | |
| Diedhiou et al. | Performance comparison of a parallel recommender algorithm across three hadoop-based frameworks |