Draft
307 commits
8acea59
pretty print
LeonhardFS Sep 4, 2022
1dd8e89
fixing toPythonString for nested dicts
LeonhardFS Sep 5, 2022
3f191df
test update
LeonhardFS Sep 5, 2022
9c24184
adding typing based on structured type
LeonhardFS Sep 5, 2022
882a63c
access path redo
LeonhardFS Sep 6, 2022
610c057
renaming csv.selectionPushdown -> optimizer.selectionPushdown
LeonhardFS Sep 7, 2022
b95ff23
adding typeobject to typesystem
LeonhardFS Sep 8, 2022
cf55d4e
wip
LeonhardFS Sep 9, 2022
e721757
type object fix
LeonhardFS Sep 9, 2022
6b5b273
added basic type support
LeonhardFS Sep 9, 2022
a655cb4
some basic typing working for isinstance
LeonhardFS Sep 9, 2022
0b66a5f
new samples
LeonhardFS Sep 15, 2022
1b5f6ad
merged in recent master
LeonhardFS Sep 15, 2022
1882c2f
merge
Sep 15, 2022
e19bc03
wip
LeonhardFS Sep 15, 2022
afbde1d
Merge branch 'json' of github.com:LeonhardFS/tuplex-public into json
LeonhardFS Sep 15, 2022
5df9ade
json escaping working
LeonhardFS Sep 16, 2022
8962fdd
unescaping
LeonhardFS Sep 16, 2022
5d7a30d
wip, json parsing and loading
LeonhardFS Sep 20, 2022
79f6b7a
stupidity fix
LeonhardFS Sep 20, 2022
a131583
wip
LeonhardFS Sep 20, 2022
ea21000
more parsing
Sep 20, 2022
fcb7078
err handling
Sep 20, 2022
0322119
wip json codegen
Sep 20, 2022
eebe2c2
parsed stuff
Sep 20, 2022
5f555e6
check
Sep 20, 2022
9232afc
wip, more parsing code
Sep 21, 2022
ac13639
bugfix
Sep 21, 2022
8498fb8
string view map only via boost
Sep 21, 2022
7c0e30c
key check
Sep 21, 2022
5a314c5
parse
Sep 21, 2022
5d075db
debug print
Sep 21, 2022
c7ff59a
json decoding
Sep 21, 2022
961783f
todo
Sep 21, 2022
2942614
fixes
Sep 21, 2022
c993190
double free/delete causes corruption. Change that
Sep 21, 2022
7777d14
another example
Sep 21, 2022
409a11c
wip
Sep 21, 2022
1a1e456
adding single row example, to check parsing
Sep 21, 2022
2321efd
rc defaulted
Sep 21, 2022
8d7fc8a
fixes
Sep 21, 2022
86f0a82
wip working on getting the struct dict type into llvm
Sep 21, 2022
0d72460
wip
Sep 21, 2022
4770313
flattening
Sep 21, 2022
7cf5614
flattening progress
Sep 21, 2022
fd80983
check
Sep 21, 2022
51a695b
llvm::ArrayRef fix
Sep 21, 2022
8478407
clean up
Sep 22, 2022
74079ad
refactoring parsing
Sep 22, 2022
4593564
fix
Sep 22, 2022
0fec323
fixed phi nodes
Sep 22, 2022
b6bee43
list detection, requires serialization to parse it completely in
Sep 22, 2022
f26a9c3
load to llvm ir code
Sep 22, 2022
cc67933
option fix
Sep 22, 2022
56f8eac
wip, better storing
Sep 22, 2022
8b3ec04
fix
Sep 22, 2022
db039f5
works
Sep 22, 2022
2e5edbb
json loading
Sep 22, 2022
ae2e44a
last commit for the night - tbc
Sep 22, 2022
37ca76e
draft serialization
Sep 22, 2022
df3e60e
wip serialization
Sep 22, 2022
081845b
string serialization
Sep 22, 2022
c07c7c0
fix
Sep 23, 2022
30d44b8
adding list init
Sep 23, 2022
1899d4b
removing list[tuple[...]] support for now. lists are anyways way too …
Sep 23, 2022
c92ba18
pointer and memset fixes
Sep 23, 2022
e23dfbc
wip, generating decode loop for array
Sep 23, 2022
84c5ee0
array decode on its way
Sep 23, 2022
8fb67ef
integer list decode
Sep 23, 2022
5287f57
store
Sep 23, 2022
0c0dcbb
serialization size for simple lists
Sep 23, 2022
7127b26
serialization of integer list and fix for size estimation
Sep 23, 2022
5c98508
refactor step I
Sep 23, 2022
2ff4bbf
wip, moving stuff
Sep 23, 2022
27af9d4
cmake fix
Sep 23, 2022
b61cfb1
more wip
Sep 23, 2022
d7891bc
fix
Sep 23, 2022
42ad419
fix so project compiles
Sep 23, 2022
a0d2750
filling in more
Sep 23, 2022
bb56662
another fix
Sep 23, 2022
5967c04
getting closer...
Sep 23, 2022
8fe58f7
compiles
Sep 23, 2022
606f806
not working yet
Sep 23, 2022
f446f7b
trying simdjson::dom mode to avoid out of order issues
Sep 23, 2022
fd3ec49
wow, seems to work...
Sep 23, 2022
323e172
fix for serialization when only presence map is there
Sep 23, 2022
9f90712
fix for empty list
Sep 23, 2022
85fb175
test with opt ok too
Sep 23, 2022
29ee281
template fix
LeonhardFS Sep 23, 2022
8a722ff
refactor
Sep 23, 2022
0b85a08
Merge branch 'json' of github.com:LeonhardFS/tuplex-public into json
Sep 23, 2022
5682ba6
update
LeonhardFS Sep 23, 2022
fd9f39f
wip decoding
Sep 23, 2022
75398f1
fix for unrolling options
Sep 23, 2022
e874234
remove debug print
Sep 23, 2022
3998f05
fix
LeonhardFS Sep 23, 2022
8156896
refactor
LeonhardFS Sep 23, 2022
9f7a5be
output vars for demo
LeonhardFS Sep 23, 2022
5194a5e
adding example in
LeonhardFS Sep 23, 2022
e8be41d
counts
LeonhardFS Sep 23, 2022
613f6d6
wip
LeonhardFS Sep 23, 2022
6277325
wip
Sep 23, 2022
d717e8c
Merge branch 'json' of github.com:LeonhardFS/tuplex-public into json
Sep 23, 2022
bb68916
refactoring
Sep 25, 2022
7860610
format updates
Sep 25, 2022
39db3d9
fixes
Sep 26, 2022
45b8cf0
wip, adding support for list of tuples
Sep 26, 2022
2f9207b
wip
Sep 26, 2022
c638c32
new loadToHeap function for flattened tuple
Sep 26, 2022
893322e
fixed codegen
Sep 26, 2022
cc0c6bd
bool fix
Sep 26, 2022
77bced1
fix
Sep 26, 2022
f3b2b9a
refactor
Sep 26, 2022
f693084
wip setup
Sep 26, 2022
885ad8e
basic row stat
Sep 27, 2022
225a4de
adding memory
Sep 27, 2022
18e05be
fixing estimation
Sep 27, 2022
1e102fa
plotting
Sep 27, 2022
0a3f5db
initial result
Sep 27, 2022
8990ae0
need to fix freeing in code
Sep 27, 2022
9d8caf2
another run with 8mb
Sep 27, 2022
4370e4e
fix
Sep 27, 2022
6794462
less verbose
Sep 27, 2022
1fd91c5
potentially need more investigation into type unification
Sep 27, 2022
5093474
typeopt is not trivial
LeonhardFS Sep 27, 2022
13efbd1
changes
LeonhardFS Sep 28, 2022
4819758
change
Sep 28, 2022
531d94e
Merge branch 'json' of github.com:LeonhardFS/tuplex-public into json
Sep 28, 2022
be9920e
bug in simdjson
Sep 28, 2022
b19dd9b
fix for whitespace lines
Sep 28, 2022
d99ebc8
row count adjust for whitespace
Sep 28, 2022
2aef165
fix for unifying empty list with any list
Sep 28, 2022
ca494e6
more decoding functions
Sep 29, 2022
7737cdb
wip
Sep 29, 2022
0f2c6cc
adding list of options and list of lists
Sep 29, 2022
e14e13e
list of strings fix
Sep 29, 2022
110c967
setup
Sep 29, 2022
b45bd81
notebook
Sep 29, 2022
319e2cf
fixing field count
Sep 29, 2022
4b221fc
plotting
LeonhardFS Sep 29, 2022
aed2ef2
Merge branch 'json' of github.com:LeonhardFS/tuplex-public into json
LeonhardFS Sep 29, 2022
aeafff3
more plotting and analysis
LeonhardFS Sep 29, 2022
bd495a7
add
LeonhardFS Sep 29, 2022
f10da04
plotting and field counts
LeonhardFS Sep 29, 2022
9ea8330
wip
Sep 29, 2022
368af56
prepping for filter promo
Sep 29, 2022
59503b0
hacky layout
Sep 29, 2022
bdff6ac
filter promotion
LeonhardFS Sep 29, 2022
af77ae2
wip
LeonhardFS Sep 29, 2022
2a96365
add
LeonhardFS Sep 29, 2022
6404c35
more examples
LeonhardFS Sep 29, 2022
f3f6441
more correct global sample
LeonhardFS Sep 29, 2022
6baa295
full promo experiment
Sep 29, 2022
0cbab71
plot update
LeonhardFS Sep 30, 2022
7a691a6
plot edits
LeonhardFS Sep 30, 2022
61ab493
prepping JSON integration
Sep 30, 2022
6c06c34
Merge branch 'json' of github.com:LeonhardFS/tuplex-public into json
Sep 30, 2022
39a67f3
wip, adding struct dict to code base
Sep 30, 2022
9a0501e
removing some debug printing
Sep 30, 2022
c80816c
fix
LeonhardFS Sep 30, 2022
6ca377b
fixes
Sep 30, 2022
52861d1
merge
Sep 30, 2022
ff2a7bd
wip tuplex support for json
LeonhardFS Sep 30, 2022
c639d5a
merging in recent master with bugfixes
Oct 3, 2022
92f40a3
wip, compile json source task builder
Oct 3, 2022
7ddc25e
adding json input
Oct 3, 2022
06ce8d9
adding JSON reader class into the mix
Oct 3, 2022
176f977
merge in v0.3.4 release
Oct 3, 2022
bf2f5e1
version update & docker update
Oct 3, 2022
8ea8e34
another fix
Oct 3, 2022
4064665
another hashmap dispatch fix
Oct 3, 2022
6203ca1
version update for docker example file
Oct 3, 2022
1c04d7a
Merge branch 'docker_v0.3.4' into json
Oct 3, 2022
3329717
json edits, reader similar to jitcsvcompiled reader
Oct 3, 2022
8a4d00d
bug fixes
Oct 3, 2022
7406b72
adding doc detection function
Oct 3, 2022
c6fcb28
sampling fix
Oct 3, 2022
bd0adb9
parsing through json file
Oct 4, 2022
4b3f61e
refactor and wip to get json parsing into code base
Oct 4, 2022
d6e7824
unwrapping layout
Oct 4, 2022
a1df6b6
add dummy
LeonhardFS Oct 4, 2022
fee9339
fix
LeonhardFS Oct 4, 2022
269e6fa
wip struct assign
LeonhardFS Oct 4, 2022
2850919
c++ move fix
LeonhardFS Oct 4, 2022
ae494eb
refactor dummy value
LeonhardFS Oct 4, 2022
7a18225
adding init vars
LeonhardFS Oct 4, 2022
8f87754
adding jit symbols
LeonhardFS Oct 4, 2022
3bb4959
wip
LeonhardFS Oct 4, 2022
aa7211d
normal-case basic print
Oct 4, 2022
d84de0b
hacky dummy to avoid full struct dict decoding
Oct 4, 2022
aba2d99
basic parse with fallback
Oct 5, 2022
0a98672
test correction
Oct 5, 2022
ec6cad4
order fix for sampling
Oct 5, 2022
c01b498
mapping dicts to pyobject for now to write less code
Oct 5, 2022
2f30c55
add new github test, need to fix tomorrow
Oct 5, 2022
b0189c2
note
Oct 5, 2022
bee3b94
struct extract fix
Oct 5, 2022
a36b07f
need to add load/store of nested dicts
Oct 5, 2022
72352e6
pretty print and counter for filtered rows
LeonhardFS Oct 5, 2022
c1cb022
Merge branch 'json' of github.com:LeonhardFS/tuplex-public into json
LeonhardFS Oct 5, 2022
c5edd47
merged in recent master and bumped version
LeonhardFS Oct 5, 2022
f098e56
loading experiment
Oct 5, 2022
824b4a5
Merge branch 'json' of github.com:LeonhardFS/tuplex-public into json
Oct 5, 2022
c624058
access path printing update
Oct 5, 2022
7323504
fix for option[struct]
Oct 5, 2022
282229d
fix, it runs. now just need to clear up everything
Oct 5, 2022
7b03d83
when unwrapping, replace with None
Oct 5, 2022
a91e36f
fix re malloced fallback row
Oct 5, 2022
e90da2c
zeroing fileinput vars
Oct 5, 2022
e1fd915
index error fixed in serialization
Oct 5, 2022
14bd387
remove debug printing
Oct 5, 2022
3bcaa5f
notes
Oct 5, 2022
944752d
fixing for py -> llvm mapping
Oct 6, 2022
6bf7dd8
adding const extract
Oct 6, 2022
02bf125
wip accessing struct dict elements via static keys
Oct 6, 2022
9cb4485
more fun with pipelines
Oct 6, 2022
41abb1e
notes and a fun application
Oct 6, 2022
a1dca99
reactivated deletion -> may lead to failures, debug tomorrow
Oct 6, 2022
9fc437b
extern C to help with rel issue
Oct 6, 2022
0abdd81
debug printing
Oct 6, 2022
84ae41d
adding missing free end update
Oct 6, 2022
13daa47
need to fix this
Oct 6, 2022
9635ea0
does getting rid of move semantics help?
Oct 6, 2022
34ef384
need to add ctor block for vars (null initializing them)
Oct 6, 2022
ed5bf8c
wip, fixing object freeing
LeonhardFS Oct 6, 2022
362a64c
merge
LeonhardFS Oct 6, 2022
85a9849
compile fix
LeonhardFS Oct 6, 2022
5f5da9e
add debug print
LeonhardFS Oct 6, 2022
cb66e4d
memory debugging
LeonhardFS Oct 6, 2022
82c8354
more minimal testing setup
LeonhardFS Oct 6, 2022
712652c
bug has something to do with freeing blocks/fallback etc.
LeonhardFS Oct 6, 2022
b6569c2
more tracing
LeonhardFS Oct 6, 2022
c5bca9b
adding def
LeonhardFS Oct 6, 2022
80c81cf
trace
LeonhardFS Oct 6, 2022
3ca9d9b
swapping test file
LeonhardFS Oct 6, 2022
4593a7f
merged in master
LeonhardFS Oct 6, 2022
86b3bae
json python bindings
LeonhardFS Oct 6, 2022
f418ce3
wip json demo
LeonhardFS Oct 6, 2022
8859b75
API fix
LeonhardFS Oct 6, 2022
2b33ffe
cleared demo
LeonhardFS Oct 6, 2022
7a9fad6
wip working on freeing
Oct 10, 2022
0b20058
wip, rearchitect freeing
Oct 10, 2022
52444f0
better free
Oct 10, 2022
78d3b28
seems to work now
Oct 10, 2022
2746228
fixes
Oct 10, 2022
799ba56
wip
Oct 10, 2022
680a374
match fixes
Oct 11, 2022
c4cbb4a
link update
Oct 11, 2022
d22ce50
analysis update
Oct 11, 2022
494e959
more analysis
Oct 11, 2022
2 changes: 1 addition & 1 deletion benchmarks/311-demo/runtuplex.py
@@ -69,7 +69,7 @@ def fix_zip_codes(zips):
"runTimeMemory": "128MB",
"useLLVMOptimizer": True,
"optimizer.nullValueOptimization": True,
"csv.selectionPushdown": True,
"optimizer.selectionPushdown": True,
"optimizer.generateParser": True,
"tuplex.optimizer.mergeExceptionsInOrder": False,
"csv.filterPushdown": True,
2 changes: 1 addition & 1 deletion benchmarks/311-demo/tuplex_config.json
@@ -1 +1 @@
{"webui.enable": false, "executorMemory": "4G", "executorCount": 63, "driverMemory": "4G", "partitionSize": "32MB", "runTimeMemory": "4MB", "inputSplitSize": "8MB", "useLLVMOptimizer": true, "optimizer.nullValueOptimization": false, "csv.selectionPushdown": true, "optimizer.generateParser": false, "optimizer.mergeExceptionsInOrder": false, "optimizer.filterPushdown": true}
{"webui.enable": false, "executorMemory": "4G", "executorCount": 63, "driverMemory": "4G", "partitionSize": "32MB", "runTimeMemory": "4MB", "inputSplitSize": "8MB", "useLLVMOptimizer": true, "optimizer.nullValueOptimization": false, "optimizer.selectionPushdown": true, "optimizer.generateParser": false, "optimizer.mergeExceptionsInOrder": false, "optimizer.filterPushdown": true}
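Configuration files written before this rename still carry the old `csv.selectionPushdown` key. A minimal sketch of how such a dictionary could be migrated follows; `RENAMED_KEYS` and `migrate_conf` are hypothetical helper names for illustration and are not part of Tuplex.

```python
import json

# Hypothetical mapping of deprecated option names to their replacements,
# based only on the rename shown in this diff.
RENAMED_KEYS = {"csv.selectionPushdown": "optimizer.selectionPushdown"}


def migrate_conf(conf):
    """Return a copy of conf with deprecated keys renamed.

    If both the old and the new key are present, the new key's value wins.
    """
    migrated = dict(conf)  # leave the caller's dictionary untouched
    for old, new in RENAMED_KEYS.items():
        if old in migrated:
            value = migrated.pop(old)
            migrated.setdefault(new, value)
    return migrated


old_conf = json.loads('{"useLLVMOptimizer": true, "csv.selectionPushdown": true}')
new_conf = migrate_conf(old_conf)
print(json.dumps(new_conf))
```

Returning a copy rather than mutating in place keeps the original settings available for comparison, which may be useful when checking many benchmark configs at once.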
2 changes: 1 addition & 1 deletion benchmarks/311/create_conf.py
@@ -29,7 +29,7 @@
'inputSplitSize' : args.input_split_size,
'useLLVMOptimizer' : args.opt_llvm,
'optimizer.nullValueOptimization' : args.opt_null,
'csv.selectionPushdown' : args.opt_pushdown,
'optimizer.selectionPushdown' : args.opt_pushdown,
'optimizer.generateParser' : args.opt_parser,
'optimizer.mergeExceptionsInOrder' : False,
'optimizer.filterPushdown' : args.opt_filter}
2 changes: 1 addition & 1 deletion benchmarks/311/runtuplex.py
@@ -69,7 +69,7 @@ def fix_zip_codes(zips):
"runTimeMemory": "128MB",
"useLLVMOptimizer": True,
"optimizer.nullValueOptimization": True,
"csv.selectionPushdown": True,
"optimizer.selectionPushdown": True,
"optimizer.generateParser": True,
"tuplex.optimizer.mergeExceptionsInOrder": False,
"csv.filterPushdown": True,
2 changes: 1 addition & 1 deletion benchmarks/311/tuplex_config_mt.json
@@ -1 +1 @@
{"webui.enable": false, "executorMemory": "6G", "executorCount": 15, "driverMemory": "10G", "partitionSize": "32MB", "runTimeMemory": "64MB", "inputSplitSize": "64MB", "useLLVMOptimizer": true, "optimizer.nullValueOptimization": false, "csv.selectionPushdown": true, "optimizer.generateParser": false, "optimizer.mergeExceptionsInOrder": false, "optimizer.filterPushdown": true}
{"webui.enable": false, "executorMemory": "6G", "executorCount": 15, "driverMemory": "10G", "partitionSize": "32MB", "runTimeMemory": "64MB", "inputSplitSize": "64MB", "useLLVMOptimizer": true, "optimizer.nullValueOptimization": false, "optimizer.selectionPushdown": true, "optimizer.generateParser": false, "optimizer.mergeExceptionsInOrder": false, "optimizer.filterPushdown": true}
2 changes: 1 addition & 1 deletion benchmarks/311/tuplex_config_st.json
@@ -1 +1 @@
{"webui.enable": false, "executorMemory": "6G", "executorCount": 0, "driverMemory": "100G", "partitionSize": "32MB", "runTimeMemory": "64MB", "inputSplitSize": "64MB", "useLLVMOptimizer": true, "optimizer.nullValueOptimization": false, "csv.selectionPushdown": true, "optimizer.generateParser": false, "optimizer.mergeExceptionsInOrder": false, "optimizer.filterPushdown": true}
{"webui.enable": false, "executorMemory": "6G", "executorCount": 0, "driverMemory": "100G", "partitionSize": "32MB", "runTimeMemory": "64MB", "inputSplitSize": "64MB", "useLLVMOptimizer": true, "optimizer.nullValueOptimization": false, "optimizer.selectionPushdown": true, "optimizer.generateParser": false, "optimizer.mergeExceptionsInOrder": false, "optimizer.filterPushdown": true}
2 changes: 1 addition & 1 deletion benchmarks/README.md
@@ -262,7 +262,7 @@ Use following config for flights, to get it fast...
"autoUpcast":true,
"optimizer.generateParser":true,
"optimizer.nullValueOptimization": true,
"csv.selectionPushdown": true,
"optimizer.selectionPushdown": true,
"resolveWithInterpreterOnly":false}
```

2 changes: 1 addition & 1 deletion benchmarks/dirty_zillow/create_conf.py
@@ -29,7 +29,7 @@
'inputSplitSize' : args.input_split_size,
'useLLVMOptimizer' : args.opt_llvm,
'optimizer.nullValueOptimization' : args.opt_null,
'csv.selectionPushdown' : args.opt_pushdown,
'optimizer.selectionPushdown' : args.opt_pushdown,
'optimizer.generateParser' : args.opt_parser,
'optimizer.mergeExceptionsInOrder' : False,
'optimizer.filterPushdown' : args.opt_filter}
2 changes: 1 addition & 1 deletion benchmarks/dirty_zillow/runtuplex.py
@@ -471,7 +471,7 @@ def extractZipcode(x):
"runTimeMemory" : "128MB",
"useLLVMOptimizer" : True,
"optimizer.nullValueOptimization" : False,
"csv.selectionPushdown" : True,
"optimizer.selectionPushdown" : True,
"optimizer.generateParser" : False} # bug when using generated parser. Need to fix that.

if os.path.exists('tuplex_config.json'):
2 changes: 1 addition & 1 deletion benchmarks/dirty_zillow/tuplex_config.json
@@ -1 +1 @@
{"webui.enable": false, "executorMemory": "6G", "executorCount": 15, "driverMemory": "10G", "partitionSize": "32MB", "runTimeMemory": "64MB", "inputSplitSize": "64MB", "useLLVMOptimizer": true, "optimizer.nullValueOptimization": false, "csv.selectionPushdown": true, "optimizer.generateParser": false, "optimizer.mergeExceptionsInOrder": false, "optimizer.filterPushdown": true}
{"webui.enable": false, "executorMemory": "6G", "executorCount": 15, "driverMemory": "10G", "partitionSize": "32MB", "runTimeMemory": "64MB", "inputSplitSize": "64MB", "useLLVMOptimizer": true, "optimizer.nullValueOptimization": false, "optimizer.selectionPushdown": true, "optimizer.generateParser": false, "optimizer.mergeExceptionsInOrder": false, "optimizer.filterPushdown": true}
2 changes: 1 addition & 1 deletion benchmarks/distributed/tuplex/run_tuplex.py
@@ -180,7 +180,7 @@ def filterBd(x):
"runTimeMemory": "128MB",
"useLLVMOptimizer": True,
"optimizer.nullValueOptimization": False,
"csv.selectionPushdown": True,
"optimizer.selectionPushdown": True,
}

# Begin pipeline
2 changes: 1 addition & 1 deletion benchmarks/flights/create_conf.py
@@ -29,7 +29,7 @@
'inputSplitSize' : args.input_split_size,
'useLLVMOptimizer' : args.opt_llvm,
'optimizer.nullValueOptimization' : args.opt_null,
'csv.selectionPushdown' : args.opt_pushdown,
'optimizer.selectionPushdown' : args.opt_pushdown,
'optimizer.generateParser' : args.opt_parser,
'optimizer.mergeExceptionsInOrder' : False,
'optimizer.filterPushdown' : args.opt_filter,
2 changes: 1 addition & 1 deletion benchmarks/flights/run_on_aws.py
@@ -289,7 +289,7 @@ def run_tuplex(instances, branch, enableLLVMOpt, enablePushdown, num_runs=5, num
"partitionSize": "32MB",
"runTimeMemory": "256MB",
"useLLVMOptimizer": enableLLVMOpt,
"csv.selectionPushdown": enablePushdown}
"optimizer.selectionPushdown": enablePushdown}
with open(config_file, 'w') as fp:
json.dump(conf, fp)

4 changes: 2 additions & 2 deletions benchmarks/flights/runtuplex.py
@@ -77,7 +77,7 @@
"runTimeMemory" : "128MB",
"useLLVMOptimizer" : False,
"optimizer.nullValueOptimization" : False,
"csv.selectionPushdown" : True,
"optimizer.selectionPushdown" : True,
"tuplex.optimizer.generateParser": True}


@@ -105,7 +105,7 @@
df = df.renameColumn(c, renamed_cols[i])

# if cache/nosf mode is active, prefilter
if args.simulate_spark and conf["csv.selectionPushdown"]:
if args.simulate_spark and conf["optimizer.selectionPushdown"]:
df = df.selectColumns(time_req_cols)
if args.simulate_spark:
df = df.cache()
2 changes: 1 addition & 1 deletion benchmarks/flights/tuplex_config.json
@@ -6,6 +6,6 @@
"runTimeMemory": "8MB",
"useLLVMOptimizer": true,
"optimizer.nullValueOptimization": true,
"csv.selectionPushdown": true,
"optimizer.selectionPushdown": true,
"resolveWithInterpreterOnly":false,
"mergeRowsInOrder":false}
2 changes: 1 addition & 1 deletion benchmarks/flights/tuplex_config_template.json
@@ -6,6 +6,6 @@
"runTimeMemory": "8MB",
"useLLVMOptimizer": true,
"optimizer.nullValueOptimization": true,
"csv.selectionPushdown": true,
"optimizer.selectionPushdown": true,
"resolveWithInterpreterOnly":false,
"mergeRowsInOrder":false}
2 changes: 1 addition & 1 deletion benchmarks/logs/create_conf.py
@@ -29,7 +29,7 @@
'inputSplitSize' : args.input_split_size,
'useLLVMOptimizer' : args.opt_llvm,
'optimizer.nullValueOptimization' : args.opt_null,
'csv.selectionPushdown' : args.opt_pushdown,
'optimizer.selectionPushdown' : args.opt_pushdown,
'optimizer.generateParser' : args.opt_parser,
'optimizer.mergeExceptionsInOrder' : False,
'optimizer.filterPushdown' : args.opt_filter}
2 changes: 1 addition & 1 deletion benchmarks/logs/manual_reorder/runtuplex-io.py
@@ -212,7 +212,7 @@ def randomize_udf(x):
"runTimeMemory": "128MB",
"useLLVMOptimizer": False,
"nullValueOptimization": False,
"csv.selectionPushdown": False,
"optimizer.selectionPushdown": False,
"optimizer.generateParser": False
}

2 changes: 1 addition & 1 deletion benchmarks/logs/manual_reorder/runtuplex.py
@@ -220,7 +220,7 @@ def randomize_udf(x):
"runTimeMemory": "128MB",
"useLLVMOptimizer": False,
"nullValueOptimization": False,
"csv.selectionPushdown": False,
"optimizer.selectionPushdown": False,
"optimizer.generateParser": False
}

2 changes: 1 addition & 1 deletion benchmarks/logs/manual_reorder/tuplex_config.json
@@ -7,7 +7,7 @@
"inputSplitSize": "64MB",
"useLLVMOptimizer": true,
"optimizer.nullValueOptimization": true,
"csv.selectionPushdown": true,
"optimizer.selectionPushdown": true,
"optimizer.generateParser": false,
"optimizer.mergeExceptionsInOrder": false,
"optimizer.filterPushdown": true}
2 changes: 1 addition & 1 deletion benchmarks/logs/microbenchmarks/single_regex/runtuplex.py
@@ -70,7 +70,7 @@ def ParseWithRegex(logline):
"runTimeMemory": "128MB",
"useLLVMOptimizer": False,
"nullValueOptimization": False,
"csv.selectionPushdown": False,
"optimizer.selectionPushdown": False,
"optimizer.generateParser": False
}

@@ -7,6 +7,6 @@
"runTimeMemory": "256MB",
"useLLVMOptimizer": true,
"optimizer.nullValueOptimization": false,
"csv.selectionPushdown": false,
"optimizer.selectionPushdown": false,
"optimizer.generateParser": false
}
2 changes: 1 addition & 1 deletion benchmarks/logs/runtuplex-io.py
@@ -156,7 +156,7 @@ def randomize_udf(x):
"runTimeMemory": "128MB",
"useLLVMOptimizer": False,
"nullValueOptimization": False,
"csv.selectionPushdown": False,
"optimizer.selectionPushdown": False,
"optimizer.generateParser": False
}

2 changes: 1 addition & 1 deletion benchmarks/logs/runtuplex.py
@@ -220,7 +220,7 @@ def randomize_udf(x):
"runTimeMemory": "128MB",
"useLLVMOptimizer": False,
"nullValueOptimization": False,
"csv.selectionPushdown": False,
"optimizer.selectionPushdown": False,
"optimizer.generateParser": False
}

2 changes: 1 addition & 1 deletion benchmarks/logs/tuplex_config.json
@@ -7,7 +7,7 @@
"inputSplitSize": "64MB",
"useLLVMOptimizer": true,
"optimizer.nullValueOptimization": true,
"csv.selectionPushdown": true,
"optimizer.selectionPushdown": true,
"optimizer.generateParser": false,
"optimizer.mergeExceptionsInOrder": false,
"optimizer.filterPushdown": true}
2 changes: 1 addition & 1 deletion benchmarks/orc/runcsv-read-single.py
@@ -41,7 +41,7 @@
"runTimeMemory" : "128MB",
"useLLVMOptimizer" : True,
"optimizer.nullValueOptimization" : False,
"csv.selectionPushdown" : True}
"optimizer.selectionPushdown" : True}

if os.path.exists('tuplex_config.json'):
with open('tuplex_config.json') as fp:
2 changes: 1 addition & 1 deletion benchmarks/orc/runcsv-write.py
@@ -41,7 +41,7 @@
"runTimeMemory" : "128MB",
"useLLVMOptimizer" : True,
"optimizer.nullValueOptimization" : False,
"csv.selectionPushdown" : True}
"optimizer.selectionPushdown" : True}

if os.path.exists('tuplex_config.json'):
with open('tuplex_config.json') as fp:
2 changes: 1 addition & 1 deletion benchmarks/orc/runorc-read-single.py
@@ -41,7 +41,7 @@
"runTimeMemory" : "128MB",
"useLLVMOptimizer" : True,
"optimizer.nullValueOptimization" : False,
"csv.selectionPushdown" : True}
"optimizer.selectionPushdown" : True}

if os.path.exists('tuplex_config.json'):
with open('tuplex_config.json') as fp:
2 changes: 1 addition & 1 deletion benchmarks/orc/runorc-write.py
@@ -41,7 +41,7 @@
"runTimeMemory" : "128MB",
"useLLVMOptimizer" : True,
"optimizer.nullValueOptimization" : False,
"csv.selectionPushdown" : True}
"optimizer.selectionPushdown" : True}

if os.path.exists('tuplex_config.json'):
with open('tuplex_config.json') as fp:
@@ -121,7 +121,7 @@ def extractType(row):
t += 'sf'
else:
t += 'nosf'
if row['csv.selectionPushdown']:
if row['optimizer.selectionPushdown']:
t += '+logical'
if row['optimizer.nullValueOptimization']:
t += '+null'
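Result files recorded before the rename may still use the old `csv.selectionPushdown` column, so a lookup like the one in `extractType` could fail on them. The sketch below shows one way to read the flag under either name; `selection_pushdown_enabled` is a hypothetical helper, not a function from the benchmark scripts, and it assumes `row` supports `in` and `[]` lookups by key, as a plain dict does.

```python
def selection_pushdown_enabled(row, default=False):
    """Read the selection-pushdown flag under its new or legacy name.

    The new name is checked first; the legacy name is only a fallback.
    """
    for key in ("optimizer.selectionPushdown", "csv.selectionPushdown"):
        if key in row:
            return bool(row[key])
    return default


legacy_row = {"csv.selectionPushdown": True, "optimizer.nullValueOptimization": False}
current_row = {"optimizer.selectionPushdown": False}
print(selection_pushdown_enabled(legacy_row), selection_pushdown_enabled(current_row))
```

Checking the new key first means a row that somehow carries both names follows the current setting.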
2 changes: 1 addition & 1 deletion benchmarks/tpch/Q06/create_conf.py
@@ -30,7 +30,7 @@
'inputSplitSize' : args.input_split_size,
'useLLVMOptimizer' : args.opt_llvm,
'optimizer.nullValueOptimization' : args.opt_null,
'csv.selectionPushdown' : args.opt_pushdown,
'optimizer.selectionPushdown' : args.opt_pushdown,
'optimizer.generateParser' : args.opt_parser,
'optimizer.mergeExceptionsInOrder' : False,
'optimizer.filterPushdown' : args.opt_filter}
2 changes: 1 addition & 1 deletion benchmarks/tpch/Q06/runtuplex.py
@@ -36,7 +36,7 @@
"runTimeMemory": "128MB",
"useLLVMOptimizer": True,
"nullValueOptimization": True,
"csv.selectionPushdown": True,
"optimizer.selectionPushdown": True,
"optimizer.generateParser": True,
}

2 changes: 1 addition & 1 deletion benchmarks/tpch/Q19/create_conf.py
@@ -30,7 +30,7 @@
'inputSplitSize' : args.input_split_size,
'useLLVMOptimizer' : args.opt_llvm,
'optimizer.nullValueOptimization' : args.opt_null,
'csv.selectionPushdown' : args.opt_pushdown,
'optimizer.selectionPushdown' : args.opt_pushdown,
'optimizer.generateParser' : args.opt_parser,
'optimizer.mergeExceptionsInOrder' : False,
'optimizer.filterPushdown' : args.opt_filter}
2 changes: 1 addition & 1 deletion benchmarks/tpch/Q19/runtuplex.py
@@ -69,7 +69,7 @@
"runTimeMemory": "128MB",
"useLLVMOptimizer": True,
"nullValueOptimization": True,
"csv.selectionPushdown": True,
"optimizer.selectionPushdown": True,
"optimizer.generateParser": True,
}

2 changes: 1 addition & 1 deletion benchmarks/zillow/Z1/baseline/tuplex_config.json
@@ -8,7 +8,7 @@
"runTimeMemory": "256MB",
"useLLVMOptimizer": true,
"optimizer.nullValueOptimization": true,
"csv.selectionPushdown": true,
"optimizer.selectionPushdown": true,
"optimizer.generateParser": true,
"optimizer.filterPushdown": true
}
2 changes: 1 addition & 1 deletion benchmarks/zillow/Z1/runtuplex-io.py
@@ -149,7 +149,7 @@ def filterBd(x):
"runTimeMemory" : "128MB",
"useLLVMOptimizer" : True,
"optimizer.nullValueOptimization" : True,
"csv.selectionPushdown" : True}
"optimizer.selectionPushdown" : True}

# # conf for r5d.4xlarge (in total: 100GB, same as spark)
# conf = {"webui.enable" : False,
2 changes: 1 addition & 1 deletion benchmarks/zillow/Z1/runtuplex.py
@@ -155,7 +155,7 @@ def filterBd(x):
"runTimeMemory" : "128MB",
"useLLVMOptimizer" : True,
"optimizer.nullValueOptimization" : False,
"csv.selectionPushdown" : True}
"optimizer.selectionPushdown" : True}

# # conf for r5d.4xlarge (in total: 100GB, same as spark)
# conf = {"webui.enable" : False,
2 changes: 1 addition & 1 deletion benchmarks/zillow/Z1/tuplex_config.json
@@ -6,4 +6,4 @@
"runTimeMemory": "8MB",
"useLLVMOptimizer": true,
"optimizer.nullValueOptimization": true,
"csv.selectionPushdown": true}
"optimizer.selectionPushdown": true}
2 changes: 1 addition & 1 deletion benchmarks/zillow/Z2/baseline/tuplex_config.json
@@ -8,7 +8,7 @@
"runTimeMemory": "256MB",
"useLLVMOptimizer": true,
"optimizer.nullValueOptimization": true,
"csv.selectionPushdown": true,
"optimizer.selectionPushdown": true,
"optimizer.generateParser": true,
"optimizer.filterPushdown": true
}
2 changes: 1 addition & 1 deletion benchmarks/zillow/Z2/runtuplex.py
@@ -158,7 +158,7 @@ def filterBd(x):
"useLLVMOptimizer" : True,
"optimizer.nullValueOptimization" : False,
"tuplex.allowUndefinedBehavior" : False,
"csv.selectionPushdown" : True,
"optimizer.selectionPushdown" : True,
"optimizer.filterPushdown" : True,
"optimizer.generateParser": False}

2 changes: 1 addition & 1 deletion benchmarks/zillow/Z2/tuplex_config.json
@@ -6,4 +6,4 @@
"runTimeMemory": "8MB",
"useLLVMOptimizer": true,
"optimizer.nullValueOptimization": true,
"csv.selectionPushdown": true}
"optimizer.selectionPushdown": true}