-
Notifications
You must be signed in to change notification settings - Fork 234
Add support for CAMELS DE dataset #253
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for your contribution! See my inline comments.
return df | ||
|
||
|
||
def load_camels_de_timeseries(data_dir: Path, basin: str) -> pd.DataFrame: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
for multi-basin neural network training, you usually want to work with mm/day discharge and not cms. I am not fully sure how the data is provided in CAMELS-DE but I think it is cms. A few of our other classes (e.g. check CAMELSUS) have a conversion from volumetric to area-normalized units implemented. You might want to check if that is needed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
great, thanks for checking
Co-authored-by: Martin Gauch <15731649+gauchm@users.noreply.github.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry, I missed one thing: in datasetzoo/init.get_dataset, the docstring is outdated (and was already outdated before...)
Currently implemented datasets are 'caravan', 'camels_aus', 'camels_br', 'camels_cl', 'camels_gb', 'camels_us', and |
Can you change this to something like Currently implemented datasets are those listed in datasetzoo.get_dataset. The 'generic' dataset class can be used for any kind of dataset as long as it is the correct format.
Thanks!
I think that's it, if you need anything else, let me know. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks a lot!
CAMELS DE dataset by [#]_
References
.. [#] Loritz, R., Dolich, A., Acuña Espinoza, E., Ebeling, P., Guse, B., Götte, J., Hassler, S. K., Hauffe, C.,
Heidbüchel, I., Kiesel, J., Mälicke, M., Müller-Thomy,., Stölzle, M., and Tarasova, L. (2024). CAMELS-DE:
hydro-meteorological time series and attributes for 1582 catchments in germany. Earth System Science Data,
https://doi.org/10.5194/essd-16-5625-2024, 2024.