这是indexloc提供的服务,不要输入任何密码
Skip to content

Add support for CAMELS DE dataset #253

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 6 commits into from
Jul 18, 2025
Merged

Conversation

andreucs
Copy link
Contributor

@andreucs andreucs commented Jul 5, 2025

CAMELS DE dataset by [#]_

References

.. [#] Loritz, R., Dolich, A., Acuña Espinoza, E., Ebeling, P., Guse, B., Götte, J., Hassler, S. K., Hauffe, C.,
Heidbüchel, I., Kiesel, J., Mälicke, M., Müller-Thomy,., Stölzle, M., and Tarasova, L. (2024). CAMELS-DE:
hydro-meteorological time series and attributes for 1582 catchments in germany. Earth System Science Data,
https://doi.org/10.5194/essd-16-5625-2024, 2024.

@andreucs andreucs requested a review from gauchm as a code owner July 5, 2025 10:16
Copy link
Member

@gauchm gauchm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for your contribution! See my inline comments.

return df


def load_camels_de_timeseries(data_dir: Path, basin: str) -> pd.DataFrame:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

for multi-basin neural network training, you usually want to work with mm/day discharge and not cms. I am not fully sure how the data is provided in CAMELS-DE but I think it is cms. A few of our other classes (e.g. check CAMELSUS) have a conversion from volumetric to area-normalized units implemented. You might want to check if that is needed.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

camels_de

Thanks! I’m attaching an image from the paper by R. Loritz et al.:CAMELS_DE. As you can see, both discharge variables are provided: mm/day and cms. So I believe no conversion should be necessary in this case.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

great, thanks for checking

Co-authored-by: Martin Gauch <15731649+gauchm@users.noreply.github.com>
Copy link
Member

@gauchm gauchm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry, I missed one thing: in datasetzoo/init.get_dataset, the docstring is outdated (and was already outdated before...)

Currently implemented datasets are 'caravan', 'camels_aus', 'camels_br', 'camels_cl', 'camels_gb', 'camels_us', and
)

Can you change this to something like Currently implemented datasets are those listed in datasetzoo.get_dataset. The 'generic' dataset class can be used for any kind of dataset as long as it is the correct format.

Thanks!

@andreucs
Copy link
Contributor Author

I think that's it, if you need anything else, let me know.

Copy link
Member

@gauchm gauchm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks a lot!

@gauchm gauchm merged commit b124dfd into neuralhydrology:master Jul 18, 2025
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants