Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

UFS DA cycling on gaea #2985

Open
jswhit opened this issue Oct 4, 2024 · 7 comments
Open

UFS DA cycling on gaea #2985

jswhit opened this issue Oct 4, 2024 · 7 comments
Labels
feature New feature or request triage Issues that are triage

Comments

@jswhit
Copy link
Contributor

jswhit commented Oct 4, 2024

What new functionality do you need?

support for DA cycling on gaea c5 (and also c6). After cloning and building on gaea c5, I setup an experiment and started it. The workflow ran successfully through the first forecast, but died in the first da related setup (marinebmat). Looking at the log I see "WARNING: UFSDA NOT SUPPORTED ON THIS PLATFORM". What will it take to this capability working? I've pre-staged the observations and initial conditions I will need.

BTW: The marinebmat error that follows the warning message is

++ jjob_header.sh[79]: cycle=t06z
++ jjob_header.sh[80]: setpdy.sh
/gpfs/f5/nggps_psd/proj-shared/Jeffrey.S.Whitaker/global-workflow/ush/jjob_header.sh: line 80: setpdy.sh: command not found
+ jjob_header.sh[1]: postamble JGLOBAL_MARINE_BMAT 1728083449 127

What are the requirements for the new functionality?

I don't know

Acceptance Criteria

coupled DA cycling working on at least gaea c5, and ultimately c6 also

Suggest a solution (optional)

No response

@jswhit jswhit added feature New feature or request triage Issues that are triage labels Oct 4, 2024
@jswhit
Copy link
Contributor Author

jswhit commented Oct 4, 2024

@AnilKumar-NOAA

@jswhit
Copy link
Contributor Author

jswhit commented Oct 7, 2024

made some progress by modifying env/GAEA.env to match env/HERCULES.env, and by editing ush/load_ufsda_modules.sh to remove gaea from the 'unsupported' branch. Now failing in gdasmarineanlvar when executing marinevariational/gdas.x.

@AnilKumar-NOAA
Copy link
Contributor

We have not tested DA part yet on gaea c5 system. It's good to know that you did attempt and tested DA on gaea c5. gdas sometime create issues. Will check on this on my side as well. Thanks

@jswhit
Copy link
Contributor Author

jswhit commented Oct 7, 2024

I've gotten past the crash in gdasmarineanlvar with a one line fix to scripts/exglobal_marine_analysis_initialize.py (PR from @guillaumevernieres forthcoming). However, the gdasmarineanlvar fails to finish, even if the wall clock limit is increased to from 15 mins to 1 hour. Can't tell if it's hung or just runs very slow on gaea.

@CoryMartin-NOAA
Copy link
Contributor

FYI I'm working to add C6 support to GDASApp NOAA-EMC/GDASApp#1323

@guillaumevernieres
Copy link
Contributor

guillaumevernieres commented Oct 11, 2024 via email

@CoryMartin-NOAA
Copy link
Contributor

@guillaumevernieres on Gaea C6 it looks like the compiler is intel-classic/2023.2.0, so perhaps we can see if the memory usage is problematic on these nodes

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature New feature or request triage Issues that are triage
Projects
None yet
Development

No branches or pull requests

4 participants