Replace 2018 nonHRV satellite Zarr (NaNs for most data) #224

Open
jacobbieker opened this issue Feb 8, 2024 · 10 comments

@jacobbieker
Member

No description provided.

@jacobbieker jacobbieker self-assigned this Feb 8, 2024
@peterdudfield
Collaborator

We need to add a bit of code that picks a time that is not already on HF.
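
A minimal sketch of that selection step, assuming the timestamps already uploaded to HF have been collected into a set. The names `candidate_times` and `pick_missing_time` and the 5-minute step are illustrative assumptions, not the code that was actually added:

```python
from datetime import datetime, timedelta
from typing import Iterable, Iterator, Optional, Set


def candidate_times(start: datetime, end: datetime, step: timedelta) -> Iterator[datetime]:
    """Yield evenly spaced timestamps in the half-open range [start, end)."""
    current = start
    while current < end:
        yield current
        current += step


def pick_missing_time(
    candidates: Iterable[datetime], already_uploaded: Set[datetime]
) -> Optional[datetime]:
    """Return the first candidate that is not already uploaded, or None if all are present."""
    for timestamp in candidates:
        if timestamp not in already_uploaded:
            return timestamp
    return None


# Hypothetical usage: a 5-minute grid over the first hour of 2018 (assumed cadence).
start = datetime(2018, 1, 1)
grid = candidate_times(start, start + timedelta(hours=1), timedelta(minutes=5))
next_time = pick_missing_time(grid, already_uploaded={start})
```

How `already_uploaded` gets populated (for example, by listing the files in the HF dataset repo and parsing timestamps from their names) depends on the repo layout and is left out here.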

@jacobbieker
Member Author

That's just been added.

@peterdudfield
Collaborator

Thanks, that was quick.

@peterdudfield
Collaborator

As of today, 46,000 of 95,000 are done.
The rate seems to be about 30,000 a month, so it will take roughly another 6 weeks.
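
As a rough check of that estimate, the remaining time can be computed from the three numbers quoted above. This is only a sketch; `months_remaining` is a hypothetical helper and it assumes the rate stays constant:

```python
def months_remaining(done: int, total: int, rate_per_month: float) -> float:
    """Estimate the months left at a constant processing rate."""
    if rate_per_month <= 0:
        raise ValueError("rate_per_month must be positive")
    # Clamp at zero in case more items are done than the stated total.
    return max(total - done, 0) / rate_per_month


remaining = months_remaining(done=46_000, total=95_000, rate_per_month=30_000)
print(f"{remaining:.1f} months remaining")  # prints "1.6 months remaining"
```

That is 49,000 remaining at 30,000 a month, which is in the same ballpark as the estimate above.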

@zakwatts

zakwatts commented Jul 8, 2024

@devsjc Is there an update on this? I'm looking to update the GCP disks for a new pvnet backtest and thought it would be good to update the 2018 data as well: https://github.com/orgs/openclimatefix/projects/33/views/8?filterQuery=assignee%3Azakwatts&pane=issue&itemId=60827379

@devsjc
Contributor

devsjc commented Jul 9, 2024

This isn't a backfill I was aware of. It was probably being run via Jacob's login on a VM, so it's not something I can easily track. Going off Peter's described cadence alone, it sounds like it should have finished a few months ago?

@zakwatts

zakwatts commented Jul 9, 2024

Thanks anyway, @devsjc. @peterdudfield Do you know where this data is stored? I've checked the Google Storage and it's not updated for 2018 yet (still NaNs). I can see that it gets uploaded to Hugging Face and is there: https://huggingface.co/datasets/openclimatefix/eumetsat-rss/tree/main/data/2018. I'm guessing there's an upload script to get it onto the Google Storage that we might need to run?

@zakwatts

zakwatts commented Jul 9, 2024

@devsjc Does the "gcp-sat-update" VM upload data directly to HF or to the public Google Storage?

@jacobbieker
Member Author

Yeah, I was running it in the VM, and I think it's on the disk attached there. I'm not sure if it finished. The HF data might be good, but it might also be NaNs, so you'd need to check that.
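
A minimal sketch of such a check, assuming the values for a given channel and time range have already been loaded into an iterable of floats. `nan_fraction` is a hypothetical helper, and how the values are read from the Zarr store is not shown:

```python
import math
from typing import Iterable


def nan_fraction(values: Iterable[float]) -> float:
    """Return the share of values that are NaN (0.0 for an empty input)."""
    total = 0
    nans = 0
    for value in values:
        total += 1
        if math.isnan(value):
            nans += 1
    return nans / total if total else 0.0


# Hypothetical sample: half of the values are missing.
sample = [0.12, float("nan"), 0.34, float("nan")]
print(nan_fraction(sample))  # prints 0.5
```

For full-size arrays the same ratio would normally be computed with vectorised array operations rather than a Python loop; the loop here just keeps the sketch dependency-free.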

@zakwatts

zakwatts commented Jul 9, 2024

Thanks @jacobbieker

@peterdudfield peterdudfield assigned devsjc and unassigned jacobbieker Jul 9, 2024

4 participants