Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GH-44461: [Release][Packacing][Python] Set PARQUET_TEST_DATA on verify-release-candidate-wheels.bat #44462

Merged
merged 13 commits into from
Oct 25, 2024

Conversation

raulcd
Copy link
Member

@raulcd raulcd commented Oct 17, 2024

Rationale for this change

The Windows wheel verification fails due to missing PARQUET_TEST_DATA

What changes are included in this PR?

Add PARQUET_TEST_DATA to verify-release-candidate-wheels.bat which is only tested on the binary verification job.

Are these changes tested?

Via archery

Are there any user-facing changes?

No

Copy link

⚠️ GitHub issue #44461 has been automatically assigned in GitHub to PR creator.

@github-actions github-actions bot added the awaiting committer review Awaiting committer review label Oct 17, 2024
@raulcd
Copy link
Member Author

raulcd commented Oct 17, 2024

@github-actions crossbow submit verify-rc-binaries-wheels-windows --param release=18.0.0 --param rc=0

Copy link

Revision: 6ba7a5a

Submitted crossbow builds: ursacomputing/crossbow @ actions-f980a8883a

Task Status
verify-rc-binaries-wheels-windows GitHub Actions

@raulcd
Copy link
Member Author

raulcd commented Oct 17, 2024

The error has been fixed but there seems to be a new error with ORC and Timezones :( :

 >       result = dataset_reader.to_table(dataset)
C:\tmp\arrow-verify-release-wheels\_verify-wheel-3.9\Lib\site-packages\pyarrow\tests\test_dataset.py:3248: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
C:\tmp\arrow-verify-release-wheels\_verify-wheel-3.9\Lib\site-packages\pyarrow\tests\test_dataset.py:267: in to_table
    return dataset.to_table(**kwargs)
pyarrow\\_dataset.pyx:562: in pyarrow._dataset.Dataset.to_table
    ???
pyarrow\\_dataset.pyx:3841: in pyarrow._dataset.Scanner.to_table
    ???
pyarrow\\error.pxi:155: in pyarrow.lib.pyarrow_internal_check_status
    ???
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
>   ???
E   pyarrow.lib.ArrowException: Unknown error: Time zone file /usr/share/zoneinfo/UTC does not exist. Please install IANA time zone database and set TZDIR env.
pyarrow\\error.pxi:92: ArrowException

@raulcd
Copy link
Member Author

raulcd commented Oct 17, 2024

@github-actions crossbow submit verify-rc-binaries-wheels-windows --param release=18.0.0 --param rc=0

Copy link

Revision: 95b3016

Submitted crossbow builds: ursacomputing/crossbow @ actions-5687ee222c

Task Status
verify-rc-binaries-wheels-windows GitHub Actions

@rem Download IANA Timezone Database for ORC C++
curl https://cygwin.osuosl.org/noarch/release/tzdata/tzdata-2024a-1.tar.xz --output tzdata.tar.xz || exit /B
mkdir %USERPROFILE%\Downloads\test\tzdata
arc unarchive tzdata.tar.xz %USERPROFILE%\Downloads\test\tzdata
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

arc does not seem to be present:

'arc' is not recognized as an internal or external command,
operable program or batch file.

There's some more work to be done here :(

@github-actions github-actions bot added awaiting changes Awaiting changes and removed awaiting committer review Awaiting committer review labels Oct 17, 2024
@kou
Copy link
Member

kou commented Oct 17, 2024

@github-actions crossbow submit verify-rc-binaries-wheels-windows --param release=18.0.0 --param rc=0

@github-actions github-actions bot added awaiting change review Awaiting change review awaiting changes Awaiting changes and removed awaiting changes Awaiting changes awaiting change review Awaiting change review labels Oct 17, 2024

This comment was marked as outdated.

@kou
Copy link
Member

kou commented Oct 17, 2024

@github-actions crossbow submit verify-rc-binaries-wheels-windows --param release=18.0.0 --param rc=0

This comment was marked as outdated.

@github-actions github-actions bot added awaiting change review Awaiting change review and removed awaiting changes Awaiting changes labels Oct 18, 2024
@kou
Copy link
Member

kou commented Oct 18, 2024

@github-actions crossbow submit verify-rc-binaries-wheels-windows --param release=18.0.0 --param rc=0

Copy link

Revision: d43d971

Submitted crossbow builds: ursacomputing/crossbow @ actions-2851110c41

Task Status
verify-rc-binaries-wheels-windows GitHub Actions

@kou
Copy link
Member

kou commented Oct 18, 2024

@github-actions crossbow submit verify-rc-binaries-wheels-windows --param release=18.0.0 --param rc=0

Copy link

Revision: acf0493

Submitted crossbow builds: ursacomputing/crossbow @ actions-628dc0a01d

Task Status
verify-rc-binaries-wheels-windows GitHub Actions

@kou
Copy link
Member

kou commented Oct 18, 2024

@github-actions crossbow submit verify-rc-binaries-wheels-windows --param release=18.0.0 --param rc=0

Copy link

Revision: 6c3e4a5

Submitted crossbow builds: ursacomputing/crossbow @ actions-c2ac566e6f

Task Status
verify-rc-binaries-wheels-windows GitHub Actions

@kou
Copy link
Member

kou commented Oct 18, 2024

@github-actions crossbow submit verify-rc-binaries-wheels-windows --param release=18.0.0 --param rc=0

Copy link

Revision: a753b3b

Submitted crossbow builds: ursacomputing/crossbow @ actions-55cf78c3ae

Task Status
verify-rc-binaries-wheels-windows GitHub Actions

@kou
Copy link
Member

kou commented Oct 18, 2024

@github-actions crossbow submit verify-rc-binaries-wheels-windows --param release=18.0.0 --param rc=0

@kou
Copy link
Member

kou commented Oct 18, 2024

It seems that we need to update bundled datetime...

@raulcd
Copy link
Member Author

raulcd commented Oct 24, 2024

@github-actions crossbow submit verify-rc-binaries-wheels-windows --param release=18.0.0 --param rc=0

Copy link

Revision: 7200add

Submitted crossbow builds: ursacomputing/crossbow @ actions-2790dff4a5

Task Status
verify-rc-binaries-wheels-windows GitHub Actions

@raulcd
Copy link
Member Author

raulcd commented Oct 24, 2024

oh! yeah! I'm not sure why I run that, the change won't be there :)

@kou
Copy link
Member

kou commented Oct 24, 2024

Yes. I did the same mistake.

GH-44482 isn't included in 18.0.0 RC0. So this is still failing. But the PARQUET_TEST_DATA problem is fixed by this. So I think that we can merge this.

@raulcd raulcd marked this pull request as ready for review October 25, 2024 10:39
Copy link
Member Author

@raulcd raulcd left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am 👍 with merging this, just a small question

@@ -29,6 +29,7 @@ jobs:
{{ key }}: {{ value }}
{% endfor %}
{% endif %}
timeout-minutes: 60
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am not against setting the timeout, any specific reason to add it? Those jobs only run on verifying binaries at the moment and it would be unfortunate to start hitting the timeout there

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some jobs such as https://github.com/ursacomputing/crossbow/actions/runs/11393132955/job/31700762815 got stuck when I was working on this. I canceled them manually because we don't have the timeout.

I think that we should have a reasonable timeout to avoid wasting CI time (even when we don't pay money for it). If "60" isn't enough, we can use more long time such as "120". It's better than the default "360".

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@github-actions github-actions bot added awaiting changes Awaiting changes and removed awaiting change review Awaiting change review labels Oct 25, 2024
@raulcd raulcd merged commit 1b40800 into apache:main Oct 25, 2024
19 checks passed
@raulcd raulcd removed the awaiting changes Awaiting changes label Oct 25, 2024
@raulcd raulcd deleted the GH-44461 branch October 25, 2024 14:49
Copy link

After merging your PR, Conbench analyzed the 3 benchmarking runs that have been run so far on merge-commit 1b40800.

There were no benchmark performance regressions. 🎉

The full Conbench report has more details. It also includes information about 5 possible false positives for unstable benchmarks that are known to sometimes produce them.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants