-
Notifications
You must be signed in to change notification settings - Fork 1.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Microsoft.ML.TorchSharp.Tests.QATests.TestSimpleQA followed by process killed / return 137 #6978
Labels
blocking-clean-ci
Blocking PR or rolling builds
bug
Something isn't working
Known Build Error
Use this to report build issues in the .NET Helix tab
untriaged
New issue has not been triaged
Comments
ericstj
added
bug
Something isn't working
blocking-clean-ci
Blocking PR or rolling builds
labels
Jan 30, 2024
@michaelgsharp made a good observation offline - we're seeing memory usage go up quite a bit as the tests progress.
That's using 2GB memory after the previous test completed. |
Wow - the memory usage of this test is very high. Here's what I see from a local passing run on Windows.
So we may have some leak (this still shows growth) but we also are using a ton of memory when running this test. |
ericstj
added
the
Known Build Error
Use this to report build issues in the .NET Helix tab
label
Feb 7, 2024
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
blocking-clean-ci
Blocking PR or rolling builds
bug
Something isn't working
Known Build Error
Use this to report build issues in the .NET Helix tab
untriaged
New issue has not been triaged
Build Information
Build: https://dev.azure.com/dnceng-public/public/_build/results?buildId=530980&view=results
Build error leg or test failing: Microsoft.ML.TorchSharp.Tests Work Item
Pull Request #6976
Error Message
Fill the error message using step by step known issues guidance.
System Information (please complete the following information):
Describe the bug
This test is failing in CI somewhat regularly. The error pattern looks like the following:
Here are a few instances:
https://helixre107v0xd1eu3ibi6ka.blob.core.windows.net/dotnet-machinelearning-refs-pull-6974-merge-f61a125156aa4af1bd/Microsoft.ML.TorchSharp.Tests/1/console.83a6fa6c.log?helixlogtype=result
https://helixre107v0xdeko0k025g8.blob.core.windows.net/dotnet-machinelearning-refs-pull-6976-merge-0a13c2cd41724c3483/Microsoft.ML.TorchSharp.Tests/1/console.ff57f777.log?helixlogtype=result
I can't currently capture this failure in a known issue because there is no unique line logged. I've seen this failure numerous times - always when
TestSimpleQA
is running.Report
Summary
Known issue validation
Build: 🔎⚠️ Build internal information not found. This may happen if your build is too old. Please use a build that is no older than two weeks. If the problem persists, contact .NET Engineering Services Team and share this issue.
Result validation:
Validation performed at: 2/14/2024 10:25:46 PM UTC
The text was updated successfully, but these errors were encountered: