ROX-31331: Require Collector reporting all processes#17551
Conversation
|
Images are ready for the commit at f5e064e. To use with deploy scripts, first |
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## master #17551 +/- ##
=======================================
Coverage 48.71% 48.72%
=======================================
Files 2724 2724
Lines 202999 202999
=======================================
+ Hits 98893 98904 +11
+ Misses 96352 96342 -10
+ Partials 7754 7753 -1
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
/retest-times 10 ocp-4-19-nongroovy-e2e-tests |
|
/test gke-nongroovy-e2e-tests |
|
/test gke-nongroovy-e2e-tests |
|
/test ocp-4-19-nongroovy-e2e-tests |
|
/test gke-nongroovy-e2e-tests |
1 similar comment
|
/test gke-nongroovy-e2e-tests |
|
/test ocp-4-19-nongroovy-e2e-tests |
|
/test gke-nongroovy-e2e-tests |
|
/test gke-nongroovy-e2e-tests |
1 similar comment
|
/test gke-nongroovy-e2e-tests |
|
/test gke-nongroovy-e2e-tests |
|
/test ocp-4-19-nongroovy-e2e-tests |
|
/test gke-nongroovy-e2e-tests |
1 similar comment
|
/test gke-nongroovy-e2e-tests |
|
/test ocp-4-19-nongroovy-e2e-tests |
|
@vikin91: The following test failed, say
Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
|
/test ocp-4-19-nongroovy-e2e-tests |
2 similar comments
|
/test ocp-4-19-nongroovy-e2e-tests |
|
/test ocp-4-19-nongroovy-e2e-tests |
|
/retest |
Description
Re-enable detection of
/bin/sh,/bin/date, and/bin/sleepprocesses inTestPodandTestContainerInstancesby removing TODO(ROX-31331) workarounds.Problem
Tests
TestPodandTestContainerInstanceswere not checking for/bin/sh,/bin/date, and/bin/sleepprocesses from the Ubuntu container due to unreliable Collector process detection (ROX-31331). The tests had TODO comments indicating these processes should be re-enabled once the underlying issue was resolved.Why it was failing before
We created ROX-31331, because it was easy to say that if something is missing in Central, then most probably the Collector is not reporting something. The faith in Sensor was too strong, because it turned out that Sensor was loosing selected data in the test run because it was restarted shortly before the
TestPodandTestContainerInstanceswere triggered.The issue of Sensor not being ready for test was fixed in #17502.
Changes
Removed TODO(ROX-31331) comments and re-enabled full process detection:
tests/container_instances_test.go:requiredSecondContainerfrom[]string{"/bin/sh"}[]string{"/bin/sh", "/bin/date", "/bin/sleep"}tests/pods_test.go:requiredProcessesfrom[]string{"/usr/sbin/nginx"}[]string{"/usr/sbin/nginx", "/bin/sh", "/bin/date", "/bin/sleep"}Before vs After
Before:
After:
User-facing documentation
Testing and quality
Automated testing
How I validated my change
gke-nongroovy-e2e-testsandocp-4-19-nongroovy-e2e-testsin CI 10 times each, and manually checked the results by making sure thatTestPodandTestContainerInstancesare not failing.