ROX-33621: Refactor VM host probing and diagnostics by vikin91 · Pull Request #19458 · stackrox/stackrox

vikin91 · 2026-03-17T13:03:32Z

Description

This PR refactors VM roxagent discovery/diagnostics to share host probing logic and reduces noisy/high-cardinality diagnostics output.

Introduces compliance/virtualmachines/roxagent/internal/hostprobe as a shared probe layer for host-path resolution, DNF version detection, entitlement pair detection, and .repo counting.
Reworks roxagent/index diagnostics to use compact operational logs (DNF version summary, entitlement status, yum .repo count, and repo2cpe fetch diagnostics with clear failure context).
Adds --debug-gated debug logger setup for roxagent diagnostics.
Keeps backward-compatible Sensor metric labels (dnf_metadata_status) and moves granular DNF state to a separate low-cardinality metric (virtual_machine_discovered_data_dnf_status_total).
Expands unit coverage for host probing edge-cases (missing/unreadable paths, symlink edge-case for DNF version detection, entitlement pair matching semantics).

AI/user contribution note:

AI-generated: most refactoring scaffolding (shared hostprobe package extraction, diagnostics rewiring, metric split, and initial/expanded test scaffolding).
User-authored and corrected: logging contract decisions, success/warn behavior expectations, error-context requirements, and final review-driven corrections.

User-facing documentation

CHANGELOG.md is updated OR update is not needed
documentation PR is created and is linked above OR is not needed

Testing and quality

the change is production ready: the change is GA, or otherwise the functionality is gated by a feature flag
CI results are inspected

Automated testing

How I validated my change

Unit tests locally and in CI
Manually by running the agent on a VM:

[cloud-user@rhel9-1 ~]$ sudo /home/cloud-user/vm-agent-amd64 --port 818 --host-path /tmp/roxroot

virtualmachines/roxagent/index: 2026/03/17 15:34:32.588276 index.go:78: Info: Indexer config: HostPath="/tmp/roxroot", Repo2CPEMappingURL="https://security.access.redhat.com/data/metrics/repository-to-cpe.json", Timeout=30s
virtualmachines/roxagent/index: 2026/03/17 15:34:32.610766 diagnostics.go:26: Info: DNF v4 found                                                                           
virtualmachines/roxagent/index: 2026/03/17 15:34:32.616007 diagnostics.go:37: Info: Entitlement certificates found                                                         
virtualmachines/roxagent/index: 2026/03/17 15:34:32.624617 diagnostics.go:45: Info: Yum repo dir contains 1 repositories                                                   
virtualmachines/roxagent/index: 2026/03/17 15:34:32.627451 diagnostics.go:78: Info: Attempting to fetch repo2cpe mapping for diagnostics...                               
virtualmachines/roxagent/index: 2026/03/17 15:34:33.148869 diagnostics.go:105: Info: Repo2CPE mapping downloaded successfully                                              
virtualmachines/roxagent/index: 2026/03/17 15:34:33.151060 index.go:85: Info: Running node indexer...                                                                      

## Produced by clair-core
{"level":"warn","component":"rhel/PackageScanner.Scan","db":"sqlite:var/lib/rpm","layer":"sha256:aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa","version":"1","see-also":["https://github.com/rpm-software-management/rpm/issues/2796","https://github.com/rpm-software-management/rpm/discussions/3703","https://github.com/rpm-software-management/rpm/pull/3755"],"message":"rpm source packages always record 0 epoch; this may cause incorrect matching"}                                                

virtualmachines/roxagent/index: 2026/03/17 15:34:47.380907 diagnostics.go:56: Info: Index report summary: packages=512, repositories=2, distributions=0, environments=512  
virtualmachines/roxagent/index: 2026/03/17 15:34:47.383307 diagnostics.go:60: Info:   repo id=rhel-9-for-x86_64-appstream-rpms name="rhel-9-for-x86_64-appstream-rpms" key="rhel-cpe-repository" cpe="cpe:2.3:a:redhat:enterprise_linux:9:*:appstream:*:*:*:*:*"                                                                                      
virtualmachines/roxagent/index: 2026/03/17 15:34:47.385358 diagnostics.go:60: Info:   repo id=rhel-9-for-x86_64-baseos-rpms name="rhel-9-for-x86_64-baseos-rpms" key="rhel-cpe-repository" cpe="cpe:2.3:o:redhat:enterprise_linux:9:*:baseos:*:*:*:*:*"                                                                                              
virtualmachines/roxagent/vsock: 2026/03/17 15:34:47.425400 client.go:82: Info: Sent message with index report containing 512 packages, 2 repositories, and 512 environments

[cloud-user@rhel9-1 ~]$

Sensor logs:

sensor-58fcd95566-dklvw common/virtualmachine/index: 2026/03/17 14:46:32.506379 service_impl.go:74: Info: VM discovered data: detected_os=RHEL, os_version="9.7", activation_status=ACTIVE, dnf_status=[DNF_REPO_CONFIG_PRESENT, DNF_CACHE_PRESENT, DNF_HISTORY_DB_PRESENT, DNF_VERSION_4]

vikin91 · 2026-03-17T13:03:33Z

This change is part of the following stack:

ROX-33621: Refactor VM host probing and diagnostics #19458 ◀

_{Change managed by git-spice.}

openshift-ci · 2026-03-17T13:03:37Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

sourcery-ai

Hey - I've left some high level feedback:

The new diagnostics in logIndexReportDiagnostics log every repository and distribution unconditionally; consider gating the per-repo/dist lines (and possibly the filesystem diagnostics) behind the --debug flag or similar to avoid very verbose logs on large systems.
In logFilesystemDiagnostics, YumRepoCount treating a missing repo directory the same as an unreadable one results in a generic Yum repo dir unreadable message; it may be more actionable to distinguish os.IsNotExist from permission errors in both the helper and the log message.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- The new diagnostics in `logIndexReportDiagnostics` log every repository and distribution unconditionally; consider gating the per-repo/dist lines (and possibly the filesystem diagnostics) behind the `--debug` flag or similar to avoid very verbose logs on large systems.
- In `logFilesystemDiagnostics`, `YumRepoCount` treating a missing repo directory the same as an unreadable one results in a generic `Yum repo dir unreadable` message; it may be more actionable to distinguish `os.IsNotExist` from permission errors in both the helper and the log message.

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

rhacs-bot · 2026-03-17T13:27:47Z

Images are ready for the commit at 2f59bb3.

To use with deploy scripts, first export MAIN_IMAGE_TAG=4.11.x-350-g2f59bb3981.

codecov · 2026-03-17T14:27:24Z

Codecov Report

❌ Patch coverage is 80.51948% with 30 lines in your changes missing coverage. Please review.
✅ Project coverage is 49.33%. Comparing base (d0b539b) to head (2f59bb3).
⚠️ Report is 11 commits behind head on master.

Files with missing lines	Patch %	Lines
...liance/virtualmachines/roxagent/vsock/discovery.go	75.71%	15 Missing and 2 partials ⚠️
sensor/common/virtualmachine/index/service_impl.go	63.63%	8 Missing ⚠️
...rtualmachines/roxagent/internal/hostprobe/probe.go	91.37%	3 Missing and 2 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #19458      +/-   ##
==========================================
- Coverage   49.56%   49.33%   -0.23%     
==========================================
  Files        2711     2723      +12     
  Lines      204849   205270     +421     
==========================================
- Hits       101524   101268     -256     
- Misses      95744    96465     +721     
+ Partials     7581     7537      -44

Flag	Coverage Δ
go-unit-tests	`49.33% <80.51%> (-0.23%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

vikin91 · 2026-03-17T16:43:57Z

@sourcery-ai recheck

sourcery-ai · 2026-03-17T16:44:06Z

Sure! I'm generating a new review now.

sourcery-ai

Hey - I've found 3 issues, and left some high level feedback:

In discoverDnfStatusFlags, consider reusing hostprobe.DetectDNFVersion instead of re-deriving DNF version from history DB paths to keep version-detection logic centralized and avoid future drift.
In discoverDnfCacheRepoDirPresence, the DNF5 detection logic relies on strings.Contains(cachePath, "/libdnf5"); it would be more robust to infer DNF5 based on the specific cache root being iterated (e.g., the input path) rather than substring matching on the full path.

Prompt for AI Agents

Please address the comments from this code review:

## Overall Comments
- In `discoverDnfStatusFlags`, consider reusing `hostprobe.DetectDNFVersion` instead of re-deriving DNF version from history DB paths to keep version-detection logic centralized and avoid future drift.
- In `discoverDnfCacheRepoDirPresence`, the DNF5 detection logic relies on `strings.Contains(cachePath, "/libdnf5")`; it would be more robust to infer DNF5 based on the specific cache root being iterated (e.g., the input path) rather than substring matching on the full path.

## Individual Comments

### Comment 1
<location path="compliance/virtualmachines/roxagent/index/index.go" line_range="78-85" />
<code_context>
+		indexerCfg.HostPath, indexerCfg.Repo2CPEMappingURL, indexerCfg.Timeout)
+
+	// This may slow the indexing process down by 1-2 seconds, but the diagnostics are invaluable for debugging.
+	logFilesystemDiagnostics(indexerCfg.HostPath)
+	fetchRepo2CPEMappingForDiagnostics(ctx, indexerCfg.Repo2CPEMappingURL, indexerCfg.Timeout, httpClient)
+
+	log.Info("Running node indexer...")
</code_context>
<issue_to_address>
**suggestion (performance):** Diagnostics and repo2cpe fetch run unconditionally and may add network/latency overhead for every index run

`logFilesystemDiagnostics` and `fetchRepo2CPEMappingForDiagnostics` now run on every `runIndexer` call, even when not debugging. The repo2cpe call adds a network round-trip and full-body read, which can materially slow indexing and add external coupling per run. Please gate these behind `cfg.Debug` (or a dedicated diagnostics flag), or at least make the repo2cpe fetch conditional, so normal runs avoid this overhead and only pay the cost when troubleshooting.

```suggestion
	log.Infof("Indexer config: HostPath=%q, Repo2CPEMappingURL=%q, Timeout=%v",
		indexerCfg.HostPath, indexerCfg.Repo2CPEMappingURL, indexerCfg.Timeout)

	if cfg.Debug {
		// These diagnostics may slow the indexing process down by 1-2 seconds due to filesystem inspection
		// and an extra repo2cpe network round-trip, so run them only in debug mode.
		logFilesystemDiagnostics(indexerCfg.HostPath)
		fetchRepo2CPEMappingForDiagnostics(ctx, indexerCfg.Repo2CPEMappingURL, indexerCfg.Timeout, httpClient)
	}

	log.Info("Running node indexer...")
```
</issue_to_address>

### Comment 2
<location path="compliance/virtualmachines/roxagent/vsock/discovery.go" line_range="276" />
<code_context>
-			return false, fmt.Errorf("unsupported OS detected: missing %s: %w", cachePath, err)
-		}
-		return false, fmt.Errorf("reading %s: %w", cachePath, err)
+func discoverDnfCacheRepoDirPresence(hostPath string, cacheDirPaths []string) (hasDir, hasRepo bool, err error) {
+	if len(cacheDirPaths) == 0 {
+		return false, false, errors.New("missing cache directories")
</code_context>
<issue_to_address>
**suggestion (bug_risk):** DNF5 detection via `strings.Contains(cachePath, "/libdnf5")` is brittle and may misclassify future layouts

This logic tightly couples behavior to the literal path string and may fail if cache roots change (different prefixes, symlinks) or if an unrelated path happens to contain `libdnf5`. Since the caller already passes explicit `cacheDirPaths`, consider deciding DNF5 vs non-DNF based on which configured root you’re iterating (e.g., compare against a known constant or index) rather than substring-matching the host-adjusted path, so the behavior remains stable if paths change.

Suggested implementation:

```golang
const dnf5CacheRoot = "/var/cache/libdnf5"

//   - Tested against DNF4 defaults; DNF5 uses /var/cache/libdnf5:
//     https://github.com/rpm-software-management/dnf5/blob/185eaef1e0ad663bdb827a2179ab1df574a27d88/include/libdnf5/conf/const.hpp
func discoverDnfCacheRepoDirPresence(hostPath string, cacheDirPaths []string) (hasDir, hasRepo bool, err error) {

```

```golang
	for _, cacheDirPath := range cacheDirPaths {
		// Decide DNF5 vs non-DNF based on the configured cache root, not the host-adjusted path.
		// This avoids coupling behavior to the literal host path and keeps detection stable if
		// host path prefixes or layouts change.
		isDnf5Root := cacheDirPath == dnf5CacheRoot

		cachePath := hostprobe.HostPathFor(hostPath, cacheDirPath)

```

```golang
		if isDnf5Root {

```
</issue_to_address>

### Comment 3
<location path="compliance/virtualmachines/roxagent/index/diagnostics.go" line_range="23" />
<code_context>
+	zlog.Set(&l)
+}
+
+func logFilesystemDiagnostics(hostPath string) {
+	switch hostprobe.DetectDNFVersion(hostPath) {
+	case hostprobe.DNFVersion5:
</code_context>
<issue_to_address>
**issue (complexity):** Consider extracting small helpers for error handling, repeated logging patterns, and HTTP body checks to keep these diagnostics functions linear and less repetitive without changing behavior.

You can reduce the added complexity without changing behavior by extracting a few tiny helpers and reusing patterns.

### 1. Factor out Yum repo error classification

`logFilesystemDiagnostics` mixes error classification and logging; a small helper keeps the main flow linear:

```go
func logYumRepoError(err error) {
	switch {
	case os.IsNotExist(err):
		log.Info("Yum repo dir not found")
	case errors.Is(err, fs.ErrPermission):
		log.Warnf("Yum repo dir is not readable: %v", err)
	default:
		log.Infof("Yum repo dir is unavailable: %v", err)
	}
}

func logFilesystemDiagnostics(hostPath string) {
	switch hostprobe.DetectDNFVersion(hostPath) {
	case hostprobe.DNFVersion5:
		log.Info("DNF history DB (v5) found")
	case hostprobe.DNFVersion4:
		log.Info("DNF history DB (v4) found")
	default:
		log.Warn("DNF history DB not found!")
	}

	hasEntitlement, err := hostprobe.HasEntitlementCertKeyPair(hostPath)
	if err != nil {
		log.Warnf("Entitlement certificates not found: %v", err)
	} else if !hasEntitlement {
		log.Warn("Entitlement certificates not found")
	} else {
		log.Info("Entitlement certificates found")
	}

	repoCount, err := hostprobe.YumRepoCount(hostPath)
	if err != nil {
		logYumRepoError(err)
		return
	}
	if repoCount == 0 {
		log.Info("Yum repo dir is present but empty (0 repositories)")
		return
	}
	log.Infof("Yum repo dir contains %d repositories", repoCount)
}
```

### 2. Reduce repetition in `logIndexReportDiagnostics`

The repo/dist loops differ only in formatting. You can encapsulate the “limited iteration with truncation” pattern:

```go
func logLimited[T any, ID comparable](
	name string,
	items map[ID]T,
	limit int,
	debug bool,
	formatter func(idx, total int, id ID, item T) string,
) {
	total := len(items)
	idx := 0
	for id, item := range items {
		idx++
		log.Info(formatter(idx, total, id, item))
		if !debug && idx == limit {
			log.Info("  (truncated for brevity)")
			break
		}
	}
}

func logIndexReportDiagnostics(report *v4.IndexReport, debug bool) {
	contents := report.GetContents()

	numPkgs := len(contents.GetPackages())
	numRepos := len(contents.GetRepositories())
	numDists := len(contents.GetDistributions())
	numEnvs := len(contents.GetEnvironments())

	log.Infof("Index report summary: packages=%d, repositories=%d, distributions=%d, environments=%d",
		numPkgs, numRepos, numDists, numEnvs)

	logLimited("repositories", contents.GetRepositories(), 10, debug,
		func(idx, total int, id string, repo *v4.Repository) string {
			return fmt.Sprintf(
				"Repository (%d of %d) id=%q name=%q key=%q cpe=%q",
				idx, total, id, repo.GetName(), repo.GetKey(), repo.GetCpe(),
			)
		})

	logLimited("distributions", contents.GetDistributions(), 10, debug,
		func(idx, total int, id string, dist *v4.Distribution) string {
			return fmt.Sprintf(
				"Distribution (%d of %d) id=%s name=%q version=%q cpe=%q did=%q",
				idx, total, id, dist.GetName(), dist.GetVersion(), dist.GetCpe(), dist.GetDid(),
			)
		})

	if numRepos == 0 {
		log.Warn("Index report contains 0 repositories. Packages will be marked as UNSCANNED.")
	}
}
```

If you prefer smaller steps, you can start with a non-generic helper just for the truncation logic and keep two separate formatters.

### 3. Simplify `fetchRepo2CPEMappingForDiagnostics` body handling

You only care about “non-empty vs empty/unavailable”, so you don't need `io.ReadAll` and `len(body)`. A single-byte read via `LimitedReader` reduces IO and branches:

```go
func fetchRepo2CPEMappingForDiagnostics(ctx context.Context, mappingURL string, timeout time.Duration, client *http.Client) {
	if mappingURL == "" {
		log.Warn("Repo2CPE mapping URL is empty")
		return
	}
	log.Info("Attempting to fetch repo2cpe mapping for diagnostics...")

	reqCtx, cancel := context.WithTimeout(ctx, timeout)
	defer cancel()

	req, err := http.NewRequestWithContext(reqCtx, http.MethodGet, mappingURL, nil)
	if err != nil {
		log.Warnf("Could not create repo2cpe mapping request for %q: %v", mappingURL, err)
		return
	}

	resp, err := client.Do(req)
	if err != nil {
		log.Warnf("Could not download repo2cpe mapping from %q: %v", mappingURL, err)
		return
	}
	defer resp.Body.Close()

	var lr io.LimitedReader
	lr.R = resp.Body
	lr.N = 1
	n, _ := io.Copy(io.Discard, &lr) // n is 0 if body is empty or not readable

	log.Debugf("Repo2CPE mapping fetch: status=%d, hasBody=%t", resp.StatusCode, n > 0)

	if resp.StatusCode >= http.StatusBadRequest || n == 0 {
		log.Warn("Repo2CPE mapping is unavailable or empty")
	} else if resp.StatusCode == http.StatusOK && n > 0 {
		log.Info("Repo2CPE mapping downloaded successfully")
	}
}
```

This preserves the logging semantics (success only on 200 + non-empty body, warn otherwise) while reducing control flow and memory usage.
</issue_to_address>

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

sourcery-ai · 2026-03-17T16:47:32Z

compliance/virtualmachines/roxagent/index/index.go

+	log.Infof("Indexer config: HostPath=%q, Repo2CPEMappingURL=%q, Timeout=%v",
+		indexerCfg.HostPath, indexerCfg.Repo2CPEMappingURL, indexerCfg.Timeout)
+
+	// This may slow the indexing process down by 1-2 seconds, but the diagnostics are invaluable for debugging.
+	logFilesystemDiagnostics(indexerCfg.HostPath)
+	fetchRepo2CPEMappingForDiagnostics(ctx, indexerCfg.Repo2CPEMappingURL, indexerCfg.Timeout, httpClient)
+
+	log.Info("Running node indexer...")


suggestion (performance): Diagnostics and repo2cpe fetch run unconditionally and may add network/latency overhead for every index run

logFilesystemDiagnostics and fetchRepo2CPEMappingForDiagnostics now run on every runIndexer call, even when not debugging. The repo2cpe call adds a network round-trip and full-body read, which can materially slow indexing and add external coupling per run. Please gate these behind cfg.Debug (or a dedicated diagnostics flag), or at least make the repo2cpe fetch conditional, so normal runs avoid this overhead and only pay the cost when troubleshooting.

Suggested change

log.Infof("Indexer config: HostPath=%q, Repo2CPEMappingURL=%q, Timeout=%v",

indexerCfg.HostPath, indexerCfg.Repo2CPEMappingURL, indexerCfg.Timeout)

// This may slow the indexing process down by 1-2 seconds, but the diagnostics are invaluable for debugging.

logFilesystemDiagnostics(indexerCfg.HostPath)

fetchRepo2CPEMappingForDiagnostics(ctx, indexerCfg.Repo2CPEMappingURL, indexerCfg.Timeout, httpClient)

log.Info("Running node indexer...")

log.Infof("Indexer config: HostPath=%q, Repo2CPEMappingURL=%q, Timeout=%v",

indexerCfg.HostPath, indexerCfg.Repo2CPEMappingURL, indexerCfg.Timeout)

if cfg.Debug {

// These diagnostics may slow the indexing process down by 1-2 seconds due to filesystem inspection

// and an extra repo2cpe network round-trip, so run them only in debug mode.

logFilesystemDiagnostics(indexerCfg.HostPath)

fetchRepo2CPEMappingForDiagnostics(ctx, indexerCfg.Repo2CPEMappingURL, indexerCfg.Timeout, httpClient)

}

log.Info("Running node indexer...")

sourcery-ai · 2026-03-17T16:47:32Z

compliance/virtualmachines/roxagent/vsock/discovery.go

-			return false, fmt.Errorf("unsupported OS detected: missing %s: %w", cachePath, err)
-		}
-		return false, fmt.Errorf("reading %s: %w", cachePath, err)
+func discoverDnfCacheRepoDirPresence(hostPath string, cacheDirPaths []string) (hasDir, hasRepo bool, err error) {


suggestion (bug_risk): DNF5 detection via strings.Contains(cachePath, "/libdnf5") is brittle and may misclassify future layouts

This logic tightly couples behavior to the literal path string and may fail if cache roots change (different prefixes, symlinks) or if an unrelated path happens to contain libdnf5. Since the caller already passes explicit cacheDirPaths, consider deciding DNF5 vs non-DNF based on which configured root you’re iterating (e.g., compare against a known constant or index) rather than substring-matching the host-adjusted path, so the behavior remains stable if paths change.

Suggested implementation:

const dnf5CacheRoot = "/var/cache/libdnf5" // - Tested against DNF4 defaults; DNF5 uses /var/cache/libdnf5: // https://github.com/rpm-software-management/dnf5/blob/185eaef1e0ad663bdb827a2179ab1df574a27d88/include/libdnf5/conf/const.hpp func discoverDnfCacheRepoDirPresence(hostPath string, cacheDirPaths []string) (hasDir, hasRepo bool, err error) {

for _, cacheDirPath := range cacheDirPaths { // Decide DNF5 vs non-DNF based on the configured cache root, not the host-adjusted path. // This avoids coupling behavior to the literal host path and keeps detection stable if // host path prefixes or layouts change. isDnf5Root := cacheDirPath == dnf5CacheRoot cachePath := hostprobe.HostPathFor(hostPath, cacheDirPath)

if isDnf5Root {

sourcery-ai · 2026-03-17T16:47:32Z

compliance/virtualmachines/roxagent/index/diagnostics.go

+	zlog.Set(&l)
+}
+
+func logFilesystemDiagnostics(hostPath string) {


issue (complexity): Consider extracting small helpers for error handling, repeated logging patterns, and HTTP body checks to keep these diagnostics functions linear and less repetitive without changing behavior.

You can reduce the added complexity without changing behavior by extracting a few tiny helpers and reusing patterns.

1. Factor out Yum repo error classification

logFilesystemDiagnostics mixes error classification and logging; a small helper keeps the main flow linear:

func logYumRepoError(err error) { switch { case os.IsNotExist(err): log.Info("Yum repo dir not found") case errors.Is(err, fs.ErrPermission): log.Warnf("Yum repo dir is not readable: %v", err) default: log.Infof("Yum repo dir is unavailable: %v", err) } } func logFilesystemDiagnostics(hostPath string) { switch hostprobe.DetectDNFVersion(hostPath) { case hostprobe.DNFVersion5: log.Info("DNF history DB (v5) found") case hostprobe.DNFVersion4: log.Info("DNF history DB (v4) found") default: log.Warn("DNF history DB not found!") } hasEntitlement, err := hostprobe.HasEntitlementCertKeyPair(hostPath) if err != nil { log.Warnf("Entitlement certificates not found: %v", err) } else if !hasEntitlement { log.Warn("Entitlement certificates not found") } else { log.Info("Entitlement certificates found") } repoCount, err := hostprobe.YumRepoCount(hostPath) if err != nil { logYumRepoError(err) return } if repoCount == 0 { log.Info("Yum repo dir is present but empty (0 repositories)") return } log.Infof("Yum repo dir contains %d repositories", repoCount) }

2. Reduce repetition in logIndexReportDiagnostics

The repo/dist loops differ only in formatting. You can encapsulate the “limited iteration with truncation” pattern:

func logLimited[T any, ID comparable]( name string, items map[ID]T, limit int, debug bool, formatter func(idx, total int, id ID, item T) string, ) { total := len(items) idx := 0 for id, item := range items { idx++ log.Info(formatter(idx, total, id, item)) if !debug && idx == limit { log.Info(" (truncated for brevity)") break } } } func logIndexReportDiagnostics(report *v4.IndexReport, debug bool) { contents := report.GetContents() numPkgs := len(contents.GetPackages()) numRepos := len(contents.GetRepositories()) numDists := len(contents.GetDistributions()) numEnvs := len(contents.GetEnvironments()) log.Infof("Index report summary: packages=%d, repositories=%d, distributions=%d, environments=%d", numPkgs, numRepos, numDists, numEnvs) logLimited("repositories", contents.GetRepositories(), 10, debug, func(idx, total int, id string, repo *v4.Repository) string { return fmt.Sprintf( "Repository (%d of %d) id=%q name=%q key=%q cpe=%q", idx, total, id, repo.GetName(), repo.GetKey(), repo.GetCpe(), ) }) logLimited("distributions", contents.GetDistributions(), 10, debug, func(idx, total int, id string, dist *v4.Distribution) string { return fmt.Sprintf( "Distribution (%d of %d) id=%s name=%q version=%q cpe=%q did=%q", idx, total, id, dist.GetName(), dist.GetVersion(), dist.GetCpe(), dist.GetDid(), ) }) if numRepos == 0 { log.Warn("Index report contains 0 repositories. Packages will be marked as UNSCANNED.") } }

If you prefer smaller steps, you can start with a non-generic helper just for the truncation logic and keep two separate formatters.

3. Simplify fetchRepo2CPEMappingForDiagnostics body handling

You only care about “non-empty vs empty/unavailable”, so you don't need io.ReadAll and len(body). A single-byte read via LimitedReader reduces IO and branches:

func fetchRepo2CPEMappingForDiagnostics(ctx context.Context, mappingURL string, timeout time.Duration, client *http.Client) { if mappingURL == "" { log.Warn("Repo2CPE mapping URL is empty") return } log.Info("Attempting to fetch repo2cpe mapping for diagnostics...") reqCtx, cancel := context.WithTimeout(ctx, timeout) defer cancel() req, err := http.NewRequestWithContext(reqCtx, http.MethodGet, mappingURL, nil) if err != nil { log.Warnf("Could not create repo2cpe mapping request for %q: %v", mappingURL, err) return } resp, err := client.Do(req) if err != nil { log.Warnf("Could not download repo2cpe mapping from %q: %v", mappingURL, err) return } defer resp.Body.Close() var lr io.LimitedReader lr.R = resp.Body lr.N = 1 n, _ := io.Copy(io.Discard, &lr) // n is 0 if body is empty or not readable log.Debugf("Repo2CPE mapping fetch: status=%d, hasBody=%t", resp.StatusCode, n > 0) if resp.StatusCode >= http.StatusBadRequest || n == 0 { log.Warn("Repo2CPE mapping is unavailable or empty") } else if resp.StatusCode == http.StatusOK && n > 0 { log.Info("Repo2CPE mapping downloaded successfully") } }

This preserves the logging semantics (success only on 200 + non-empty body, warn otherwise) while reducing control flow and memory usage.

sourcery-ai · 2026-03-17T16:47:37Z

Hey @vikin91, I've posted a new review for you!

vikin91 added 6 commits March 17, 2026 11:27

Let agent log more details after indexing

9758057

Extend logging and VM discovery

8451f16

Refactor diagnostics

7ae52d8

Refactor discovery and diagnostics

3a667e8

Improve in a self-review process

e22d19f

Improve test coverage

ab5c6b9

openshift-ci bot added the do-not-merge/work-in-progress label Mar 17, 2026

github-actions bot added area/sensor ai-review labels Mar 17, 2026

sourcery-ai bot reviewed Mar 17, 2026

View reviewed changes

Make TestYumRepo... less flaky

bd9987a

vikin91 added 3 commits March 17, 2026 16:14

Ensure dnf5 is supported. Rephrase and truncate selected logs

c3167d1

Reformat code

fcf92a5

Reset generated/internalapi/central/v1/token_service.pb.go

2f59bb3

sourcery-ai bot reviewed Mar 17, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ROX-33621: Refactor VM host probing and diagnostics#19458

ROX-33621: Refactor VM host probing and diagnostics#19458
vikin91 wants to merge 10 commits intomasterfrom
piotr/ROX-33621-roxagent-diag

vikin91 commented Mar 17, 2026 •

edited

Loading

Uh oh!

vikin91 commented Mar 17, 2026 •

edited

Loading

Uh oh!

openshift-ci bot commented Mar 17, 2026

Uh oh!

sourcery-ai bot left a comment

Uh oh!

rhacs-bot commented Mar 17, 2026 •

edited

Loading

Uh oh!

codecov bot commented Mar 17, 2026 •

edited

Loading

Uh oh!

vikin91 commented Mar 17, 2026

Uh oh!

sourcery-ai bot commented Mar 17, 2026

Uh oh!

sourcery-ai bot left a comment

Uh oh!

sourcery-ai bot Mar 17, 2026

Uh oh!

sourcery-ai bot Mar 17, 2026

Uh oh!

sourcery-ai bot Mar 17, 2026

Uh oh!

sourcery-ai bot commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

vikin91 commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

User-facing documentation

Testing and quality

Automated testing

How I validated my change

Uh oh!

vikin91 commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

openshift-ci bot commented Mar 17, 2026

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

rhacs-bot commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

vikin91 commented Mar 17, 2026

Uh oh!

sourcery-ai bot commented Mar 17, 2026

Uh oh!

sourcery-ai bot left a comment

Choose a reason for hiding this comment

Uh oh!

sourcery-ai bot Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

sourcery-ai bot Mar 17, 2026

Choose a reason for hiding this comment

Uh oh!

sourcery-ai bot Mar 17, 2026

Choose a reason for hiding this comment

1. Factor out Yum repo error classification

2. Reduce repetition in logIndexReportDiagnostics

3. Simplify fetchRepo2CPEMappingForDiagnostics body handling

Uh oh!

sourcery-ai bot commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vikin91 commented Mar 17, 2026 •

edited

Loading

vikin91 commented Mar 17, 2026 •

edited

Loading

rhacs-bot commented Mar 17, 2026 •

edited

Loading

codecov bot commented Mar 17, 2026 •

edited

Loading

2. Reduce repetition in `logIndexReportDiagnostics`

3. Simplify `fetchRepo2CPEMappingForDiagnostics` body handling