fix: add pre-flight module import check to exit early before API calls by aseembits93 · Pull Request #1745 · codeflash-ai/codeflash

aseembits93 · 2026-03-03T22:47:36Z

Summary

Adds a find_spec import validation check in can_be_optimized() that runs before any expensive API calls (test generation + optimization)
If the import fails due to a wrong module-root, automatically infers the correct value from the __init__.py chain, updates pyproject.toml, and retries
If the import fails due to a missing dependency (not a module-root issue), fails early with a clear error message instead of wasting API credits

Test plan

Set module-root to an incorrect value in pyproject.toml and run codeflash --file <file> --function <func> — verify it auto-corrects and updates pyproject.toml
Remove a dependency import and verify the pre-flight check catches it before API calls
Run with a correct module-root and verify no regression — optimization proceeds normally

🤖 Generated with Claude Code

When module-root is misconfigured or dependencies are missing, the optimization pipeline previously made expensive API calls (test generation + optimization) before discovering the failure at baseline test execution. This adds a subprocess-based import check in can_be_optimized() that runs before any API calls. If the import fails, it attempts to auto-correct module-root by inferring it from the __init__.py chain and updating pyproject.toml. If auto-correction isn't possible (e.g. missing dependency), it fails early with a clear error message. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

claude · 2026-03-03T22:48:13Z

Claude finished @aseembits93's task in 5m 18s —— View job

PR Review Summary

Triage PR scope
Run lint/typecheck — fixed 1 ruff (unsorted imports) + 3 new mypy errors (commit 0404570f)
Resolve stale review threads — none found
Code review
Duplicate detection
Test coverage
Merge optimization PRs — skipped (requires elevated permissions not granted)

Prek Checks

All ruff and format checks pass after fixing the unsorted-import error in function_optimizer.py.

Code Review

Bugs / Issues found:

infer_module_root_from_file return type annotation is misleading (code_utils.py:383): The function is declared -> Path | None but never actually returns None — the fallback always returns file_path.parent. As a result, the if inferred_root is None guard in try_correct_module_root is dead code. The return type should be -> Path. Fix this →
Double validate_module_import call when auto-correction succeeds (function_optimizer.py:78,109): try_correct_module_root() internally calls validate_module_import to confirm the new root works (line 78), and then can_be_optimized() calls it again immediately after (line 109) on the same path. When auto-correction succeeds, the import is validated twice unnecessarily. Minor performance issue but worth noting.
sys.path mutation in validate_module_import is not thread-safe (code_utils.py:368): In the --worktree parallel optimization mode, concurrent calls could race on sys.path.insert/remove. This is low risk since the pre-flight check runs before parallelism begins, but worth a note if the function is reused elsewhere.
Pending reviewer request: pre_flight hook in base class (function_optimizer.py): KRRT7's second review comment ("if pre-flight doesn't exist in the language base add it") hasn't been addressed. The current implementation overrides can_be_optimized() directly. A cleaner design would be to add a pre_flight() -> Result[None, str] | None no-op hook to FunctionOptimizer and call it at the top of can_be_optimized(), then override just that hook in PythonFunctionOptimizer. This would avoid duplicating the super().can_be_optimized() dispatch pattern if other language optimizers need pre-flight checks in the future.

Looks good:

The find_spec approach (no subprocess, no actual module init) is the right lightweight check
The sys.path cleanup via finally is correct
The __init__.py chain walking logic in infer_module_root_from_file is sound
The pyproject.toml update uses tomlkit (preserves formatting) correctly
The in-memory config update (self.args.module_root, self.project_root, self.original_module_path) covers all the fields that downstream code uses

Duplicate Detection

No duplicates detected. The validate_module_import helper in code_utils.py is distinct from the simple inline find_spec("pkg") is not None usages scattered elsewhere (comparator, code_extractor) — those are one-line boolean checks while this function adds sys.path management and structured error reporting.

Test Coverage

All 21 tests in tests/code_utils/test_module_import_validation.py pass. The new validate_module_import and infer_module_root_from_file functions are well-covered (lines 357–410 not in the missing-lines list). The try_correct_module_root path is covered by the 5 tests in TestTryCorrectModuleRoot.

Last updated: 2026-03-04 | | Branch

KRRT7 · 2026-03-03T22:50:22Z

a full subprocess call is far more expensive than how we check if something is installed in the comparator:
https://github.com/codeflash-ai/codeflash/blob/main/codeflash/verification/comparator.py#L21

you can use that for this

The function should return the topmost directory with __init__.py (the top-level package), not the first directory without it. This matches the convention where module-root = package directory and project_root = its parent (the PYTHONPATH entry). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

aseembits93 · 2026-03-03T23:05:30Z

this pr is still in progress

The pre-flight check only tried to correct module-root when the import failed, but imports can succeed even with a misconfigured module-root, causing downstream test result parsing failures after expensive API calls. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Cover validate_module_import, infer_module_root_from_file, and FunctionOptimizer.try_correct_module_root with 21 tests. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Tests were patching functions at their definition modules (e.g. importlib.util.find_spec) instead of where they are imported and used (e.g. codeflash.code_utils.code_utils.find_spec). With `from X import Y`, the local binding is unaffected by patching X.Y. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

aseembits93 · 2026-03-04T00:08:36Z

@claude is the windows unittest failure related to this pull request?

claude · 2026-03-04T00:09:08Z

Claude finished @aseembits93's task in 48s —— View job

Windows unittest failure analysis

Check CI status for failing workflows
Analyze failing test details
Determine if failure is related to this PR

No, the Windows unittest failure is not related to this PR.

The failing test is:

FAILED tests/test_languages/test_javascript_test_runner.py::TestBundledJestReporter::test_reporter_produces_valid_junit_xml

The error is a Windows path separator issue in the Jest reporter module path:

Error: Cannot find module 'D:acodeflashcodeflashpackagescodeflashruntimejest-reporter.js'

The path D:acodeflash... is a mangled Windows path — backslashes are being dropped when constructing the path to jest-reporter.js. This is a pre-existing issue in the JavaScript test runner, completely unrelated to this PR's changes.

Evidence that this PR is not the cause:

This PR only touches codeflash/code_utils/code_utils.py, codeflash/optimization/function_optimizer.py, and tests/code_utils/test_module_import_validation.py
The new test file tests/code_utils/test_module_import_validation.py passed all 21 tests on Windows (.....................)
The failing test is in test_javascript_test_runner.py — JavaScript/Jest infrastructure, nothing this PR modified

The Windows Jest reporter path bug is a separate pre-existing issue that would need to be fixed in the JavaScript test runner code.

| View failing job

aseembits93 · 2026-03-04T00:11:38Z

@claude fix the failing tests/test_languages/test_javascript_test_runner.py::TestBundledJestReporter::test_reporter_produces_valid_junit_xml

claude · 2026-03-04T00:12:32Z

Claude finished @aseembits93's task in 1m 12s —— View job

Fix: Windows path separator issue in Jest reporter test

Triage request
Read failing test file — root cause identified
Implement the fix
Commit and push

Root Cause

In test_reporter_produces_valid_junit_xml, Windows Path objects use backslashes which become JavaScript escape sequences when embedded directly in the JS string template:

# Before (broken on Windows):
process.env.JEST_JUNIT_OUTPUT_FILE = '{output_file}';  # D:\a\... → D:a... (backslash escapes mangled)
const Reporter = require('{reporter_path}');

Fix (commit `7014a955`)

Used .as_posix() for both paths when embedding in the JS script — Node.js accepts forward slashes on Windows:

# After (works on all platforms):
process.env.JEST_JUNIT_OUTPUT_FILE = '{output_file.as_posix()}';
const Reporter = require('{reporter_path.as_posix()}');

| Branch

…plate Windows Path backslashes become JS escape sequences when interpolated directly into the script string, mangling the path. Co-authored-by: Aseem Saxena <aseembits93@users.noreply.github.com>

KRRT7 · 2026-03-04T00:37:43Z

codeflash/optimization/function_optimizer.py

                    f"Cannot optimize without tests when --no-gen-tests is set."
                )

+        # Pre-flight: verify module-root consistency and importability before expensive API calls


instead of implementing it in the function optimizer here, can you find a good spot for it in https://github.com/codeflash-ai/codeflash/blob/main/codeflash/languages/python/function_optimizer.py?

if pre-flight doesn't exist in the language base add it

…ythonFunctionOptimizer The try_correct_module_root() method and the Python import validation guard in can_be_optimized() used Python-only concepts (__init__.py chain walking, pyproject.toml updates). Moving them to PythonFunctionOptimizer keeps the base FunctionOptimizer language-agnostic. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…rdering Co-authored-by: Aseem Saxena <aseembits93@users.noreply.github.com>

aseembits93 and others added 3 commits March 3, 2026 15:12

use find spec instead of subprocess import

4ebf9df

test: add unit tests for early module import validation feature

8d0d883

Cover validate_module_import, infer_module_root_from_file, and FunctionOptimizer.try_correct_module_root with 21 tests. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

aseembits93 requested a review from KRRT7 March 3, 2026 23:30

aseembits93 and others added 2 commits March 3, 2026 15:34

only top level imports

4d21542

aseembits93 requested a review from misrasaurabh1 March 3, 2026 23:53

codeflash-ai deleted a comment from claude bot Mar 4, 2026

fix: use as_posix() for paths embedded in Jest reporter JS string tem…

7014a95

…plate Windows Path backslashes become JS escape sequences when interpolated directly into the script string, mangling the path. Co-authored-by: Aseem Saxena <aseembits93@users.noreply.github.com>

KRRT7 requested changes Mar 4, 2026

View reviewed changes

aseembits93 and others added 2 commits March 3, 2026 18:53

fix: guard against None args in try_correct_module_root, fix import o…

0404570

…rdering Co-authored-by: Aseem Saxena <aseembits93@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: add pre-flight module import check to exit early before API calls#1745

fix: add pre-flight module import check to exit early before API calls#1745
aseembits93 wants to merge 10 commits intomainfrom
fix/early-module-import-validation

aseembits93 commented Mar 3, 2026 •

edited

Loading

Uh oh!

claude bot commented Mar 3, 2026 •

edited

Loading

Uh oh!

KRRT7 commented Mar 3, 2026

Uh oh!

aseembits93 commented Mar 3, 2026

Uh oh!

aseembits93 commented Mar 4, 2026

Uh oh!

claude bot commented Mar 4, 2026 •

edited

Loading

Uh oh!

aseembits93 commented Mar 4, 2026

Uh oh!

claude bot commented Mar 4, 2026 •

edited

Loading

Uh oh!

KRRT7 Mar 4, 2026

Uh oh!

KRRT7 Mar 4, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

aseembits93 commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

claude bot commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review Summary

Prek Checks

Code Review

Duplicate Detection

Test Coverage

Uh oh!

KRRT7 commented Mar 3, 2026

Uh oh!

aseembits93 commented Mar 3, 2026

Uh oh!

aseembits93 commented Mar 4, 2026

Uh oh!

claude bot commented Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Windows unittest failure analysis

Uh oh!

aseembits93 commented Mar 4, 2026

Uh oh!

claude bot commented Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Fix: Windows path separator issue in Jest reporter test

Root Cause

Fix (commit 7014a955)

Uh oh!

KRRT7 Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

KRRT7 Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

aseembits93 commented Mar 3, 2026 •

edited

Loading

claude bot commented Mar 3, 2026 •

edited

Loading

claude bot commented Mar 4, 2026 •

edited

Loading

claude bot commented Mar 4, 2026 •

edited

Loading

Fix (commit `7014a955`)

KRRT7 Mar 4, 2026 •

edited

Loading