fix: css_selector ignored in LXML scraping for raw:// URLs (#1484) by hafezparast · Pull Request #1833 · unclecode/crawl4ai

hafezparast · 2026-03-12T12:00:48Z

Summary

Fixes [Bug]: css_selector doesn't work But target_elements does! #1484
css_selector was ignored in _scrap(): only target_elements was applied to the DOM, so a css_selector passed on its own had no effect. Now css_selector filters the DOM first, then target_elements narrows within that selection.

Changes

crawl4ai/content_scraping_strategy.py: Added css_selector filtering before target_elements processing; target_elements now searches within the css_selector result instead of the full body
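
The filter-then-narrow order described above can be sketched roughly as follows. This is not the code from crawl4ai/content_scraping_strategy.py. It is a minimal illustration that assumes a hypothetical helper, `select_content`, and uses the standard library's `xml.etree.ElementTree` with its limited XPath subset as a stand-in for lxml and real CSS selectors, so that it runs without extra dependencies. The sample markup and the selector strings are made up for the example.

```python
import xml.etree.ElementTree as ET

# Made-up, well-formed sample markup (ElementTree needs valid XML).
HTML = """
<html><body>
  <div id="main">
    <p class="keep">inside main, targeted</p>
    <p class="skip">inside main, not targeted</p>
  </div>
  <div id="sidebar">
    <p class="keep">outside main, targeted class</p>
  </div>
</body></html>
"""


def select_content(body, css_selector=None, target_elements=None):
    """Hypothetical helper: apply css_selector first, then target_elements.

    Both arguments are ElementTree XPath expressions here, standing in
    for the CSS selectors that the real options accept.
    """
    # Step 1: css_selector restricts the search roots.
    # Without it, the whole body is the single root.
    roots = body.findall(css_selector) if css_selector else [body]
    if not target_elements:
        return roots

    # Step 2: target_elements are looked up inside the filtered roots only,
    # not inside the full body.
    matches = []
    for root in roots:
        for path in target_elements:
            matches.extend(root.findall(path))
    return matches


body = ET.fromstring(HTML).find("body")

# Both options set: only the targeted paragraph inside #main survives.
selected = select_content(
    body,
    css_selector=".//div[@id='main']",
    target_elements=[".//p[@class='keep']"],
)
texts = [el.text for el in selected]
print(texts)  # ['inside main, targeted']

# target_elements alone still searches the full body and finds both paragraphs.
unfiltered = select_content(body, target_elements=[".//p[@class='keep']"])
print(len(unfiltered))  # 2
```

With only `target_elements` set, the sidebar paragraph is also returned, which mirrors the pre-fix behavior where `css_selector` had no influence on the result.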

Test plan

New test suite: tests/test_issue_1484_css_selector.py (10 tests)
Regression suite: 304 passed, 1 pre-existing failure (no new regressions)

…#1484)

css_selector was skipped in _scrap(): only target_elements was applied. Now css_selector filters the DOM first, then target_elements narrows within that selection.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

hafezparast mentioned this pull request Mar 12, 2026

[Bug]: css_selector doesn't work But target_elements does! #1484

Open

hafezparast wants to merge 1 commit into unclecode:develop from hafezparast:fix/maysam-css-selector-raw-1484
