feat(healthomics): Extend the search capability to allow searches of adhoc s3 buckets #1948
…s as well as configured buckets.
Codecov Report
❌ Patch coverage is …
@@           Coverage Diff           @@
##             main    #1948   +/-   ##
=======================================
  Coverage   90.76%   90.76%
=======================================
  Files         860      860
  Lines       64522    64600   +78
  Branches    10464    10476   +12
=======================================
+ Hits        58563    58636   +73
- Misses       3653     3656    +3
- Partials     2306     2308    +2
alxawan
left a comment
Looks good to me!
src/aws-healthomics-mcp-server/awslabs/aws_healthomics_mcp_server/utils/validation_utils.py
...thomics-mcp-server/awslabs/aws_healthomics_mcp_server/search/genomics_search_orchestrator.py
a-li
left a comment
LGTM
alxawan
left a comment
LGTM
Adhoc S3 Buckets Implementation
Overview
This implementation adds support for adhoc S3 bucket searching to the genomics file search tool, allowing users to search buckets that are not part of the standard configuration.
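As a rough illustration of the idea, the sketch below shows one way a request could carry an optional list of adhoc bucket paths and have them merged with the configured ones. It is not the code from this PR: `SearchRequestSketch`, `normalize_bucket_path`, and `combine_bucket_paths` are hypothetical names, the bucket paths are made up, and the format check only looks at the `s3://` prefix. The real change also validates bucket access, which this sketch does not attempt.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SearchRequestSketch:
    """Hypothetical stand-in for GenomicsFileSearchRequest (not the real model)."""

    file_type: Optional[str] = None
    adhoc_s3_buckets: Optional[List[str]] = None


def normalize_bucket_path(path: str) -> str:
    """Check the s3:// prefix and return the path with a trailing slash."""
    if not path.startswith("s3://"):
        raise ValueError(f"Bucket path must start with s3://: {path!r}")
    bucket = path[len("s3://"):].split("/", 1)[0]
    if not bucket:
        raise ValueError(f"Bucket name is missing: {path!r}")
    return path if path.endswith("/") else path + "/"


def combine_bucket_paths(
    configured: List[str], adhoc: Optional[List[str]]
) -> List[str]:
    """Merge configured and adhoc paths, dropping duplicates but keeping order."""
    combined: List[str] = []
    seen = set()
    for path in list(configured) + list(adhoc or []):
        normalized = normalize_bucket_path(path)
        if normalized not in seen:
            seen.add(normalized)
            combined.append(normalized)
    return combined


# Assumed example values; neither bucket exists in the PR.
request = SearchRequestSketch(
    adhoc_s3_buckets=[
        "s3://example-adhoc-bucket/runs",
        "s3://example-configured-bucket/",
    ]
)
paths = combine_bucket_paths(
    ["s3://example-configured-bucket/"], request.adhoc_s3_buckets
)
print(paths)
```

Leaving `adhoc_s3_buckets` as `None` keeps the configured-only behaviour in this sketch, which matches the PR's statement that the change is not breaking.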
Changes Made
1. Model Updates (models/search.py)
- adhoc_s3_buckets: Optional[List[str]] field to GenomicsFileSearchRequest
2. Validation Utilities (utils/validation_utils.py)
- validate_adhoc_s3_buckets() function
- validate_bucket_access() from s3_utils
3. Search Orchestrator (search/genomics_search_orchestrator.py)
- _get_all_s3_bucket_paths() method to combine configured and adhoc buckets
- _search_s3_with_timeout_for_buckets() for bucket-specific searches
- _search_s3_paginated_with_timeout_for_buckets() for paginated searches
4. Tool Function (tools/genomics_file_search.py)
- adhoc_s3_buckets parameter with comprehensive documentation
Features
Security & Validation
Performance
Compatibility
Usage Examples
Basic Usage
Multiple Adhoc Buckets
With Pagination
Error Handling
Testing
The implementation includes comprehensive validation and testing.
Benefits
Checklist
If your change doesn't seem to apply, please leave them unchecked.
Is this a breaking change? (Y/N)
N
RFC issue number:
Checklist:
Acknowledgment
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of the project license.