What's New in v0.7.0
🚀 Major Feature: Word Count Splitting
Added support for splitting large output files by maximum word count. This feature helps manage output size when converting large repositories.
Features:
- maximum_word_count_per_file: New configuration option in .repo-to-text-settings.yaml
- Automatically splits output into multiple files when content exceeds the specified word limit
- Split files are named using the convention 'output_filename_part_N.txt'
- Maintains file structure integrity when splitting
Configuration Example:
# Optional: Maximum word count per output file
maximum_word_count_per_file: 10000
Improvements & Fixes
- Enhanced error handling in process_line function
- Improved display path in save_repo_to_text output
- Added comprehensive test coverage for word count splitting
- Fixed pylint warnings and improved code quality
- Refactored logging statements for better readability
- Updated CI/CD pipeline with better test coverage
🧪 Testing
- Added extensive test suite for word count splitting functionality
- Tests for edge cases including very small word limits
- Validation for invalid configuration values
- Fixed multiple test assertion issues identified during CI
📖 Documentation
- Updated README.md with detailed word count splitting documentation
- Added configuration examples and usage patterns
- Improved settings file documentation
Technical Changes
- New
generate_output_content
function for handling content splitting - Enhanced
load_additional_specs
for configuration validation - Updated save functionality to handle multiple output files
- Improved type annotations throughout codebase
🏗️ Infrastructure
- Fixed GitHub Actions CI pipeline
- Enhanced pylint configuration and error handling
- Updated development dependencies
- Improved test reliability and debugging output
Thanks to @Zhan-Li
Full Changelog: v0.6.0...v0.7.0