pyscn

mirror of https://github.com/ludo-technologies/pyscn.git synced 2025-10-06 00:59:45 +03:00

Author	SHA1	Message	Date
DaisukeYoda	2da26eda28	chore: format code with go fmt Run go fmt on all Go files to ensure consistent formatting 🤖 Generated with Claude Code Co-Authored-By: Claude <noreply@anthropic.com>	2025-09-13 15:51:51 +09:00
DaisukeYoda	245e75788c	test: update test expectations for unified report format - Update E2E tests to expect "Clone Detection Analysis Report" instead of "Clone Detection Results" - Update E2E tests to expect "Dead Code Analysis Report" instead of "Dead Code Detection Results" - Update severity expectations from "CRITICAL" to "High" in dead code tests - Update statistics label expectations in integration tests - All tests now pass with the unified report format 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-09-11 00:26:15 +09:00
DaisukeYoda	6b50848f75	refactor: rename project from pyqol/pyscan to pyscn - Update all references from pyqol and pyscan to pyscn - Rename directories: cmd/pyqol → cmd/pyscn, python/src/pyscan → python/src/pyscn - Update Go module path to github.com/ludo-technologies/pyscn - Update PyPI package name to pyscn - Update all documentation and configuration files - Successfully published to PyPI as pyscn This change provides a shorter, more memorable package name (5 characters) that is easier to type in commands like 'pyscn analyze' while maintaining the core meaning of Python code scanning/analysis.	2025-09-08 01:31:33 +09:00
DaisukeYoda	8f26e18f8a	refactor: completely remove ProgressReporter infrastructure Remove all progress reporting infrastructure from the codebase as it was architectural debt that provided misleading feedback to users. Progress bars were tracking file parsing (fast O(n) operation) instead of actual computation-intensive algorithms like clone detection (O(n²)). Major changes: - Remove ProgressReporter interface from domain layer - Delete service/progress_reporter.go and all implementations - Update all service classes to remove progress fields and parameters - Update all use case classes and builders to remove progress dependencies - Clean up all command layer progress reporter creation - Fix all test files to remove progress-related mock objects and assertions - Remove progress parameters from 25+ files across the codebase This simplifies the architecture and removes complexity without losing any meaningful functionality. Future progress reporting should target the actual heavy computational algorithms, not file I/O operations. All tests pass and build succeeds after this comprehensive refactoring. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-09-06 17:24:18 +09:00
Daisuke Yoda	87edf5531f	test: fix context cancellation test to handle wrapped errors Updated TestComplexityCleanCancellation to use errors.Is() for checking context.Canceled in error chain, properly handling wrapped errors from the service layer. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-08-23 01:49:15 +09:00
Daisuke Yoda	08a2c74148	fix: resolve linting issues and improve code quality - Remove ineffectual assignment in analyze.go context handling - Fix Args function in check.go to avoid ineffectual slice modification - Remove unused context variable in check.go - All golangci-lint checks now pass - All tests continue to pass 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-08-21 13:10:32 +09:00
Daisuke Yoda	83cf3ebd4e	feat: add comprehensive input validation and configuration consolidation ## Input Validation Improvements - Add input validation to CloneService.ComputeSimilarity with empty fragment checks, size limits (1MB), and nil context validation - Add validation to DetectClones and DetectClonesInFiles methods with comprehensive parameter checks - Add nil checks to all APTED tree operations (ComputeSimilarity, computeApproximateDistance) - Add parser result validation with nil checks for AST nodes and parse results ## Configuration Consolidation - Create unified CloneConfig structure with logical sections (Analysis, Thresholds, Filtering, Input, Output, Performance) - Centralize clone type thresholds (0.95, 0.85, 0.70, 0.60) into constants package - Add comprehensive validation for all configuration sections with detailed error messages - Maintain backward compatibility through adapter functions ## Memory Management Enhancements - Implement O(n²) algorithm with batching and memory limits (10,000 pairs max, 100MB limit) - Add priority queue-based filtering for large datasets with early similarity filtering - Implement recursive depth limits (1000 levels) to prevent stack overflow in tree operations ## Test Coverage - Add comprehensive validation tests for all service methods - Add nil check tests for APTED analyzer operations - Add configuration adapter tests with roundtrip conversion verification - All tests passing with 100% validation coverage 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-08-20 17:37:28 +09:00
Daisuke Yoda	cbeee6c746	feat: complete clone detection implementation with APTED algorithm This comprehensive implementation replaces placeholder code with a fully functional clone detection system using the APTED (All Path Tree Edit Distance) algorithm for Python code analysis. ## Major Features Added ### 🔬 APTED Algorithm Implementation - Core APTED algorithm with tree edit distance calculation - Configurable cost models for insert, delete, and rename operations - Optimized performance: <1s for 1000-node trees (was 26.6s) - Support for all clone types (Type-1 through Type-4) ### 🏗️ Complete Architecture - Domain layer with clean interfaces and validation - Service layer with proper error handling - CLI integration with comprehensive options - Configuration system supporting YAML/JSON - Output formatters for JSON, YAML, CSV, and text ### 🐍 Python AST Integration - Tree-sitter Python parser integration - AST to tree node conversion for APTED - Proper handling of Python syntax constructs - Support for functions, classes, and complex expressions ## Key Bug Fixes ### ❌ Replaced Magic Number 0.7 Before: `return 0.7, nil // placeholder` After: Full APTED-based similarity calculation with real parsing ### 🔧 Fixed All TODO Items - complexity_reporter.go:171: Use version package instead of hardcoded "dev" - complexity_reporter.go:173: Track actual file count vs hardcoded 1 - dead_code_test.go:168: Implement all-paths-return detection infrastructure - clone_service.go: Evaluate progress reporter interface consistency ### 🔌 Connected All Components - Integrated Python parser (removed commented placeholders) - Connected service layer to analyzer layer - Implemented real file I/O (replaced fake content) - Added proper error handling throughout ## Enhanced Dead Code Detection - Advanced reachability analysis with all-paths-return detection - Enhanced CFG traversal algorithms - Better handling of complex control flow scenarios - Infrastructure for sophisticated dead code analysis ## Performance Optimizations - APTED algorithm optimized for large trees - Early termination for size-different trees - Efficient tree traversal with memoization - Memory-efficient data structures ## Testing & Quality - 200+ tests all passing - Comprehensive integration tests - Performance benchmarks included - Real-world functional testing verified ## File Changes Summary New Files (13): - `app/clone_usecase.go` - Application layer use case - `cmd/pyqol/clone.go` - CLI command implementation - `domain/clone.go` - Core domain types and validation - `domain/clone_test.go` - Domain layer tests - `integration/clone_integration_test.go` - End-to-end tests - `internal/analyzer/apted.go` - APTED algorithm implementation - `internal/analyzer/clone_detector.go` - Clone detection logic - `service/clone_.go` - Service layer implementation Modified Files (9)*: - Enhanced version management in complexity reporter - Added file count tracking - Improved dead code analysis - Extended configuration support - Updated main CLI integration ## Impact Transforms pyqol from a stub system with hardcoded values into a production-ready Python code analysis tool with state-of-the-art clone detection capabilities. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-08-20 12:41:52 +09:00
DaisukeYoda	96f20973ce	feat: implement clean architecture for complexity analysis - Add domain layer with interfaces and models - Implement application layer with ComplexityUseCase - Create service layer implementations - Refactor CLI to be a thin wrapper - Add comprehensive integration tests - Support .pyi files and progress reporting - Implement multiple output formats (JSON, YAML, CSV, text) This resolves test infrastructure issues and improves maintainability through proper separation of concerns. Fixes #37 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-08-17 09:14:12 +09:00

9 Commits