
Code Similarity Investigator Tool (CSI)
- Project
- 20023 SmartDelta
- Type
- New product
- Description
Code Similarity Investigator (CSI) is a tool that helps developers identify similar code across large projects. It uses advanced graph-based analysis to detect both syntactic and semantic similarities, making it easier to find reusable components and reduce redundant code. CSI allows developers to prioritize areas for refactoring and testing, saving time and improving code quality. Its accurate results, closely aligned with developer judgment, help streamline maintenance and development efforts, making software easier to manage and evolve.
- Contact
- Emanuel Ramneantu (TWT GmbH)
- emanuel.ramneantu@twt-gmbh.de
- Technical features
This software exposes a FastAPI interface for integration and ships as a Docker container for deployment. It uses a custom Graph Edit Distance algorithm for semantic code comparison and relies on Joern to parse syntax, control flow, and data dependencies.
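To illustrate the idea behind graph edit distance, the following is a minimal sketch, not CSI's custom algorithm. It assumes tiny labeled directed graphs, unit costs for every node and edge edit, and a brute-force search over node mappings. The function names, the toy graphs, and the normalisation into a similarity score are hypothetical choices made for this example only.

```python
from itertools import permutations


def graph_edit_distance(nodes_a, edges_a, nodes_b, edges_b):
    """Exact unit-cost GED by brute force; practical only for tiny graphs.

    nodes_*: dict mapping node id -> label; edges_*: set of (source, target).
    """
    ids_a, ids_b = list(nodes_a), list(nodes_b)
    size = max(len(ids_a), len(ids_b))
    # Pad the smaller side with None: pairing a node with None models
    # a node deletion (from A) or a node insertion (into B).
    ids_a += [None] * (size - len(ids_a))
    ids_b += [None] * (size - len(ids_b))

    # Upper bound: delete everything in A, then insert everything in B.
    best = len(nodes_a) + len(edges_a) + len(nodes_b) + len(edges_b)
    for perm in permutations(ids_b):
        pairs = list(zip(ids_a, perm))
        # A node pair costs 1 if it is a deletion, an insertion, or a relabel.
        node_cost = sum(
            1 for a, b in pairs
            if a is None or b is None or nodes_a[a] != nodes_b[b]
        )
        mapping = {a: b for a, b in pairs if a is not None}
        # An edge of A is preserved if its mapped endpoints form an edge of B.
        matched = sum(
            1 for u, v in edges_a if (mapping[u], mapping[v]) in edges_b
        )
        edge_cost = (len(edges_a) - matched) + (len(edges_b) - matched)
        best = min(best, node_cost + edge_cost)
    return best


def similarity(nodes_a, edges_a, nodes_b, edges_b):
    """Map the distance to [0, 1]; this normalisation is an assumption."""
    worst = len(nodes_a) + len(edges_a) + len(nodes_b) + len(edges_b)
    if worst == 0:
        return 1.0
    distance = graph_edit_distance(nodes_a, edges_a, nodes_b, edges_b)
    return 1.0 - distance / worst


# Toy graphs standing in for parsed code: labels are statement kinds.
straight_nodes = {1: "assign", 2: "call", 3: "return"}
straight_edges = {(1, 2), (2, 3)}
looping_nodes = {"a": "assign", "b": "loop", "c": "return"}
looping_edges = {("a", "b"), ("b", "c"), ("b", "b")}

print(graph_edit_distance(straight_nodes, straight_edges,
                          looping_nodes, looping_edges))
print(similarity(straight_nodes, straight_edges,
                 looping_nodes, looping_edges))
```

The two toy graphs differ by one relabelled node and one extra self-loop edge, so the distance is 2. The brute-force search grows factorially with graph size, which is why practical tools rely on approximations or tailored algorithms for real code graphs.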
- Integration constraints
This solution requires read access to source code in order to build and analyze code graphs. Configuration typically takes a few days, covering the setup of graph generation and the fine-tuning of similarity thresholds. Minimal developer training is recommended for interpreting analysis results and adjusting similarity parameters. Integration with common build tools (e.g., Maven, Gradle) is planned as future work.
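As a rough illustration of what fine-tuning a threshold means in practice, the sketch below filters scored code pairs by a cut-off value. The function name, the score format, and the example values are hypothetical and do not reflect CSI's actual output or configuration.

```python
def flag_similar_pairs(scores, threshold):
    """Return (pair, score) items at or above the threshold, most similar first."""
    flagged = [(pair, score) for pair, score in scores.items() if score >= threshold]
    return sorted(flagged, key=lambda item: item[1], reverse=True)


# Made-up similarity scores between pairs of functions.
scores = {
    ("parse_config", "load_settings"): 0.91,
    ("parse_config", "render_page"): 0.32,
    ("save_user", "store_account"): 0.78,
}

# A lower threshold flags more candidate pairs for review.
for threshold in (0.9, 0.7):
    pairs = [pair for pair, _ in flag_similar_pairs(scores, threshold)]
    print(threshold, pairs)
```

A stricter threshold yields fewer, higher-confidence candidates, while a looser one surfaces more pairs at the cost of more manual review.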
- Targeted customer(s)
Engineering and consulting customers from any domain who develop code in Git repositories.
- Conditions for reuse
Licensing and permission required
- Confidentiality
- Public
- Publication date
- 03-02-2025
- Involved partners
- TWT GmbH Science & Innovation (DEU)
- Software AG (DEU)
- Vaadin (FIN)
