ITEA is the Eureka Cluster on software innovation

Code Similarity Investigator Tool (CSI)

Project
20023 SmartDelta
Type
New product
Description

Code Similarity Investigator (CSI) is a tool that helps developers identify similar code across large projects. It uses advanced graph-based analysis to detect both syntactic and semantic similarities, making it easier to find reusable components and reduce redundant code. CSI allows developers to prioritize areas for refactoring and testing, saving time and improving code quality. Its accurate results, closely aligned with developer judgment, help streamline maintenance and development efforts, making software easier to manage and evolve.

Contact
Emanuel Ramneantu (TWT GmbH)
Email
emanuel.ramneantu@twt-gmbh.de
Technical features

The tool exposes a FastAPI interface for integration with other systems and ships as a Docker container for deployment. It compares code semantically with a custom Graph Edit Distance algorithm, using Joern to parse syntax, control flow, and data dependencies into code graphs.
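To make the idea of a graph edit distance concrete, the following is a minimal sketch and not the custom algorithm used by CSI, which is not described here. It computes the standard label-multiset lower bound on unit-cost graph edit distance and turns it into a score between 0 and 1. The graph representation, the function names, and the example node labels and edge types (chosen to resemble what a code graph might contain) are all assumptions made for illustration.

```python
from collections import Counter
from typing import Dict, List, Tuple

# Hypothetical graph representation, for illustration only:
# (node id -> node label, list of (source id, target id, edge type)).
Graph = Tuple[Dict[int, str], List[Tuple[int, int, str]]]


def multiset_overlap(a: Counter, b: Counter) -> int:
    """Size of the multiset intersection of two label counters."""
    return sum((a & b).values())


def ged_lower_bound(g1: Graph, g2: Graph) -> int:
    """Label-multiset lower bound on unit-cost graph edit distance.

    Every node (or edge) that cannot be matched to one with the same
    label (or type) needs at least one insert, delete, or relabel.
    """
    nodes1, edges1 = g1
    nodes2, edges2 = g2
    node_overlap = multiset_overlap(Counter(nodes1.values()),
                                    Counter(nodes2.values()))
    edge_overlap = multiset_overlap(Counter(t for _, _, t in edges1),
                                    Counter(t for _, _, t in edges2))
    node_cost = max(len(nodes1), len(nodes2)) - node_overlap
    edge_cost = max(len(edges1), len(edges2)) - edge_overlap
    return node_cost + edge_cost


def similarity(g1: Graph, g2: Graph) -> float:
    """Map the bound to [0, 1]; 1.0 means the bound found no difference."""
    size = max(len(g1[0]), len(g2[0])) + max(len(g1[1]), len(g2[1]))
    if size == 0:
        return 1.0
    return 1.0 - ged_lower_bound(g1, g2) / size


# Two small, made-up graphs: g_b has one extra call compared with g_a.
g_a: Graph = (
    {0: "METHOD", 1: "CALL", 2: "RETURN"},
    [(0, 1, "AST"), (0, 2, "AST"), (1, 2, "CFG")],
)
g_b: Graph = (
    {0: "METHOD", 1: "CALL", 2: "CALL", 3: "RETURN"},
    [(0, 1, "AST"), (0, 2, "AST"), (0, 3, "AST"), (1, 2, "CFG"), (2, 3, "CFG")],
)

print(ged_lower_bound(g_a, g_b))  # 3: one extra node, two extra edges
print(similarity(g_a, g_a))       # 1.0
```

Because this bound ignores graph structure, it can only underestimate the true edit distance. A full graph edit distance also accounts for how nodes are connected, which is more expensive to compute.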

Integration constraints

The tool requires read access to the source code in order to build and analyze code graphs. Initial configuration, which covers setting up graph generation and fine-tuning similarity thresholds, typically takes a few days. Minimal developer training is recommended for interpreting analysis results and adjusting similarity parameters. Integration with common build tools (e.g., Maven, Gradle) is planned as future work.
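As an illustration of what tuning a similarity threshold involves, the sketch below filters scored pairs of code units at several cut-offs. It is an assumption about how such results might be post-processed, not CSI's actual output format or configuration. The pair names and scores are made up for the example.

```python
from typing import List, Tuple

# Hypothetical result record: (code unit A, code unit B, similarity in [0, 1]).
ScoredPair = Tuple[str, str, float]


def pairs_above(scored: List[ScoredPair], threshold: float) -> List[ScoredPair]:
    """Keep pairs scoring at or above the threshold, most similar first."""
    kept = [pair for pair in scored if pair[2] >= threshold]
    return sorted(kept, key=lambda pair: pair[2], reverse=True)


# Made-up scores, for illustration only.
scored: List[ScoredPair] = [
    ("orders.parse", "invoices.parse", 0.92),
    ("orders.validate", "users.validate", 0.81),
    ("orders.export", "reports.render", 0.64),
    ("users.login", "reports.render", 0.35),
]

# A lower threshold reports more candidate pairs to review.
for threshold in (0.6, 0.8, 0.9):
    print(threshold, len(pairs_above(scored, threshold)))
```

A higher threshold leaves fewer candidates to review but may miss looser semantic matches, so the cut-off is usually adjusted against a sample of pairs that developers have judged by hand.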

Targeted customer(s)

Engineering and consulting customers from any domain that develop code in Git repositories.

Conditions for reuse

Licensing and permission required

Confidentiality
Public
Publication date
03-02-2025
Involved partners
TWT GmbH Science & Innovation (DEU)
Software AG (DEU)
Vaadin (FIN)
