softwareclonesorg

University of BremenSoftware Engineering Group

Research Data

Here you can find various data related to our research, including corpora, results as well as scripts necessary for reproduction. We hope for and very much appreciate contributions directed on the same topics aiming to extend the existing data set.

We currently offer data sets to the following publications of ours:

SE 2022

Paper

R. Koschke, M. Steinbeck. How EvoStreets Are Observed in Three-Dimensional and Virtual Reality Environments. Software Engineering 2022, Lecture Notes in Informatics (LNI).

Data

Here you can download data gathered in the experiment. The README.md file enclosed describes the details.

IWSC 2016

Paper

R. Koschke, S. Bazrafshan. A Large-Scale Statistical Analysis of Software Clone Rates and Localization in Open-Source Programs Written in C, C++, C#, or Java. International Workshop on Software Clones, IEEE Computer Society, 2016.

Data

Coming soon!

ICPC 2012

Paper

J. Harder, R. Tiarks. A Controlled Experiment on Software Clones. International Conference on Program Comprehension, IEEE Computer Society Press, 2012; 219-228.

Data

Here you can download data used in the experiment. The downloads include the results of the original study, a replication set consisting of various materials, the Eclipse Plugin developed to observe user interaction, and the sources of the two subject games.

WCRE 2011

Paper

S. Bazrafshan, R. Koschke, N. Göde. Approximate Code Search in Program Histories. Working Conference on Reverse Engineering, IEEE Computer Society Press, 2011; 109-118.

Data

Here you can find the detailed results of the performance analysis of our tool approx.

TSE 2007

Paper

S. Bellon, R. Koschke, G. Antoniol, J. Krinke, E. Merlo. Comparison and Evaluation of Clone Detection Tools. IEEE Computer Society Transactions on Software Engineering September 2007; 33(9):577-591.

Data

Using the first link you can download the data of Stefan Bellon's clone detector benchmark in RCF format. For each system that was analyzed in the main experiment, a rcf file is present. To load the files please unpack the source tarball in the directory where the rcf file is located. The results of each clone detector, as well as the candiates are stored in seperate versions of the rcfs. Details on the sources of the systems as an ISO image can be found using the second link.