Simian (Similarity Analyser) identifies duplication in Java, C#, C, C++, COBOL, Ruby, JSP, ASP, HTML, XML, Visual Basic source code and even plain text files. In fact, simian can be used on any human readable files such as ini files, deployment descriptor
K. Stroggylos, and D. Spinellis. WoSQ '07: Proceedings of the 5th International Workshop on Software Quality, page 10. Washington, DC, USA, IEEE Computer Society, (2007)