Breaking the Multi-Colored Box is a framework written in Java which is designed to test various techniques of generating known CAPTCHA and then attempting to break them using image filtering, segmentation and character recognition.
BMCB was part of my thesis research and never made it out of alpha phase. It has a basic set of filters based around the ImageMagick framework and some rudimentary analysis based on standard open-source Optical Character Recognition (OCR) applications. It’s a good framework, but I have no plans to develop it further at this time.
BMCB is licensed under the GNU GPLv3. It is free to use, modify and redistribute so long as the original GPL license is maintained.
Bmcb-0.5a.tar.bz2 (Alpha 0.5 Source Code)
Developer Documentation 0.5alpha