id: work_nrnit4emvfbfhaqenwnkv7lyla
author: David F. Barrero
title: Adapting Searchy to extract data using evolved wrappers
date: 2012
pages: 12
extension: .pdf
mime: application/pdf
words: 8332
sentences: 740
flesch: 65
summary: Searchy, an agent-based mediator system specialized in data extraction and integration. To achieve this, a Genetic Algorithm (GA) is used to learn a regex able to extract a set of positive samples while rejecting a set of negative ... Multiagent System (MAS) to generate a composed regular expression able to extract records that match a training set, able to evolve a simple regex through a VLGA with an automatically generated alphabet and extract records matching the ... a complex wrapper able to extract data by means of evolutionary regular expressions. The Control Module sets the flow of operations that the different elements involved in the integration must perform, including the wrappers, the Mapping Module, and the Integration Module. ... most useful wrappers supported by Searchy is the regex wrapper, which is able to extract data from unstructured documents. Figure 2: Example of the integration process in Searchy, with two data sources, one relational database and a directory.
cache: ./cache/work_nrnit4emvfbfhaqenwnkv7lyla.pdf
txt: ./txt/work_nrnit4emvfbfhaqenwnkv7lyla.txt