JedAI is an open-source, highly scalable toolkit that offers out-of-the-box solutions for data integration tasks such as Record Linkage, Entity Resolution, and Link Discovery. At its core lies a set of domain-independent, state-of-the-art techniques that apply to both RDF and relational data. These techniques rely on approximate, schema-agnostic processing based on (meta-)blocking, which is what makes them scale. You can read more about JedAI here and you can find the source code in this repository.
JedAI-WebApp is a GUI developed with Spring (Boot + MVC) and ReactJS that facilitates the execution of JedAI. It lets the user construct the desired workflow by sequentially selecting the algorithm(s) for each step. JedAI-WebApp also provides the following capabilities:
- Multiple data input interfaces
- Data (entities) Exclusion
- Data Exploration
- Automatic configuration of the algorithms' parameters. Users can set the parameter values themselves or let the system search for the values that produce the best results. The search is performed with either Grid Search or Random Search.
- Detailed Results and display of the logs
- Exploration of the data and results.
Furthermore, it facilitates the benchmarking of different workflows or configurations over a particular dataset through the workbench window, which summarizes the outcome of all runs and maintains details about the performance and the configuration of every step.
You can either build from source or download the available Docker image here.
After installing Docker on your machine, type the following commands:
docker pull gmandi/jedai-webapp
docker run -e JAVAOPTIONS='-Xmx4g' -p 8080:8080 -v /absolute/path gmandi/jedai-webapp
Then open your browser and go to localhost:8080. JedAI should be running in your browser!
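The container can take a while to start, so it may help to poll the port before opening the browser. The following is a minimal sketch, assuming `curl` is installed and that the root path of the application answers with a success or redirect status; `wait_for_http` is a hypothetical helper, not part of JedAI-WebApp.

```shell
# Poll a URL until it answers or the attempts run out.
# wait_for_http is a hypothetical helper, not part of JedAI-WebApp.
# Usage: wait_for_http URL [attempts] [delay_seconds]
wait_for_http() {
  url=$1
  attempts=${2:-30}
  delay=${3:-2}
  i=1
  while [ "$i" -le "$attempts" ]; do
    # --silent hides progress output, --fail makes HTTP errors (>= 400)
    # exit non-zero, --max-time bounds each probe, and the body is discarded.
    if curl --silent --fail --max-time 5 --output /dev/null "$url"; then
      echo "up after $i attempt(s): $url"
      return 0
    fi
    sleep "$delay"
    i=$((i + 1))
  done
  echo "no response from $url after $attempts attempt(s)" >&2
  return 1
}
```

For example, `wait_for_http http://localhost:8080 30 2` probes the application up to 30 times, two seconds apart, and returns a non-zero status if it never responds.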
Building from source requires Java 8, Maven 3, and npm.
In the src/main/resources/application.properties file, set the following fields:
spring.datasource.username=<username>
spring.datasource.password=<password>
These are the credentials you will use to log in to the H2 console. By default, Spring Boot configures the application to connect to an in-memory store, so the database is volatile and its data is lost whenever the application restarts. To keep the data, switch to file-based storage by setting the property:
spring.datasource.url=jdbc:h2:file:<absolute_path_to_file>
Then start JedAI-WebApp by executing:
./mvnw spring-boot:run
Finally, open your browser and go to localhost:8080. JedAI should be running in your browser!
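If you prefer not to edit the properties file, the file-based URL can also be supplied at launch time. The sketch below relies on Spring Boot's standard binding of environment variables to properties (`SPRING_DATASOURCE_URL` for `spring.datasource.url`). The helper `h2_file_url` and the path `/tmp/jedai-data/jedai` are illustrative assumptions, not part of JedAI-WebApp.

```shell
# h2_file_url is a hypothetical helper: it turns an absolute file path
# into an H2 file-based JDBC URL and rejects anything else.
h2_file_url() {
  case "$1" in
    /*) printf 'jdbc:h2:file:%s\n' "$1" ;;
    *)  echo "expected an absolute path, got: $1" >&2
        return 1 ;;
  esac
}

# Example launch (the path is a placeholder, adjust it to your machine):
#   SPRING_DATASOURCE_URL="$(h2_file_url /tmp/jedai-data/jedai)" ./mvnw spring-boot:run
```

An environment variable takes precedence over the value in the properties file, so this leaves the checked-in configuration untouched.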