Skip to content

isawnyu/concordia.matchtool

Repository files navigation

Read me for concordia.matchtool
=====================

concordia.matchtool is a framework for defining and executing rulesets to 
effect matching of records in two datasets. The original use case is getting
matches between print-style BAtlas citations held in the IAph2007 
inscriptions collection and Barrington Atlas IDs. Another early use case
is matching content extracted from the Barrington Atlas Directory
compilation files against feature data digitized from Barrington Atlas
compilation maps.

Rights, origins and credit
================

concordia.matchtool was written by Tom Elliott for the Institute for the 
Study of the Ancient World under the rubric of the Concordia project 
(http://concordia.atlantides.org) with funding provided by the National 
Endowment for the Humanities.

Copyright (c) 2008 Institute for the Study of the Ancient World, New York 
University

This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program (LICENSE.txt).  If not, see 
<http://www.gnu.org/licenses/>.


Configuration
=========

TBD


Dependencies
=========

See DEPENDENCIES.txt


Tests
===

To run tests, cd into the top-level package directory (idp.contenttool) and type:

%% python setup.py test


Components and using them
===================

Most modules and functions have docstrings, which explain their constituent
classes and methods. This information is not repeated here in order to avoid the
dreaded multiple-source documentation disease. 

The tests are a good way to learn how the different components work.

You can view a formatted version of the docstrings, plus additional information 
automatically gathered about subordinate and inherited classes and methods, 
using the python built-in help() function, for example:

>>> import concordia.matchtool.match as m
>>> help(m)


Aphrodisias Inscriptions Places Mentioned

>>> python alistgeo.py

writes results to concordia/matchtool/data/results.xml

it's still got rudimentary rules and reporting, but it's a start






concordia/matchtool/rules.py defines classes for building, managing and executing rulesets

concordia/matchtool/dset.py defines classes for managing datasets to be related (would/could build parsers to read data in various formats 
into this internal data structure, which rules.py then knows how to do stuff with)

concordia/matchtool/data.cfg provides configuration information (like relative or absolute paths to data files)

concordia/matchtool/tests/rule.txt contains some explanatory tests for the classes and methods in concordia/matchtool/rules.py

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages