root/BADataMunger/trunk/README.txt

Revision 1444, 3.1 kB (checked in by thomase, 2 months ago)

code and changes for converting raw directory html in well-formed xhtml (cleaned up) and a semantic xml that combines elements of our old frankenformat and the more recent batlas id schema

Line 
1 Read me for BADataMunger
2 =================
3
4 BADataMunger is a collection of routines intended for manipulating, mining
5 and querying Classical Atlas Project compilation data (i.e., Barrington
6 Atlas compilation data) for upload to Pleiades or other functions. Details of
7 major components are provided below. Please note, some code is out of
8 date. Other code is ugly. Hopefully someday soon we won't need this stuff
9 anymore.
10
11 Rights, origins, credit and blame
12 ====================
13
14 Much of the code in BADataMunger was written by Tom Elliott for either
15 the Ancient World Mapping Center or the Institute for the Study of the
16 Ancient World (ISAW) under the rubric of the Pleiades Project. Some funding
17 for this work was provided by a grant from the National Endowment for the
18 Humanities for the initial development and interoperability prototyping
19 of Pleiades; additional funding has been provided by ISAW itself.
20
21 Copyright (c) 2008 Institute for the Study of the Ancient World, New York
22 University and Ancient World Mapping Center, University of North
23 Carolina at Chapel Hill
24
25 This program is free software: you can redistribute it and/or modify
26 it under the terms of the GNU General Public License as published by
27 the Free Software Foundation, either version 3 of the License, or
28 (at your option) any later version.
29
30 This program is distributed in the hope that it will be useful,
31 but WITHOUT ANY WARRANTY; without even the implied warranty of
32 MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
33 GNU General Public License for more details.
34
35 You should have received a copy of the GNU General Public License
36 along with this program (LICENSE.txt).  If not, see <http://www.gnu.org/licenses/>.
37
38 BADataMunger contains helper functions for determining calculating
39 permutations (xpermutations.py). It was written by Ulrich Hoffmann and
40 distributed via ActiveState Code (formerly the Oreilly/Activestate Python Cookbook).
41 At the time of download, it was marked "public domain", and Activestate now
42 applies the terms of the Python License  ( http://www.python.org/psf/license/ ).
43
44 Dependencies
45 =========
46
47 See DEPENDENCIES.txt
48
49
50 Tests
51 ====
52
53 There are no comprehensive tests. See "blame" above.
54
55
56 Components and using them
57 ===================
58
59 Some modules, classes and functions have docstrings. >>>help(modulename) will
60 sometimes provide useful information.
61
62 Here are a few of the main tasks supported herein:
63
64 Creating Barrington Atlas IDs
65
66     module: batlaspipe.py
67    
68    
69 Converting BA Directory files to XHTML or home-brewed XML that includes BAtlas
70 IDs
71
72     module: dir2xml.py
73     deficiencies:
74     * does not currently calculate descriptions as expected for old-school Pleiades
75     * there are problems with roads (some information not coming through?)
76     * not all creators contributors are in the config files?
77     * maybe id-stamp rows in XHTML with old-style ids so we can link up with
78        parsed data?
79
80
81 Parsing out bibliography from directory files
82
83     module: pipebiblio.py
84    
85    
86 Parsing feature information out of directory files and munging it together with geodata
87
88     module: pipegeo.py
89    
90    
91 Older or supporting modules may or may not work.
92
93
94
Note: See TracBrowser for help on using the browser.