Changeset 812
- Timestamp:
- 05/23/07 13:13:23 (2 years ago)
- Files:
-
- BADataMunger/trunk/wordnormalizer.py (modified) (1 diff)
Legend:
- Unmodified
- Added
- Removed
- Modified
- Copied
- Moved
BADataMunger/trunk/wordnormalizer.py
r784 r812 5 5 6 6 norms = [ 7 ('‑','-'), 8 (' ',' '), 9 (' ',' ') 7 ('‑','-'), # non-breaking hyphen 8 (' ',' '), # non-breaking space 9 (' ',' '), # non-breaking space bis 10 ('ߪ','...') # horizontal ellipsis 10 11 ] 11 12
