Johannes Filter
|
66921d3abd
|
0.5.0
0.5.0
|
2021-08-31 22:37:55 +02:00 |
|
Johannes Filter
|
4cfbfb9cfd
|
Switch from Travis to GHA for tests
- drop Python 3.5 support while doing so
|
2021-08-31 22:34:08 +02:00 |
|
Johannes Filter
|
2f297db119
|
Format pyproject.toml
|
2021-08-31 21:46:35 +02:00 |
|
Johannes Filter
|
cdeea8e137
|
Fix accidental removal of accents (close #17)
|
2021-08-26 02:08:40 +02:00 |
|
Johannes Filter
|
490a0deb26
|
Add nlpretext to related work
|
2021-07-31 13:47:33 +02:00 |
|
Johannes Filter
|
a1849c3ceb
|
0.4.0
0.4.0
|
2021-04-12 20:05:08 +02:00 |
|
Johannes Filter
|
910df7bc6c
|
formatting
|
2021-04-12 20:04:31 +02:00 |
|
Johannes Filter
|
5fcf48f27b
|
update deps
|
2021-04-12 20:01:14 +02:00 |
|
Johannes Filter
|
22014c008a
|
fix emoji removal
|
2021-04-12 19:56:57 +02:00 |
|
Johannes Filter
|
90946c2cc8
|
fix wild whitespace bugs
|
2021-02-16 22:33:32 +01:00 |
|
Johannes Filter
|
00cad45b1c
|
Merge branch 'master' of github.com:jfilter/clean-text
|
2021-02-15 23:20:12 +01:00 |
|
Johannes Filter
|
1c15db66c9
|
add option to keep 2 newlines + strip lines
|
2021-02-15 23:20:03 +01:00 |
|
Johannes Filter
|
ff88579088
|
fix emoji import
|
2021-02-15 23:16:56 +01:00 |
|
Johannes Filter
|
c791d36ab6
|
add related work
|
2020-11-11 22:27:41 +01:00 |
|
Johannes Filter
|
6f6edb5503
|
add related work
|
2020-11-01 21:46:13 +01:00 |
|
Johannes Filter
|
ba9f9e6486
|
0.3.0
0.3.0
|
2020-10-18 01:29:58 +02:00 |
|
Johannes Filter
|
c5b9bcc643
|
make it possible to use it as clean-text
|
2020-10-18 01:28:31 +02:00 |
|
Johannes Filter
|
375e049f5b
|
remove obsolete variable
|
2020-10-18 01:27:22 +02:00 |
|
Johannes Filter
|
355bc7d0bc
|
make to_ascii work when unidecode isn't installed
|
2020-10-18 01:17:45 +02:00 |
|
Johannes Filter
|
4a2de47a59
|
choose simpler ascii example so it works with both variants
|
2020-10-18 00:52:40 +02:00 |
|
Johannes Filter
|
e6a0f8212e
|
test with presence and absence of unidecode
|
2020-10-18 00:45:57 +02:00 |
|
Johannes Filter
|
45a3d0df6e
|
set specific parameter to keep or remove emojis (close #11)
|
2020-10-18 00:45:12 +02:00 |
|
Johannes Filter
|
67e343dd08
|
improve READMe
|
2020-10-17 23:20:30 +02:00 |
|
Johannes Filter
|
8a688b8d1f
|
make it possible to replace punctuations (close #12)
|
2020-10-17 21:47:17 +02:00 |
|
Johannes Filter
|
d364b3e2a9
|
make normalization of whitespace optional (close #13)
|
2020-10-16 22:55:26 +02:00 |
|
Johannes Filter
|
d6d9120dca
|
improve email regex (fix #14)
|
2020-10-16 00:43:34 +02:00 |
|
Johannes Filter
|
335ab64820
|
improve phone regex (fix #10)
|
2020-10-16 00:18:06 +02:00 |
|
Johannes Filter
|
d32aa94ad2
|
0.2.1
0.2.1
|
2020-07-24 22:40:25 +02:00 |
|
Johannes Filter
|
c92b6315a9
|
display README on pypi
|
2020-07-24 22:40:02 +02:00 |
|
Johannes Filter
|
a34f6c80c2
|
0.2.0
0.2.0
|
2020-07-24 22:34:02 +02:00 |
|
Johannes Filter
|
ffe2b7b205
|
improve README
|
2020-07-24 22:30:14 +02:00 |
|
Johannes Filter
|
58b269c6b8
|
remove localhost urls (fix #8)
|
2020-07-24 22:26:19 +02:00 |
|
Johannes Filter
|
36a8213cd3
|
minor improvements
|
2020-07-24 21:51:07 +02:00 |
|
Johannes Filter
|
5077c4fa28
|
case insensitive lang param
|
2020-07-24 21:50:34 +02:00 |
|
Johannes Filter
|
28d379b478
|
don't test for 3.5 on travis (cause it requires a setup.py)
|
2020-07-24 21:47:48 +02:00 |
|
Johannes Filter
|
a1990a2287
|
switch to poetry
|
2020-07-24 21:29:48 +02:00 |
|
Johannes Filter
|
b656e3b497
|
add missing __version__
|
2020-07-23 20:11:52 +02:00 |
|
Johannes Filter
|
0a49d23bc2
|
improve license description
|
2020-07-23 20:11:28 +02:00 |
|
Johannes Filter
|
3f8b1a2725
|
add sponsoring
|
2019-11-16 01:24:56 +01:00 |
|
Johannes Filter
|
52956f8536
|
typo
|
2019-04-29 16:10:20 +02:00 |
|
Johannes Filter
|
570311a37b
|
improve README
|
2019-04-24 22:30:04 +02:00 |
|
Johannes Filter
|
357da86e6f
|
0.1.1
0.1.1
|
2019-04-24 22:19:06 +02:00 |
|
Johannes Filter
|
76fa3f0cdf
|
improve README
|
2019-04-24 22:18:38 +02:00 |
|
Johannes Filter
|
ba7a4cceca
|
fix setup
|
2019-04-24 22:18:21 +02:00 |
|
Johannes Filter
|
d3db78bd9a
|
typo
|
2019-04-24 22:17:29 +02:00 |
|
Johannes Filter
|
1054e8d039
|
empty string if input is None
|
2019-04-24 21:49:17 +02:00 |
|
Johannes Filter
|
4fd108bb63
|
0.1.0
0.1.0
|
2019-04-24 18:40:20 +02:00 |
|
Johannes Filter
|
5f7a32f1f6
|
further improve docs
|
2019-04-24 18:35:36 +02:00 |
|
Johannes Filter
|
220b63fe87
|
improve docs
|
2019-04-24 18:21:27 +02:00 |
|
Johannes Filter
|
c813fc4df0
|
improve README
|
2019-03-25 11:52:22 +01:00 |
|