mirror of https://github.com/jfilter/clean-text.git synced 2021-09-19 22:32:58 +03:00

Commit Graph

  • 66921d3abd 0.5.0 master 0.5.0 Johannes Filter 2021-08-31 22:37:55 +02:00
  • 4cfbfb9cfd Switch from Travis to GHA for tests Johannes Filter 2021-08-31 21:46:50 +02:00
  • 2f297db119 Format pyproject.toml Johannes Filter 2021-08-31 21:46:35 +02:00
  • cdeea8e137 Fix accidental removal of accents (close #17) Johannes Filter 2021-08-26 02:08:40 +02:00
  • 490a0deb26 Add nlpretext to related work Johannes Filter 2021-07-31 13:47:33 +02:00
  • a1849c3ceb 0.4.0 0.4.0 Johannes Filter 2021-04-12 20:05:08 +02:00
  • 910df7bc6c formatting Johannes Filter 2021-04-12 20:04:31 +02:00
  • 5fcf48f27b update deps Johannes Filter 2021-04-12 20:01:14 +02:00
  • 22014c008a fix emoji removal Johannes Filter 2021-04-12 19:56:57 +02:00
  • 90946c2cc8 fix wild whitespace bugs Johannes Filter 2021-02-16 22:33:32 +01:00
  • 00cad45b1c Merge branch 'master' of github.com:jfilter/clean-text Johannes Filter 2021-02-15 23:20:12 +01:00
  • 1c15db66c9 add option to keep 2 newlines + strip lines Johannes Filter 2021-02-15 23:20:03 +01:00
  • ff88579088 fix emoji import Johannes Filter 2021-02-15 23:16:56 +01:00
  • c791d36ab6 add related work Johannes Filter 2020-11-11 22:27:41 +01:00
  • 6f6edb5503 add related work Johannes Filter 2020-11-01 21:46:13 +01:00
  • ba9f9e6486 0.3.0 0.3.0 Johannes Filter 2020-10-18 01:29:58 +02:00
  • c5b9bcc643 make it possible to use it as clean-text Johannes Filter 2020-10-18 01:28:31 +02:00
  • 375e049f5b remove obsolete variable Johannes Filter 2020-10-18 01:27:22 +02:00
  • 355bc7d0bc make to_ascii work when unidecode isn't installed Johannes Filter 2020-10-18 01:17:45 +02:00
  • 4a2de47a59 choose simpler ascii example so it works with both variants Johannes Filter 2020-10-18 00:52:40 +02:00
  • e6a0f8212e test with presence and absence of unidecode Johannes Filter 2020-10-18 00:45:57 +02:00
  • 45a3d0df6e set specific parameter to keep or remove emojis (close #11) Johannes Filter 2020-10-18 00:45:12 +02:00
  • 67e343dd08 improve READMe Johannes Filter 2020-10-17 23:20:30 +02:00
  • 8a688b8d1f make it possible to replace punctuations (close #12) Johannes Filter 2020-10-17 21:47:17 +02:00
  • d364b3e2a9 make normalization of whitespace optional (close #13) Johannes Filter 2020-10-16 22:55:26 +02:00
  • d6d9120dca improve email regex (fix #14) Johannes Filter 2020-10-16 00:43:34 +02:00
  • 335ab64820 improve phone regex (fix #10) Johannes Filter 2020-10-16 00:18:06 +02:00
  • d32aa94ad2 0.2.1 0.2.1 Johannes Filter 2020-07-24 22:40:25 +02:00
  • c92b6315a9 display README on pypi Johannes Filter 2020-07-24 22:40:02 +02:00
  • a34f6c80c2 0.2.0 0.2.0 Johannes Filter 2020-07-24 22:34:02 +02:00
  • ffe2b7b205 improve README Johannes Filter 2020-07-24 22:30:14 +02:00
  • 58b269c6b8 remove localhost urls (fix #8) Johannes Filter 2020-07-24 22:26:19 +02:00
  • 36a8213cd3 minor improvements Johannes Filter 2020-07-24 21:51:07 +02:00
  • 5077c4fa28 case insensitive lang param Johannes Filter 2020-07-24 21:50:34 +02:00
  • 28d379b478 don't test for 3.5 on travis (cause it requires a setup.py) Johannes Filter 2020-07-24 21:47:48 +02:00
  • a1990a2287 switch to poetry Johannes Filter 2020-07-24 21:29:48 +02:00
  • b656e3b497 add missing __version__ Johannes Filter 2020-07-23 20:11:52 +02:00
  • 0a49d23bc2 improve license description Johannes Filter 2020-07-23 20:11:28 +02:00
  • 3f8b1a2725 add sponsoring Johannes Filter 2019-11-16 01:24:56 +01:00
  • 52956f8536 typo Johannes Filter 2019-04-29 16:10:20 +02:00
  • 570311a37b improve README Johannes Filter 2019-04-24 22:30:04 +02:00
  • 357da86e6f 0.1.1 0.1.1 Johannes Filter 2019-04-24 22:19:06 +02:00
  • 76fa3f0cdf improve README Johannes Filter 2019-04-24 22:18:38 +02:00
  • ba7a4cceca fix setup Johannes Filter 2019-04-24 22:18:21 +02:00
  • d3db78bd9a typo Johannes Filter 2019-04-24 22:17:29 +02:00
  • 1054e8d039 empty string if input is None Johannes Filter 2019-04-24 21:49:17 +02:00
  • 4fd108bb63 0.1.0 0.1.0 Johannes Filter 2019-04-24 18:40:20 +02:00
  • 5f7a32f1f6 further improve docs Johannes Filter 2019-04-24 18:35:36 +02:00
  • 220b63fe87 improve docs Johannes Filter 2019-04-24 18:21:27 +02:00
  • c813fc4df0 improve README Johannes Filter 2019-03-25 11:52:22 +01:00
  • 4d77cc7d6d add more arguments Johannes Filter 2019-03-22 22:43:06 +01:00
  • bde9ca974b simplify remove punct Johannes Filter 2019-03-22 22:42:51 +01:00
  • b6306daeaa add special german handling Johannes Filter 2019-03-22 22:21:52 +01:00
  • abafa9b971 fix license Johannes Filter 2019-03-22 20:26:13 +01:00
  • adde084782 improve code quality Johannes Filter 2019-03-22 20:09:05 +01:00
  • 760fd80237 create strange quote regex Johannes Filter 2019-03-22 19:59:26 +01:00
  • e8eebee3f0 clean python2 heritage Johannes Filter 2019-03-22 19:52:30 +01:00
  • 29ef382136 make unidecode optional Johannes Filter 2019-03-22 19:48:34 +01:00
  • fee738833a fix pip file Johannes Filter 2019-03-22 19:46:13 +01:00
  • 58e92cbda3 improve URL regex Johannes Filter 2019-02-09 18:27:47 +01:00
  • b471f7ffe2 remove unnecessary functions Johannes Filter 2019-02-09 18:26:44 +01:00
  • 994b7e35f1 add option to remove line breaks alltogether Johannes Filter 2019-01-11 17:47:26 +01:00
  • 4ee2a244b8 simply replace all numbers with 0 Johannes Filter 2019-01-08 13:10:07 +01:00
  • b312538cea fix test Johannes Filter 2019-01-03 20:58:35 +01:00
  • 1e6a59f271 don't fail on decoding error Johannes Filter 2018-12-21 22:06:36 +01:00
  • 2449a24bd8 fix Johannes Filter 2018-12-21 22:01:56 +01:00
  • 5ad0828792 add missing parts Johannes Filter 2018-12-21 21:57:21 +01:00
  • 370e9c039c typo Johannes Filter 2018-12-21 21:53:07 +01:00
  • 8dabf21a81 add travis Johannes Filter 2018-12-21 21:52:55 +01:00
  • cd75b43b7a don't fear the labor Johannes Filter 2018-12-21 21:48:17 +01:00
  • d0c1eb6077 getting stuff done Johannes Filter 2018-12-21 21:18:08 +01:00
  • 68905c51ed Initial commit Johannes Filter 2018-12-06 23:54:16 +01:00