mirror of https://github.com/mrabarnett/mrab-regex.git synced 2025-10-06 06:12:38 +02:00

84 Commits

Author SHA1 Message Date
Matthew Barnett
667f171a0b Updated version for new release. 2022-01-18 18:18:44 +00:00
Matthew Barnett
dde2d98360 Git issue 443: 2021.11.9 source release is missing C headers
Updated version.
2021-11-09 22:16:38 +00:00
Matthew Barnett
1c30637ec7 Git issue 442: Fuzzy regex matching doesn't seem to test insertions correctly 2021-11-09 19:43:42 +00:00
Matthew Barnett
6ce0bda712 Git issue 435: Unmatched groups: sub vs subf
A similar fix also applies to expandf: unmatched groups should expand to an empty string.
2021-11-02 17:13:53 +00:00
Matthew Barnett
f2c5da72e3 Further changes for migration to Github. 2021-11-01 19:27:48 +00:00
Matthew Barnett
bf5e239c0e Git issue 433: Disagreement between fuzzy_counts and fuzzy_changes
Fuzzy changes were sometimes not removed when backtracking.
2021-10-23 00:10:40 +01:00
Matthew Barnett
26d37df1c3 Removed Apple Silicon build from .travis.yml because it's not currently codesigned by Travis CI. 2021-10-21 02:21:13 +01:00
Matthew Barnett
d81009be69 Git issue 428: match hangs on the following example - possible infinite loop?
Fixed miscalculation of total error count when there's more than one fuzzy term.
2021-10-08 23:56:28 +01:00
Matthew Barnett
25638c20a4 Git issue 427: Possible bug with BESTMATCH 2021-09-30 23:10:38 +01:00
Matthew Barnett
3dd42455df Updated to Unicode 14.0.0. 2021-09-24 21:07:41 +01:00
Matthew Barnett
e3b477cc93 Git issue 421: 2021.8.27 results in "Fatal Python error: Segmentation fault"
Fixed problems with use of fast searching tables in opposite direction.
2021-08-27 19:39:30 +01:00
Matthew Barnett
ac7ce3f5ee Git issue 420: segmentation fault in finditer (maybe others)
Fixed bugs in fast searches in reverse direction.
2021-08-27 01:40:23 +01:00
Matthew Barnett
75211751d9 Updated version. 2021-08-21 21:22:35 +01:00
Matthew Barnett
26a320f29b Forgot to update version! 2021-08-03 18:41:15 +01:00
Matthew Barnett
ae6bb1ebd3 Additional fix for Git issue 415. 2021-07-06 00:03:35 +01:00
Matthew Barnett
5d6f9cb115 Git issue 415: Fuzzy character restrictions don't apply to insertions at "right edge" 2021-07-05 20:41:03 +01:00
Matthew Barnett
32453c1378 Git issue 407: API is not a drop-in replacement for python's re when it comes to typing
Now exports Match object as well as Pattern object.

Git issue 414: Memory optimization questions

sys.getsizeof returns a more accurate size of a pattern object. It includes the size of internal data, but, as is the norm, does not include the size of public objects.
2021-07-01 23:27:31 +01:00
Matthew Barnett
1e6986b92f Git issue 408: regex fails with a quantified backreference but succeeds with repeated backref
Git issue 407: API is not a drop-in replacement for python's re when it comes to typing
2021-04-04 17:48:38 +01:00
Matthew Barnett
0321186b78 Git issue 403: Fuzzy matching with wrong distance (unnecessary substitutions)
Reworked the fuzzy matching code.
2021-03-17 20:11:27 +00:00
Matthew Barnett
5de64f7553 Git issue 394: Unexpected behaviour in fuzzy matching with limited character set with IGNORECASE flag 2020-11-13 01:54:29 +00:00
Matthew Barnett
d5a5016c1b Update version. 2020-11-11 16:13:28 +00:00
Matthew Barnett
b693a1fba7 Git issue 362: Any LICENSE work for this project?
Changed licence to Apache 2.0 and added licence file.
2020-10-28 22:28:14 +00:00
Matthew Barnett
92989b561a Git issue 387: Compilation flag to avoid storing compiled regexp in internal cache
Slight reversion/revision. You can prevent explicitly-compiled patterns from being cached by using "cache_all(False)".
2020-10-23 03:01:51 +01:00
Matthew Barnett
22c5f461b4 Git issue 387: Compilation flag to avoid storing compiled regexp in internal cache
No longer caches patterns that are compiled explicitly.
2020-10-22 23:43:57 +01:00
Matthew Barnett
fa9def53cf Git issue 386: GCC 10 warnings
Fixed bugs in fuzzy_match_string_fld and fuzzy_match_group_fld.

Added more braces around data in some Unicode tables.
2020-10-15 13:27:10 +01:00
Matthew Barnett
818685f09c Git issue 385: Comments in expressions
Didn't parse regex comments properly when in VERBOSE mode.
2020-10-11 03:20:49 +01:00
Matthew Barnett
5c657a4473 Git issue 383: Memory Error - regex.findall
The problem was caused by a lazy repeat looping forever, growing the backtracking stack. Greedy repeats were OK.
2020-09-27 02:43:56 +01:00
Matthew Barnett
fe9fb05890 Git issue 377: request: \h for horizontal space
Added \h as an alias to [[:blank:]].
2020-07-14 23:44:08 +01:00
Matthew Barnett
fb025ba271 Git issue 376: Is the \L option as efficient as it can be?
Improved performance of string sets.
2020-06-07 23:24:47 +01:00
Matthew Barnett
be28c28db9 Git issue 376: Is the \L option as efficient as it can be?
Switched StringSet to use fallback method due to inefficiencies in the engine. Needs more investigation.
2020-06-07 02:19:06 +01:00
Matthew Barnett
af87091c93 Git issue 372: Regression from 2020.4.4 -> 2020.5.7 in non-fuzzy matching pattern
Changed the 'state' member that's tested in is_repeat_guarded for a fuzzy match. The previously-used member wasn't initialised in a non-fuzzy match. The new test is a better one to use anyway.
2020-05-14 14:26:25 +01:00
Matthew Barnett
98fea72cf3 Git issue 371: Specifying character set when fuzzy-matching allows characters not in the set
fuzzy_ext_match and fuzzy_ext_match_group_fld didn't support sets!
2020-05-13 17:46:26 +01:00
Matthew Barnett
2a97a7df1f Git issue 370: Confusions about Fuzzy matching behavior (prob a bug?) 2020-05-07 18:01:07 +01:00
Matthew Barnett
c660527507 Updated to Unicode 13.0.0. 2020-04-04 21:05:28 +01:00
Matthew Barnett
2a0cb832d2 Git issue 365: Memory leak occurs in fuzzy match at some substitution use cases 2020-02-20 20:31:27 +00:00
Matthew Barnett
42ec250563 Git issue #364: Contradictory values in fuzzy_counts and fuzzy_changes 2020-02-18 23:40:21 +00:00
Matthew Barnett
0c850822f5 Issue 357: New exception "ValueError: unused keyword argument" breaks use case
Added ignore_unused keyword argument.

Issue 359: 2020.1.7 source distribution release contains \r\n line endings

Fixed line endings for source distribution.

Issue 360: Invalid modeline in `_regex.c`

Removed vim modeline.
2020-01-07 23:38:06 +00:00
Matthew Barnett
7ff4285730 Version wasn't updated! 2020-01-07 03:37:52 +00:00
Matthew Barnett
3213080a41 Fix to previous change. 2020-01-07 03:28:59 +00:00
Matthew Barnett
7b941c1eea Issue 354: Raise over unrecognised kwargs, to help protect against typos? 2020-01-07 02:58:10 +00:00
mrab
6a85dd6242 Hg issue 353: fuzzy changes negative indexes
Fuzzy change positions were off by 1 for deletions.
2019-12-21 00:25:28 +00:00
mrab
d0e072befb Another complaint from Linux. 2019-12-18 00:13:16 +00:00
mrab
9bb16cf82e New release and upload because of problem with source distribution. 2019-12-17 21:17:10 +00:00
mrab
58413142a8 New release and upload because of previous issues. 2019-12-17 18:23:27 +00:00
mrab
1650e99459 Discarded changes for Linux. 2019-12-16 21:50:15 +00:00
mrab
2a76c58aa4 Backed out changeset: f57e64d2085b 2019-12-16 21:41:08 +00:00
Zsolt Dollenstein
3972ac0043 add bdist_wheel command to setup.py 2019-12-15 11:23:05 +00:00
mrab
4df82d18d0 Hg issue 348: '\X' (extended grapheme cluster) can't pass Unicode's GraphemeBreakTest (12.1.0)
Fixed a couple of bugs in unicode_at_grapheme_boundary.
2019-12-09 19:16:20 +00:00
mrab
94c2a51d16 Made "Additional Features" linkable.
Updated setup.

Minor bug fix.
2019-10-31 22:42:49 +00:00
mrab
d6ba23d2eb Hg issue 338: specifying allowed characters when fuzzy-matching
Added character testing to a fuzzy constraint.
2019-08-19 18:26:22 +01:00