Information for GNU grep developers

auto

bug-grep
bug-grep
subscribe
X-BeenThere: bug-grep@gnu.org

X-Savane-Project: grep
X-Savane-Tracker: bugs

X-Savane-Project: grep
X-Savane-Tracker: patch

X-Savane-Project: grep
X-Savane-Tracker: support

grep-commit
grep-commit
subscribe
X-BeenThere: grep-commit@gnu.org

bug-gnu-utils
bug-grep
subscribe
X-BeenThere: bug-gnu-utils@gnu.org

bug-grep
CVS_RSH=ssh cvs -z3 -d:ext:anoncvs@savannah.gnu.org:/cvsroot/grep co grep

grep-commit
To: grep-cvs-logs@gnu.org

grep-commit
To: grep-cvs-diffs@gnu.org

CVS_RSH=ssh cvs -z3 -d:ext:anoncvs@savannah.gnu.org:/webcvs/grep co grep

grep-commit
To: grep-webcvs-logs@gnu.org

grep-commit
To: grep-webcvs-diffs@gnu.org

grep-commit
bug-grep
2.5.2
=====
Our main goal for grep 2.5.2 is to get sane performance with utf-8.
That can be achieved by the patches written by Tim Waugh for Red Hat.

Besides that, I can do some changes in the infrastructure, so that
I can "breathe":

1) rewrite the configure.in script, perhaps also Makefile.am
2) set up for gnulib-tool --import
3) improve the test infrastructure

I'm afraid I have to do 1) myself, and it is closely tied with 2),
so they probably have to be done together.

If someone likes awk and wants to help with 3), that would be welcome.
In short, there should be only one awk script for the .test-->.script
rule.  The header of each .test file should state some details,
like which command to run, e.g. "grep -E".  We also have to invent
a way to collect the test cases for non-C locales: either by
running the whole set twice, or by creating separate .test files.
The "make check" goal should run these if the computer has a locale
like en_US.utf8 installed.
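
The locale condition on "make check" could be sketched as follows.
This is only an illustration in Python, not part of the grep build:
the helper names (normalize, has_locale, installed_locales) are
hypothetical, and it assumes a system that provides "locale -a".

```python
import subprocess


def normalize(name):
    """Lower-case a locale name and drop '-' so that
    'en_US.UTF-8' and 'en_US.utf8' compare equal."""
    return name.strip().lower().replace("-", "")


def has_locale(available, wanted="en_US.utf8"):
    """Return True if `wanted` is among the locale names in `available`."""
    return normalize(wanted) in {normalize(name) for name in available}


def installed_locales():
    """List installed locales by running `locale -a` (assumed to exist)."""
    result = subprocess.run(
        ["locale", "-a"], capture_output=True, text=True, check=False
    )
    return result.stdout.splitlines()


# Typical use: skip the non-C test cases when the locale is missing.
# if has_locale(installed_locales()):
#     ...run the utf8 test set...
```

Normalizing the names matters because the same locale is spelled
differently across systems (en_US.utf8, en_US.UTF-8).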

After completing these, we can:
4) check in the patches for the sync of dfa.c with GNU awk
5) other small patches which wait for a test case
6) process the Red Hat patches

After 6), I should repeat Tim's measurements and see whether the utf8
performance improved.

Independently, I'd like to see
7) some _minimal_ cleanup of the grep(), grepdir(), recursion
   (the "main loop") and fix --directories=read
8) mark the -P option clearly as "experimental";

Well, that'll be perhaps enough for a release.

2.5.3
=====
Fix the combinations:
 * -i -o
 * --colour -i
 * -o -b
 * -o and zero-width matches
Go through the bug list in my mailbox and fix what is fixable.
Fix bugs reported with 2.5.2.

2.6.x
=====
The following should go here:
 - upgrade to current regex.c from glibc,
 - new functionality,
 - fixes for -P,
 - heavy refactoring.

dfa.[ch]
make check
grep.pot
po
cvs -d:pserver:anoncvs@sources.redhat.com:/cvs/gettext co gettext/gettext-runtime/ABOUT-NLS

anonymous
$HOME/.cvspass
/1 :pserver:anoncvs@sources.redhat.com:2401/cvs/gettext Ay=0=a%0bZ

make dist
[= =]
[. .]
--ignore-case
0049;LATIN CAPITAL LETTER I;Lu;0;L;;;;;N;;;;0069;
0069;LATIN SMALL LETTER I;Ll;0;L;;;;;N;;;0049;;0049
0130;LATIN CAPITAL LETTER I WITH DOT ABOVE;Lu;0;L;0049 0307;;;;N;LATIN CAPITAL LETTER I DOT;;;0069;
0131;LATIN SMALL LETTER DOTLESS I;Ll;0;L;;;;;N;;;0049;;0049

U+0049  0x49
U+0069  0x69
U+0130  0xC4 0xB0
U+0131  0xC4 0xB1
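
The UTF-8 byte sequences listed above can be checked with a few
lines of Python.  This is just a verification sketch; the helper name
utf8_hex is hypothetical.

```python
# Code points of the four "i"-like letters discussed in this section.
CODE_POINTS = [0x0049, 0x0069, 0x0130, 0x0131]


def utf8_hex(code_point):
    """Return the UTF-8 encoding of a code point as space-separated hex bytes."""
    encoded = chr(code_point).encode("utf-8")
    return " ".join("0x{:02X}".format(byte) for byte in encoded)


for cp in CODE_POINTS:
    print("U+{:04X}  {}".format(cp, utf8_hex(cp)))
```
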
lc(I) = i, uc(I) = I
lc(i) = i, uc(i) = I
lc(İ) = i, uc(İ) = İ
lc(ı) = ı, uc(ı) = I

lc()
uc()
--ignore-case
if (lc(input_wchar) == lc(pattern_wchar))

  \in  I  i  İ  ı
pat\   ----------
"I" |  Y  Y  Y  n
"i" |  Y  Y  Y  n
"İ" |  Y  Y  Y  n
"ı" |  n  n  n  Y

if (uc(input_wchar) == uc(pattern_wchar))

  \in  I  i  İ  ı
pat\   ----------
"I" |  Y  Y  n  Y
"i" |  Y  Y  n  Y
"İ" |  n  n  Y  n
"ı" |  Y  Y  n  Y

if (   lc(input_wchar) == lc(pattern_wchar)
    || uc(input_wchar) == uc(pattern_wchar))

  \in  I  i  İ  ı
pat\   ----------
"I" |  Y  Y  Y  Y
"i" |  Y  Y  Y  Y
"İ" |  Y  Y  Y  n
"ı" |  Y  Y  n  Y

if (      input_wchar  == pattern_wchar
    || lc(input_wchar) == pattern_wchar
    || uc(input_wchar) == pattern_wchar)

  \in  I  i  İ  ı
pat\   ----------
"I" |  Y  Y  n  Y
"i" |  Y  Y  Y  n
"İ" |  n  n  Y  n
"ı" |  n  n  n  Y

if (lc(uc(input_wchar)) == lc(uc(pattern_wchar)))

  \in  I  i  İ  ı
pat\   ----------
"I" |  Y  Y  Y  Y
"i" |  Y  Y  Y  Y
"İ" |  Y  Y  Y  Y
"ı" |  Y  Y  Y  Y

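The five match tables above can be regenerated mechanically from the
lc()/uc() mappings.  The following Python sketch does that; it uses
the mappings exactly as listed in this section rather than Python's
own str.lower()/str.upper(), which behave differently for U+0130.
The names LC, UC, PREDICATES and matrix are hypothetical.

```python
# I, i, I WITH DOT ABOVE (U+0130), DOTLESS I (U+0131)
CHARS = ["I", "i", "\u0130", "\u0131"]

# Mappings copied from the lc()/uc() list in this section.
LC = {"I": "i", "i": "i", "\u0130": "i", "\u0131": "\u0131"}
UC = {"I": "I", "i": "I", "\u0130": "\u0130", "\u0131": "I"}


def lc(c):
    return LC[c]


def uc(c):
    return UC[c]


# One entry per condition shown above; each takes (input, pattern).
PREDICATES = {
    "lc": lambda i, p: lc(i) == lc(p),
    "uc": lambda i, p: uc(i) == uc(p),
    "lc or uc": lambda i, p: lc(i) == lc(p) or uc(i) == uc(p),
    "pattern as-is": lambda i, p: i == p or lc(i) == p or uc(i) == p,
    "lc(uc)": lambda i, p: lc(uc(i)) == lc(uc(p)),
}


def matrix(predicate):
    """Rows are patterns, columns are inputs, as in the tables above."""
    return [
        "  ".join("Y" if predicate(inp, pat) else "n" for inp in CHARS)
        for pat in CHARS
    ]


for name, predicate in PREDICATES.items():
    print(name)
    for pat, row in zip(CHARS, matrix(predicate)):
        print("U+{:04X} |  {}".format(ord(pat), row))
    print()
```

Row labels are printed as code points so that the output stays ASCII
regardless of the terminal's locale.
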
--ignore-case
toCasefold()
SpecialCasing.txt
CaseFolding.txt
if (toCasefold(input_wchar_string) == toCasefold(pattern_wchar_string))

toCasefold_simple(U+0049) = U+0069
toCasefold_simple(U+0069) = U+0069
toCasefold_simple(U+0130) = U+0130
toCasefold_simple(U+0131) = U+0131

  \in  I  i  İ  ı
pat\   ----------
"I" |  Y  Y  n  n
"i" |  Y  Y  n  n
"İ" |  n  n  Y  n
"ı" |  n  n  n  Y

toCasefold_full(U+0049) = U+0069
toCasefold_full(U+0069) = U+0069
toCasefold_full(U+0130) = <U+0069, U+0307>
toCasefold_full(U+0131) = U+0131

0307;COMBINING DOT ABOVE;Mn;230;NSM;;;;;N;NON-SPACING DOT ABOVE;;;;

  \in  I  i  İ  ı
pat\   ----------
"I" |  Y  Y  *  n
"i" |  Y  Y  *  n
"İ" |  n  n  Y  n
"ı" |  n  n  n  Y

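The full case folding table above can be reproduced with Python's
str.casefold(), which applies Unicode full case folding.  In this
sketch the "*" cells are read as "the folded pattern matches only
part of the folded input"; that reading, and the helper names cell
and matrix, are assumptions made for illustration.

```python
# I, i, I WITH DOT ABOVE (U+0130), DOTLESS I (U+0131)
CHARS = ["I", "i", "\u0130", "\u0131"]


def cell(inp, pat):
    """Classify one (input, pattern) pair after full case folding."""
    folded_in = inp.casefold()
    folded_pat = pat.casefold()
    if folded_pat not in folded_in:
        return "n"  # no match at all
    if folded_pat == folded_in:
        return "Y"  # the pattern covers the whole folded input
    return "*"      # the pattern covers only part of the folded input


def matrix():
    """Rows are patterns, columns are inputs, as in the table above."""
    return ["  ".join(cell(inp, pat) for inp in CHARS) for pat in CHARS]


for pat, row in zip(CHARS, matrix()):
    print("U+{:04X} |  {}".format(ord(pat), row))
```

The partial matches arise because U+0130 folds to two code points,
<U+0069, U+0307>, while the patterns "I" and "i" fold to U+0069 alone.
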
toCasefold(U+0131)
U+0069
toUpperCase(U+0131)
U+0049
toCasefold_simple(U+0130)
toLowerCase(U+0131)
U+0069
toCasefold_full(U+0130)
U+0307
U+0069
echo 'AßBC' | grep -i 'Sb'

input:    U+0041 U+00DF U+0042 U+0043 U+000A
pattern:  U+0053 U+0062

CaseFolding-4.1.0.txt
toCasefold()
input:    U+0061 U+0073 U+0073 U+0062 U+0063 U+000A
pattern:                U+0073 U+0062

echo 'AßBC' | grep -i --only-matching 'Sb'
echo 'AßBC' | grep -i --color=always  'Sb'
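
The difficulty with these two commands can be made concrete by
folding the input while remembering where each folded character came
from.  The following Python sketch is an illustration only, not how
grep is implemented; fold_with_offsets and find_ignoring_case are
hypothetical names.

```python
def fold_with_offsets(text):
    """Case-fold `text` and record, for each folded character,
    the index of the original character that produced it."""
    folded = []
    origin = []
    for index, char in enumerate(text):
        for folded_char in char.casefold():
            folded.append(folded_char)
            origin.append(index)
    return "".join(folded), origin


def find_ignoring_case(text, pattern):
    """Return (start, end, original_indices) for the first match in the
    folded text, or None.  start/end are offsets in the folded text."""
    folded, origin = fold_with_offsets(text)
    folded_pattern = pattern.casefold()
    start = folded.find(folded_pattern)
    if start < 0:
        return None
    end = start + len(folded_pattern)
    return start, end, sorted(set(origin[start:end]))


print(find_ignoring_case("A\u00dfBC", "Sb"))
```

For this input the match begins at the second "s" of the folded
"ss", so it starts in the middle of the original U+00DF.  There is
no clean span of the original line to print or colorize.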

toCasefold()
towctrans()
wint_t
wint_t
-Z
-J
0x1F 0x8B
0x1F 0x9D
0x42 0x5A 0x68
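
The three byte sequences above are the leading magic numbers of the
gzip, compress and bzip2 formats respectively (the format names are
supplied here, they are not stated in the list).  A minimal Python
sketch of detecting them from the first bytes of a file could look
like this; MAGIC and sniff are hypothetical names.

```python
# (magic prefix, format name) pairs, in the order listed above.
MAGIC = [
    (bytes([0x1F, 0x8B]), "gzip"),
    (bytes([0x1F, 0x9D]), "compress"),
    (bytes([0x42, 0x5A, 0x68]), "bzip2"),
]


def sniff(header):
    """Return the compression format suggested by the leading bytes,
    or None if no known magic number matches."""
    for magic, name in MAGIC:
        if header.startswith(magic):
            return name
    return None


print(sniff(b"\x1f\x8b\x08\x00"))
print(sniff(b"BZh91AY"))
print(sniff(b"plain text"))
```
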
tests/spencer2.tests
dfa.[ch]
regex.[ch]
#ifdef
bug-grep@gnu.org
Free Software Foundation           Voice:  +1 617 542-5942
51 Franklin Street, Fifth Floor    Fax:    +1 617 542-2652
Boston MA 02110-1301 USA           Email:  gnu@gnu.org

The GNU Webmasters
webmasters@gnu.org

$Date: 2005/11/11 07:46:04 $
$Author: charles_levert $
savannah.gnu.org

Web site	http://www.debian.org/
Package database entry	Old stable http://packages.debian.org/oldstable/base/grep
Maintainer	Robert van der Meulen `<rvdm at debian.org>`
Package database entry	Stable http://packages.debian.org/stable/base/grep
Maintainer	Ryan M. Golbeck `<rmgolbeck at debian.org>`
Maintainer	Jeff Bailey `<jbailey at nisa.net>`
Package database entry	Testing http://packages.debian.org/testing/base/grep
Package database entry	Unstable http://packages.debian.org/unstable/base/grep
Maintainer	Anibal Monsalve Salazar `<anibal at debian.org>`
Maintainer	Santiago Ruano Rincon `<santiago at unicauca.edu.co>`
Bug tracking	http://bugs.debian.org/grep
Source package name	grep
Binary package name	grep
Entry updated	2005-11-08

Web site	http://fedora.redhat.com/
Web site	http://www.redhat.com/
Maintainer	Tim Waugh `<twaugh at redhat.com>`
Bug tracking	Red Hat Bugzilla http://bugzilla.redhat.com/
Managed repository	`cvs -d:pserver:anonymous@cvs.fedora.redhat.com:/cvs/dist co devel/grep`
Managed repository	http://cvs.fedora.redhat.com/viewcvs/devel/grep/
Source package name	grep
Binary package name	grep
Entry updated	2005-05-05

Web site	http://www.freebsd.org/
Bug tracking	http://www.freebsd.org/cgi/query-pr-summary.cgi?query
Managed repository	`CVS_RSH=ssh cvs -d:ext:freebsdanoncvs@anoncvs.FreeBSD.org:/home/ncvs co src/gnu/usr.bin/grep`
Managed repository	http://www.freebsd.org/cgi/cvsweb.cgi/src/gnu/usr.bin/grep/
Entry updated	2005-05-05

Web site	http://www.gentoo.org/
Package database entry	http://packages.gentoo.org/packages/?category=sys-apps;name=grep
Bug tracking	Gentoo Bugzilla http://bugs.gentoo.org/
Managed repository	http://www.gentoo.org/cgi-bin/viewcvs.cgi/sys-apps/grep/
Source package name	grep
Binary package name	grep
Entry updated	2005-05-05

Web site	http://www.mandrivalinux.com/
Bug tracking	Mandriva Bugzilla http://qa.mandriva.com/
Source package name	grep
Binary package name	grep
Entry updated	2005-05-05

Information for GNU grep developers

1 Generic GNU information

2 Mailing lists

2.1 The `bug-grep` mailing list

2.2 The `grep-commit` mailing list

2.3 Other deprecated mailing lists

3 Project page on Savannah

4 CVS repository

4.1 Source code

4.2 Web site

4.3 Tools

5 Roadmap

6 Release procedure

6.1 Source code compatibility with GNU awk

6.2 Internationalization (i18n) and localization (l10n)

6.3 Significant new features

6.4 Known limitations and failures

7 To do

7.1 Other implementations

7.2 POSIX

7.2.1 POSIX and `--ignore-case`

7.3 Unicode

7.3.1 Unicode and `--ignore-case`

7.4 Miscellaneous

8 Distributors

Web site	http://www.netbsd.org/
Package database entry	ftp://ftp.netbsd.org/pub/NetBSD/packages/pkgsrc/textproc/grep/README.html
Bug tracking	http://www.netbsd.org/Misc/query-pr.html
Managed repository	`cvs -d:pserver:anoncvs@anoncvs.NetBSD.org:/cvsroot co pkgsrc/textproc/grep`
Managed repository	http://cvsweb.netbsd.org/bsdweb.cgi/pkgsrc/textproc/grep/
Source package name	grep
Binary package name	grep
Entry updated	2005-05-05

Web site	http://www.openbsd.org/
Package database entry	http://www.openbsd.org/3.8_packages/i386/ggrep-2.5.1p1.tgz-long.html
Maintainer	Christian Weisgerber `<naddy at openbsd.org>`
Bug tracking	http://www.openbsd.org/query-pr.html
Managed repository	`cvs -d:pserver:anoncvs@anoncvs1.ca.openbsd.org:/cvs co ports/sysutils/ggrep`
Managed repository	http://www.openbsd.org/cgi-bin/cvsweb/ports/sysutils/ggrep/
Source package name	ggrep
Binary package name	ggrep
Entry updated	2005-11-08

Web site	http://www.openpkg.org/
Maintainer	Ralf S. Engelschall `<rse at openpkg.org>`
Managed repository	`cvs -d :pserver:anonymous@cvs.openpkg.org:/v/openpkg/cvs co openpkg-src/grep`
Managed repository	`rsync -av rsync://rsync.openpkg.org/openpkg-cvs/openpkg-src/grep/ .`
Managed repository	http://cvs.openpkg.org/dir?d=openpkg-src/grep
Source package name	grep
Binary package name	grep
Entry updated	2005-06-19

Web site	http://www.novell.com/linux/suse/
Maintainer	Andreas Schwab `<schwab at suse.de>`
Package database entry	Professional http://www.novell.com/products/linuxpackages/professional/grep.html
Source package name	grep
Binary package name	grep
Entry updated	2005-06-19
