Index | index by Group | index by Distribution | index by Vendor | index by creation date | index by Name | Mirrors | Help | Search |
Name: snowball | Distribution: Fedora Project |
Version: 2.2.0 | Vendor: Fedora Project |
Release: 7.fc39 | Build date: Sat Jul 22 04:11:55 2023 |
Group: Unspecified | Build host: buildvm-x86-15.iad2.fedoraproject.org |
Size: 302326 | Source RPM: snowball-2.2.0-7.fc39.src.rpm |
Packager: Fedora Project | |
Url: https://snowballstem.org/ | |
Summary: Snowball compiler and stemming algorithms |
Snowball is a small string processing language for creating stemming algorithms for use in Information Retrieval, plus a collection of stemming algorithms implemented using it. Snowball was originally designed and built by Martin Porter. Martin retired from development in 2014 and Snowball is now maintained as a community project. Martin originally chose the name Snowball as a tribute to SNOBOL, the excellent string handling language from the 1960s. It now also serves as a metaphor for how the project grows by gathering contributions over time. The Snowball compiler translates a Snowball program into source code in another language - currently Ada, ISO C, C#, Go, Java, Javascript, Object Pascal, Python and Rust are supported. What is Stemming? Stemming maps different forms of the same word to a common "stem" - for example, the English stemmer maps connection, connections, connective, connected, and connecting to connect. So a search for connected would also find documents which only have the other forms. This stem form is often a word itself, but this is not always the case as this is not a requirement for text search systems, which are the intended field of use. We also aim to conflate words with the same meaning, rather than all words with a common linguistic root (so awe and awful don't have the same stem), and over-stemming is more problematic than under-stemming so we tend not to stem in cases that are hard to resolve. If you want to always reduce words to a root form and/or get a root form which is itself a word then Snowball's stemming algorithms likely aren't the right answer.
BSD-3-Clause
* Sat Jul 22 2023 Fedora Release Engineering <[email protected]> - 2.2.0-7 - Rebuilt for https://fedoraproject.org/wiki/Fedora_39_Mass_Rebuild * Tue Jun 13 2023 Python Maint <[email protected]> - 2.2.0-6 - Rebuilt for Python 3.12 * Mon Feb 27 2023 Jerry James <[email protected]> - 2.2.0-5 - Dynamically generate python BuildRequires * Sat Jan 21 2023 Fedora Release Engineering <[email protected]> - 2.2.0-5 - Rebuilt for https://fedoraproject.org/wiki/Fedora_38_Mass_Rebuild * Mon Sep 26 2022 Jerry James <[email protected]> - 2.2.0-4 - Add BR on javapackages-tools for Java arches - Run the Java and python tests * Wed Sep 21 2022 Jerry James <[email protected]> - 2.2.0-3 - Initial RPM, from the libstemmer and python-snowballstemmer packages
/usr/bin/snowball /usr/bin/stemwords /usr/lib/.build-id /usr/lib/.build-id/89 /usr/lib/.build-id/89/2fff1f80023dc91de8fb06479c45ecbe25b081 /usr/lib/.build-id/de /usr/lib/.build-id/de/aa75bb42747a8ecf2dfb6772cb50ea9c74ad7d /usr/share/doc/snowball /usr/share/doc/snowball/NEWS /usr/share/doc/snowball/README.html /usr/share/licenses/snowball /usr/share/licenses/snowball/COPYING
Generated by rpm2html 1.8.1
Fabrice Bellet, Tue Jul 9 21:55:09 2024