Poliqarp for DjVu

From DM

(Difference between revisions)
Jump to: navigation, search
(2 intermediate revisions not shown)
Line 1: Line 1:
-
Draft version!
+
'''Poliqarp for DjVu''' is an open-source search engine software for DjVu corpora available on GNU GPL license, developped by Janusz S. Bień at the University of Warsaw. It relies on the DjVu format and allows to present end-users with results of advanced language technologies.
-
http://bc.klf.uw.edu.pl/177/
+
Conceived as a modification of the Poliqarp (Polyinterpretation Indexing Query and Retrieval Procesor) corpus query tool, it inherits from its origin the powerfull search facilities based on two-level regular expressions, which can be used in the queries to circumvent the OCR errors, but also the ability to represent low-level ambiguities and other linguistic phenomena. It delivers highlighted results and KWIC search results.
-
https://bitbucket.org/jwilk/marasca-wbl
+
Although at present the tool is used mainly to facilitate access to the results of dirty OCR, it is ready to handle also more sophisticated output of linguistic technologies.
-
https://bitbucket.org/mrudolf/djview-poliqarp
+
Poliqarp for DjVu is in particular used for a non-medieval corpus [http://poliqarp.wbl.klf.uw.edu.pl (corpus of historical Polish (since 1570 to 1756)], with issues related to medieval corpus (spelling, abbreviations, etc.)
-
http://poliqarp.wbl.klf.uw.edu.pl
 
-
--[[User:Jsbien|Janusz S. Bień, University of Warsaw]] 11:59, 5 May 2012 (UTC)
+
'''Source(s):'''  [http://www.downloadranking.com  Software solutions]
 +
 
 +
 
 +
== Source ==
 +
 
 +
[http://bc.klf.uw.edu.pl/177/ Presentation of the tool]
 +
 
 +
[https://bitbucket.org/jsbien/ndt/wiki/wyniki Documentation]
 +
 
[[Category:Software and Tools]]
[[Category:Software and Tools]]

Revision as of 12:04, 28 November 2012

Poliqarp for DjVu is an open-source search engine software for DjVu corpora available on GNU GPL license, developped by Janusz S. Bień at the University of Warsaw. It relies on the DjVu format and allows to present end-users with results of advanced language technologies.

Conceived as a modification of the Poliqarp (Polyinterpretation Indexing Query and Retrieval Procesor) corpus query tool, it inherits from its origin the powerfull search facilities based on two-level regular expressions, which can be used in the queries to circumvent the OCR errors, but also the ability to represent low-level ambiguities and other linguistic phenomena. It delivers highlighted results and KWIC search results.

Although at present the tool is used mainly to facilitate access to the results of dirty OCR, it is ready to handle also more sophisticated output of linguistic technologies.

Poliqarp for DjVu is in particular used for a non-medieval corpus (corpus of historical Polish (since 1570 to 1756), with issues related to medieval corpus (spelling, abbreviations, etc.)


Source(s): Software solutions


Source

Presentation of the tool

Documentation

Personal tools