From a72e66db913de3a2e508080c8b1fc8d1342a899b Mon Sep 17 00:00:00 2001 From: Ralph Amissah Date: Tue, 25 Sep 2007 23:23:03 +0100 Subject: remove generated output from main package --- .../sisu_manual/sisu_description/1.html | 3914 ---------------- .../sisu_manual/sisu_description/concordance.html | 2819 ------------ .../sisu_manual/sisu_description/digest.txt | 429 -- .../sisu_manual/sisu_description/doc.html | 4760 -------------------- .../sisu_manual/sisu_description/dom.xml | 2867 ------------ .../sisu_manual/sisu_description/endnotes.html | 1012 ----- .../sisu_manual/sisu_description/index.html | 704 --- .../sisu_manual/sisu_description/landscape.pdf | Bin 214748 -> 0 bytes .../sisu_manual/sisu_description/metadata.html | 591 --- .../sisu_manual/sisu_description/opendocument.odt | Bin 28569 -> 0 bytes .../sisu_manual/sisu_description/plain.txt | 1566 ------- .../sisu_manual/sisu_description/portrait.pdf | Bin 219308 -> 0 bytes .../sisu_manual/sisu_description/sax.xml | 2731 ----------- .../sisu_manual/sisu_description/scroll.xhtml | 2519 ----------- .../sisu_description/sisu_description.sst | 557 --- .../sisu_description/sisu_manifest.html | 197 - .../sisu_manual/sisu_description/sisupod.zip | Bin 20648 -> 0 bytes .../sisu_manual/sisu_description/sitemap.xml | 30 - .../sisu_manual/sisu_description/toc.html | 704 --- 19 files changed, 25400 deletions(-) delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/1.html delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/concordance.html delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/digest.txt delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/doc.html delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/dom.xml delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/endnotes.html delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/index.html delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/landscape.pdf delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/metadata.html delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/opendocument.odt delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/plain.txt delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/portrait.pdf delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/sax.xml delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/scroll.xhtml delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/sisu_description.sst delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/sisu_manifest.html delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/sisupod.zip delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/sitemap.xml delete mode 100644 data/doc/manuals_generated/sisu_manual/sisu_description/toc.html (limited to 'data/doc/manuals_generated/sisu_manual/sisu_description') diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/1.html b/data/doc/manuals_generated/sisu_manual/sisu_description/1.html deleted file mode 100644 index df2efb47..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/1.html +++ /dev/null @@ -1,3914 +0,0 @@ - - - - - - 1 - - SiSU - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
- - - -
- -
SiSU -->
-
- - - - - - - - - - - - - - - - - -
- - - TOC linked  toc  - - - - - - Full Text  scroll  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
-
- - - - - - -
- - TOC - - - - next >> - - -
-
- - - - -
- - < - - - - ^ - - - - > - -

- SiSU -

-

- Description -

-

- Ralph Amissah - -

-

- copy @ - - SiSU - -

- - -
-

- SiSU - Description,
Ralph Amissah -

-
  -

-   1 -

-
- - -
- -   -

-   2 -

-
- - -
- -   -
- - -
-

- - 1. Description -

-
-

-   3 -

-
- - 1.1 Outline - -
- - 1.2 Short summary of features - -
- - 1.3 How it works - -
- - 1.4 Simple markup - -
- - 1.4.1 Sparse markup requirement, try to get the most out of markup - -
- - 1.4.2 Single markup file provides multiple output formats - -
- - 1.4.3 Syntax relatively easy to read and remember - -
- - 1.4.4 Kept simple by having a limited publishing feature set, and features identified as most important, are available across several document types - -
- - 1.5 Designed with usability in mind - -
- - 1.6 Code separate from content - -
- - 1.7 Object citation numbering, a text or object positioning / citation system - "paragraph" (or text object) numbering, that remains same and usable across all output formats by people and machine - -
- - 1.8 Handling of Dublin Core meta-tags making use of the Resource Description Framework - -
- - 1.9 Easy directory management - -
- - 1.10 Document Version Control Information - -
- - 1.11 Table of contents - -
- - 1.12 Auto-numbering of headings - -
- - 1.13 Numbering and cross-hyperlinking of endnotes - -
- - 1.14 "Skinnable" - -
- - 1.15 Multiple Outputs - -
- - 1.15.1 html - several presentations: full length & segmented; css & table based - -
- - 1.15.2 XML - -
- - 1.15.3 ODT:ODF, Open Document Format - ISO/IEC 26300:2006 - -
- - 1.15.4 PDF - portrait and landscape, (through the generation of LaTeX output which is then transformed to pdf) - -
- - 1.15.5 Search - loading/populating of relational database while retaining document structure information, object citation numbering and other features (currently PostgreSQL and/or SQLite) - -
- - 1.15.6 Search - database frontend sample, utilising database and SiSU features, including object citation numbering (backend currently PostgreSQL) - -
- - 1.15.7 Other forms - -
- - 1.16 Concordance / Word Map or rudimentary index - -
- - 1.17 Managed (document) directory, database, or site structure - -
- - 1.18 Batch processing - -
- - 1.19 Integration to superior Gnu/Linux and Unix tools - -
- - 1.19.1 Backup and version control - -
- - 1.19.2 Editor support - -
- - 1.20 Modular design, need something new add a module - -
- - -
- -

- 1.1 Outline -

-
-

-   4 -

-
- - -
-

- SiSU is a flexible document preparation, generation publishing and search system.  1  -

-
-

-   5 -

-
- - -
-

- SiSU ("SiSU information Structuring Universe" or "Structured information, Serialized Units"),  2  is a Unix command line oriented framework for document structuring, publishing and search. Featuring minimalistic markup, multiple standard outputs, a common citation system, and granular search. -

-
-

-   6 -

-
- - -
-

- Using markup applied to a document, SiSU can produce plain text, HTML, XHTML, XML, OpenDocument, LaTeX or PDF files, and populate an SQL database with objects  3  (equating generally to paragraph-sized chunks) so searches may be performed and matches returned with that degree of granularity (e.g. your search criteria is met by these documents and at these locations within each document). Document output formats share a common object numbering system for locating content. This is particularly suitable for "published" works (finalized texts as opposed to works that are frequently changed or updated) for which it provides a fixed means of reference of content. -

-
-

-   7 -

-
- - -
-

- SiSU is the data/information structuring and transforming tool, that has resulted from work on one of the oldest law web projects. It makes possible the one time, simple human readable markup of documents, that SiSU can then publish in various forms, suitable for paper  4  , web  5  and relational database  6  presentations, retaining common data-structure and meta-information across the output/presentation formats. Several requirements of legal and scholarly publication on the web have been addressed, including the age old need to be able to reliably cite/pinpoint text within a document, to easily make footnotes/endnotes, to allow for semantic document meta-tagging, and to keep required markup to a minimum. These and other features of interest are listed and described below. A few points are worth making early (and will be repeated a number of times): -

-
-

-   8 -

-
- - -
-

- (i) The SiSU document generator was the first to place material on the web with a system that makes possible citation across different document types, with paragraph, or rather object citation numbering  7  a text positioning system, available for the pinpointing of text, 1997, a simple idea from which much benefit, and SiSU remains today, to the best of my knowledge, the only multiple format e-book/ electronic-document system on the web that gives you this possibility (including for relational databases). -

-
-

-   9 -

-
- - -
-

- (ii) Markup is done once for the multiple formats produced. -

-
-

-   10 -

-
- - -
-

- (iii) Markup is simple, and human readable (with a little practice), in almost all cases there is less and simpler markup required than basic html. In any event the markup required is very much simpler than the html, LaTeX, [lout], structured XML, ODF (OpenDocument), PostgreSQL or SQLite feed etc. that you can have SiSU generate for you. -

-
-

-   11 -

-
- - -
-

- (iv) SiSU is a batch processor, dealing with as many files as you need to generate at a time. -

-
-

-   12 -

-
- - -
-

- (v) Scalability is dependent on your file system (in my case Reiserfs), the database (currently Postgresql and/or SQLite) and your hardware. -

-
-

-   13 -

-
- - -
-

- SiSU Sabaki  8  (or just SiSU) is the provisional name given to the software described here that helps structure documents for web and other publication. The name SiSU is a loose anagram for something along the lines of "SiSU is structuring unit", or "SiSU, information structuring unit" or the more descriptive "Structured information, Serialized Units" or "simple - information structuring unit" or the more descriptive "Structured information, Serialized Units" or what it may be directed towards "semantic and information structuring universe"  9  tongue in cheek, only just. Guess I'll get away with "Simple - information Structuring Universe". SiSU is also a Finnish word roughly meaning guts, inner strength and perseverance.  10  -

-
-

-   14 -

-
- - -
-

- SiSU was born of the need to find a way, with minimal effort, and for as wide a range of document types as possible, to produce high quality publishing output in a variety of document formats. As such it was necessary to find a simple document representation that would work across a large number of document types, and the most convenient way(s) to produce acceptable output formats. The project leading to this program was started in 1993 (together with the trade law project now known as Lex Mercatoria) as an investigation of how to effectively/efficiently place documents on the web. The unified document handling, together with features such as paragraph numbering, endnote handling and tables... appeared in 1996/97. SiSU was originally written in Perl,  11  and converted to Ruby,   12  in 2000, one of the most impressive programming languages in existence! In its current form it has been written to run on the Gnu /Linux platform, and in particular on Debian,   13  taking advantage of many of the wonderful projects that are available there. -

-
-

-   15 -

-
- - -
-

- SiSU markup is based on requiring the minimum markup needed to determine the structure of a document. (This can be as little as saying in a header to look for the word Book at a specified level and the word Chapter at another level). SiSU then breaks a document into its smallest parts (at a heading, and paragraph level) while retaining all structural information. This break up of the document and information on its structure is taken advantage of in the transformations made in generating the very different output types that can be created, and in providing as much as can be for what each output type is best at doing, e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), ODF (OpenDocument [experimental]), SQL (e.g. document search; representing constituent parts of documents based on their structure, headings, chapters, paragraphs as required; user control).  14  -

-
-

-   16 -

-
- - -
-

- From markup that is simpler and more sparse than html you get: -

-
-

-   17 -

-
- - -
-
  • - far greater output possibilities, including html, XML, ODF (OpenDocument), LaTeX (pdf), and SQL; -
  • -
    -

    -   18 -

    -
    - - -
    -
  • - the advantages implicit in the very different output possibilities; -
  • -
    -

    -   19 -

    -
    - - -
    -
  • - a common citation system (for all outputs - including the relational database, search results are relevant for all outputs); -
  • -
    -

    -   20 -

    -
    - - -
    -

    - For more see the short summary of features provided below. -

    -
    -

    -   21 -

    -
    - - -
    -

    - SiSU processes files with minimal tagging to produce various document outputs including html, LaTeX or lout (which is converted to pdf) and if required loads the structured information into an SQL database (PostgreSQL and SQLite have been used for this). SiSU produces an intermediate processing format.  15  -

    -
    -

    -   22 -

    -
    - - -
    -

    - SiSU is used in constructing Lex Mercatoria <http://lexmercatoria.org/> or <http://www.jus.uio.no/lm/> (one of the oldest law web sites), and considerable thought went into producing output that would be suitable for legal and academic writings (that do not have formulae) given the limitations of html, and publication in a wide variety of "formats", in particular in relation to the convenient and accurate citation of text. However, the construction of Lex Mercatoria uses only a fraction of the features available from SiSU today, vis generation of flat file structures, rather than in addition the building of ("granular") SQL database content, (at an object level with relevant relational tables, and other outputs also available). -

    -
    -

    -   23 -

    -
    - - -
    - -

    - 1.2 Short summary of features -

    -
    -

    -   24 -

    -
    - - -
    -

    - (i) markup syntax: (a) simpler than html, (b) mnemonic, influenced by mail/messaging/wiki markup practices, (c) human readable, and easily writable, -

    -
    -

    -   25 -

    -
    - - -
    -

    - (ii) (a) minimal markup requirement, (b) single file marked up for multiple outputs, -

    -
    -

    -   26 -

    -
    - - -
    -

    - notes: -

    -
    -

    -   27 -

    -
    - - -
    -

    - * documents are prepared in a single UTF-8 file using a minimalistic mnemonic syntax. Typical literature, documents like "War and Peace" require almost no markup, and most of the headers are optional. -

    -
    -

    -   28 -

    -
    - - -
    -

    - * markup is easily readable/parsed by the human eye, (basic markup is simpler and more sparse than the most basic html), [this may also be converted to XML representations of the same input/source document]. -

    -
    -

    -   29 -

    -
    - - -
    -

    - * markup defines document structure (this may be done once in a header pattern-match description, or for heading levels individually); basic text attributes (bold, italics, underscore, strike-through etc.) as required; and semantic information related to the document (header information, extended beyond the Dublin core and easily further extended as required); the headers may also contain processing instructions. -

    -
    -

    -   30 -

    -
    - - -
    -

    - (iii) (a) multiple outputs primarily industry established and institutionally accepted open standard formats, include amongst others: plaintext (UTF-8); html; (structured) XML; ODF (Open Document text)l; LaTeX; PDF (via LaTeX); SQL type databases (currently PostgreSQL and SQLite). Also produces: concordance files; document content certificates (md5 or sha256 digests of headings, paragraphs, images etc.) and html manifests (and sitemaps of content). (b) takes advantage of the strengths implicit in these very different output types, (e.g. PDFs produced using typesetting of LaTeX, databases populated with documents at an individual object/paragraph level, making possible granular search (and related possibilities)) -

    -
    -

    -   31 -

    -
    - - -
    -

    - (iv) outputs share a common numbering system (dubbed "object citation numbering" (ocn)) that is meaningful (to man and machine) across various digital outputs whether paper, screen, or database oriented, (PDF, html, XML, sqlite, postgresql), this numbering system can be used to reference content. -

    -
    -

    -   32 -

    -
    - - -
    -

    - (v) SQL databases are populated at an object level (roughly headings, paragraphs, verse, tables) and become searchable with that degree of granularity, the output information provides the object/paragraph numbers which are relevant across all generated outputs; it is also possible to look at just the matching paragraphs of the documents in the database; [output indexing also work well with search indexing tools like hyperesteier]. -

    -
    -

    -   33 -

    -
    - - -
    -

    - (vi) use of semantic meta-tags in headers permit the addition of semantic information on documents, (the available fields are easily extended) -

    -
    -

    -   34 -

    -
    - - -
    -

    - (vii) creates organised directory/file structure for (file-system) output, easily mapped with its clearly defined structure, with all text objects numbered, you know in advance where in each document output type, a bit of text will be found (e.g. from an SQL search, you know where to go to find the prepared html output or PDF etc.)... there is more; easy directory management and document associations, the document preparation (sub-)directory may be used to determine output (sub-)directory, the skin used, and the SQL database used, -

    -
    -

    -   35 -

    -
    - - -
    -

    - (viii) "Concordance file" wordmap, consisting of all the words in a document and their (text/ object) locations within the text, (and the possibility of adding vocabularies), -

    -
    -

    -   36 -

    -
    - - -
    -

    - (ix) document content certification and comparison considerations: (a) the document and each object within it stamped with an md5 hash making it possible to easily check or guarantee that the substantive content of a document is unchanged, (b)version control, documents integrated with time based source control system, default RCS or CVS with use of $Id: sisu_description.sst,v 1.25 2007/08/23 12:22:36 ralph Exp $ tag, which SiSU checks -

    -
    -

    -   37 -

    -
    - - -
    -

    - (x) SiSU's minimalist markup makes for meaningful "diffing" of the substantive content of markup-files, -

    -
    -

    -   38 -

    -
    - - -
    -

    - (xi) easily skinnable, document appearance on a project/site wide, directory wide, or document instance level easily controlled/changed, -

    -
    -

    -   39 -

    -
    - - -
    -

    - (xii) in many cases a regular expression may be used (once in the document header) to define all or part of a documents structure obviating or reducing the need to provide structural markup within the document, -

    -
    -

    -   40 -

    -
    - - -
    -

    - (xiii) prepared files may be batch process, documents produced are static files so this needs to be done only once but may be repeated for various reasons as desired (updated content, addition of new output formats, updated technology document presentations/representations) -

    -
    -

    -   41 -

    -
    - - -
    -

    - (xiv) possible to pre-process, which permits: the easy creation of standard form documents, and templates/term-sheets, or; building of composite documents (master documents) from other sisu marked up documents, or marked up parts, i.e. import documents or parts of text into a main document should this be desired -

    -
    -

    -   42 -

    -
    - - -
    -

    - there is a considerable degree of future-proofing, output representations are "upgradeable", and new document formats may be added. -

    -
    -

    -   43 -

    -
    - - -
    -

    - (xv) there is a considerable degree of future-proofing, output representations are "upgradeable", and new document formats may be added: (a) modular, (thanks in no small part to Ruby) another output format required, write another module.... (b) easy to update output formats (eg html, XHTML, LaTeX/PDF produced can be updated in program and run against whole document set), (c) easy to add, modify, or have alternative syntax rules for input, should you need to, -

    -
    -

    -   44 -

    -
    - - -
    -

    - (xvi) scalability, dependent on your file-system (ext3, Reiserfs, XFS, whatever) and on the relational database used (currently Postgresql and SQLite), and your hardware, -

    -
    -

    -   45 -

    -
    - - -
    -

    - (xvii) only marked up files need be backed up, to secure the larger document set produced, -

    -
    -

    -   46 -

    -
    - - -
    -

    - (xviii) document management, -

    -
    -

    -   47 -

    -
    - - -
    -

    - (xix) Syntax highlighting for SiSU markup is available for a number of text editors. -

    -
    -

    -   48 -

    -
    - - -
    -

    - (xx) remote operations: (a) run SiSU on a remote server, (having prepared sisu markup documents locally or on that server, i.e. this solution where sisu is installed on the remote server, would work whatever type of machine you chose to prepare your markup documents on), (b) generated document outputs may be posted by sisu to remote sites (using rsync/scp) (c)document source (plaintext utf-8) if shared on the net may be identified by its url and processed locally to produce the different document outputs. -

    -
    -

    -   49 -

    -
    - - -
    -

    - (xxi) document source may be bundled together (automatically) with associated documents (multiple language versions or master document with inclusions) and images and sent as a zip file called a sisupod, if shared on the net these too may be processed locally to produce the desired document outputs, these may be downloaded, shared as email attachments, or processed by running sisu against them, either using a url or the filename. -

    -
    -

    -   50 -

    -
    - - -
    -

    - (xxii) for basic document generation, the only software dependency is Ruby, and a few standard Unix tools (this covers plaintext, html, XML, ODF, LaTeX). To use a database you of course need that, and to convert the LaTeX generated to PDF, a LaTeX processor like tetex or texlive. -

    -
    -

    -   51 -

    -
    - - -
    -

    - as a developers tool it is flexible and extensible -

    -
    -

    -   52 -

    -
    - - -
    -

    - SiSU was developed in relation to legal documents, and is strong across a wide variety of texts (law, literature...). SiSU handles images but is not suitable for formulae/ statistics, or for technical writing at this time. -

    -
    -

    -   53 -

    -
    - - -
    -

    - SiSU has been developed and has been in use for several years. Requirements to cover a wide range of documents within its use domain have been explored. -

    -
    -

    -   54 -

    -
    - - -
    -

    - Some modules are more mature than others, the most mature being Html and LaTeX / pdf. PostgreSQL and search functions are useable and together with ocn unique (to the best of my knowledge). The XML output document set is "well formed" but largely proof of concept. -

    -
    -

    -   55 -

    -
    - - -
    - -

    - 1.3 How it works -

    -
    -

    -   56 -

    -
    - - -
    -

    - SiSU markup is fairly minimalistic, it consists of: a (largely optional) document header, made up of information about the document (such as when it was published, who authored it, and granting what rights) and any processing instructions; and markup within text which is related to document structure and typeface. SiSU must be able to discern the structure of a document, (text headings and their levels in relation to each other), either from information provided in the instruction header or from markup within the text (or from a combination of both). Processing is done against an abstraction of the document comprising of information on the document's structure and its objects,  16  which the program serializes (providing the object numbers) and which are assigned hash sum values based on their content. This abstraction of information about document structure, objects, (and hash sums), provides considerable flexibility in representing documents different ways and for different purposes (e.g. search, document layout, publishing, content certification, concordance etc.), and makes it possible to take advantage of some of the strengths of established ways of representing documents, (or indeed to create new ones). -

    -
    -

    -   57 -

    -
    - - -
    - -

    - 1.4 Simple markup -

    -
    -

    -   58 -

    -
    - - -
    -

    - SiSU markup is based on requiring the minimum markup needed to determine the structure of a document. (This can be as little as saying in a header to look for the word Book at a specified level and the word Chapter at another level). SiSU then breaks a document into its smallest parts (at a heading, and paragraph level) while retaining all structural information. This break up of the document and information on its structure is taken advantage of in the transformations made in generating the very different output types that can be created, and in providing as much as can be for what each output type is best at doing, e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), ODF (OpenDocument), SQL (e.g. document search; representing constituent parts of documents based on their structure, headings, chapters, paragraphs as required; user control).  17  -

    -
    -

    -   59 -

    -
    - - -
    - -

    - 1.4.1 Sparse markup requirement, try to get the most out of markup -

    -
    -

    -   60 -

    -
    - - -
    -

    - One of its strengths is that very small amounts of initial tagging is required for the program to generate its output. -

    -
    -

    -   61 -

    -
    - - -
    -

    - This is a basic markup example: -

    -
    -

    -   62 -

    -
    - - -
    -
  • - basic markup example, text file - an international convention   18  -
  • -
    -

    -   63 -

    -
    - - -
    -
  • - view basic markup, as it would be highlighted by vim editor   19  -
  • -
    -

    -   64 -

    -
    - - -
    -

    - Emphasis has been on simplicity and minimalism in markup requirements. Design philosophy is to try keep the amount of markup required low, for whatever has been determined to be acceptable output.  20  -

    -
    -

    -   65 -

    -
    - - -
    -

    - SiSU's markup is more minimalistic and simpler than (the equivalent) html and for it, you get considerably more than just html, as this preparation gives you all available output formats, upon request. -

    -
    -

    -   66 -

    -
    - - -
    - -

    - 1.4.2 Single markup file provides multiple output formats -

    -
    -

    -   67 -

    -
    - - -
    -

    - For each document, there is only one (input, minimalistically marked up) file from which all the available output types are generated.  21  -

    -
    -

    -   68 -

    -
    - - -
    -

    - Eg. the markup example: -

    -
    -

    -   69 -

    -
    - - -
    -
  • - original text file - an international convention   22  -
  • -
    -

    -   70 -

    -
    - - -
    -
  • - view as syntax would be highlighted by vim editor   23  -
  • -
    -

    -   71 -

    -
    - - -
    -

    - Produces the following output: -

    -
    -

    -   72 -

    -
    - - -
    -
  • - Segmented html version of document   24  -
  • -
    -

    -   73 -

    -
    - - -
    -
  • - Full length html document   25  -
  • -
    -

    -   74 -

    -
    - - -
    -
  • - pdf landscape version of document   26  -
  • -
    -

    -   75 -

    -
    - - -
    -
  • - pdf portrait version of document   27  -
  • -
    -

    -   76 -

    -
    - - -
    -
  • - clean tex ascii version of document   28  -
  • -
    -

    -   77 -

    -
    - - -
    -
  • - xml sax version of document   29  -
  • -
    -

    -   78 -

    -
    - - -
    -
  • - xml dom version of document   30  -
  • -
    -

    -   79 -

    -
    - - -
    -
  • - Concordance   31  -
  • -
    -

    -   80 -

    -
    - - -
    -

    - (and in addition to these: PostgreSQL, SQLite, texinfo and YAML   32  versions if desired) -

    -
    -

    -   81 -

    -
    - - -
    - -

    - 1.4.3 Syntax relatively easy to read and remember -

    -
    -

    -   82 -

    -
    - - -
    -

    - Syntax is kept simple and mnemonic.  33  -

    -
    -

    -   83 -

    -
    - - -
    - -

    - 1.4.4 Kept simple by having a limited publishing feature set, and features identified as most important, are available across several document types -

    -
    -

    -   84 -

    -
    - - -
    -

    - To keep SiSU markup sparse and simple SiSU deliberately provides a limited publishing feature set, including: indent levels; bold; italics; superscript; subscript; simple tables; images; tables of contents and; endnotes. Which in most cases are available across the different output formats. -

    -
    -

    -   85 -

    -
    - - -
    -

    - The publishing feature set may be expanded as required. -

    -
    -

    -   86 -

    -
    - - -
    - -

    - 1.5 Designed with usability in mind -

    -
    -

    -   87 -

    -
    - - -
    -

    - Output is designed to be uniform, easy to read, navigate and cite. -

    -
    -

    -   88 -

    -
    - - -
    - -

    - 1.6 Code separate from content -

    -
    -

    -   89 -

    -
    - - -
    -

    - Code  34  is separated from content. This means that when changes are desired in the output presentation, the code that produces them, and not the marked up text data set (which could be thousands of documents) is modified. Separating code from content makes large scale changes to output appearance trivial, and permits the easy addition of new output modules. -

    -
    -

    -   90 -

    -
    - - -
    - -

    - 1.7 Object citation numbering, a text or object positioning / citation system - "paragraph" (or text object) numbering, that remains same and usable across all output formats by people and machine -

    -
    -

    -   91 -

    -
    - - -
    -

    - Object citation numbering is a simple object (text) positioning and cition system that is human relevant and machine useable, used by SiSU for all manner of presentations, and that is available for use in all text mappings. It is based on the automated sequential numbering of objects (roughly paragraphs, (headings, tables, verse) or other blocks of text or images etc.). The text positioning system (in which I claim copyright) is invaluable for publishing requiring the citing text across multiple output formats, and for the general mapping of text within a document: -

    -
    -

    -   92 -

    -
    - - -
    -
  • - in html, html not being easily citeable (change font size, or use a different browser and the page on which specific text appears has changed), and -
  • -
    -

    -   93 -

    -
    - - -
    -
  • - across multiple formats being common to all output formats html/xml/pdf/sql output, -
  • -
    -

    -   94 -

    -
    - - -
    -
  • - the results of an sql search can just be "live" citation references to the documents in which the text is found, much like an index (see image examples provided).   35  -
  • -
    -

    -   95 -

    -
    - - -
    -

    - I claim copyright on the system I use which is the most basic of all, numbering all text in headings and paragraphs sequentially (with tables and images being treated as a single paragraph) and only footnotes/endnotes not following this numbering, as their position in text is not strictly determined, (a change from footnotes to endnotes would change their numbering), footnotes instead "belong" to the paragraph from which they are referenced, and have sequential numbers of their own. -

    -
    -

    -   96 -

    -
    - - -
    -

    - SiSU has a paragraph numbering system, that remains the same regardless of the output format. This provides an effective means of citation, pinpointing text accurately in all output formats, using the same reference. This is particularly useful where text has to be located across different output formats - for example once html is printed the number of pages and pages on which given text is found will vary depending on the browser, its settings the font size setting etc. Similarly SiSU produces pdf in different forms, eg. on the example site Lex Mercatoria as portrait and landscape documents - here too page numbering varies, but paragraph numbering is the same, vis a vis all versions of the text (portrait and landscape pdf and the html versions of the text, and as stored (with "paragraphs" as records) to the PostgreSQL or SQLite database). -

    -
    -

    -   97 -

    -
    - - -
    -

    - These numbers are placed in the text margins and are intended to be independent of and not to interfere with authors tagging. [The citation system (object citation numbering system, automated "paragraph numbering") which is automatically generated and is common and identical across all document formats] The paragraph numbering system is more accurately described as an (text) object numbering system, as headings are also numbered... all headings and paragraphs are numbered sequentially. Endnotes are automatically numbered independently and rather "belong" to the paragraph from which they are referenced, as an endnote does not (necessarily) form a part of a documents sequence, (they may be produced as either endnotes or footnotes (or both depending on what output you choose to look at - if you take the segmented html version document provided as an example, you will find that the endnotes are placed both at the end of each section, and in a separate section of their own called endnotes, and these are hyper-linked)). An attractive feature of providing citation numbering in this way is that it is independent of the document structure... it remains the same regardless of what is done about the document structure. -

    -
    -

    -   98 -

    -
    - - -
    -

    - The rules have been kept very simple, unique incremental object citation numbers are assigned to headings, paragraphs, verse, tables and images. It is possible to manually override this feature on a per heading or comment basis though this should be used exceptionally, it may be of use where there a substantive text, and the addition of a minor comment by the publisher that should not be mapped as part of the text. -

    -
    -

    -   99 -

    -
    - - -
    -

    - The object citation number markers contain additional numbering information with regard to the document structure, that can be used for alternative presentations, including such detail as the type of object (heading, paragraph, table, image, etc.), numbered sequentially. -

    -
    -

    -   100 -

    -
    - - -
    -

    - An advantage is that the numbering remains the same regardless of document structure. -

    -
    -

    -   101 -

    -
    - - -
    -

    - Text object ("paragraph") numbering is the same for all output versions of the same document, vis html, pdf, pgsql, yaml etc. -

    -
    -

    -   102 -

    -
    - - -
    -

    - In the relational database, as individual text objects of a document stored (and indexed) together with object numbers, and all versions of the document have the same numbering, the results of searches may be tailored just to provide the location of the search result in all available document formats. -

    -
    -

    -   103 -

    -
    - - -
    -

    - Note: there is a bug in the released behaviour of object citation numbering, (not certain when it was introduced) tables should be numbered, ie each table gets an ocn, required amongst other things for relational database. This will be corrected in a future release. Citation numbering of existing documents that contain tables will changed. -

    -
    -

    -   104 -

    -
    - - -
    - -

    - 1.8 Handling of Dublin Core meta-tags making use of the Resource Description Framework -

    -
    -

    -   105 -

    -
    - - -
    -

    - SiSU is able to use meta tags based on the Dublin Core  36  and Resource Description Framework  37  -

    -
    -

    -   106 -

    -
    - - -
    -

    - This provides the means of providing semantic information about a document, both as computer processable meta-tags, and as human readable information that may be of value for classification purposes. -

    -
    -

    -   107 -

    -
    - - -
    -

    - This information is provided both in html metatags, and (where available) under the section titled "Document Information - MetaData", near the end of a document, for example in the segmented html version of this text at: <http://www.jus.uio.no/sisu/SiSU/metadata.html> -

    -
    -

    -   108 -

    -
    - - -
    - -

    - 1.9 Easy directory management -

    -
    -

    -   109 -

    -
    - - -
    -

    - 1. Directory file association, skins and special image management, made simpler.  38  -

    -
    -

    -   110 -

    -
    - - -
    -

    - The last part of the name of the work directory in which markup is being done, or rather from where SiSU is run in order to generate document output, is used in determining the sub-directory name for output files, that is created in the document output directory. This provides a rather easy way to associate documents e.g. of a given subject, or by owner. -

    -
    -

    -   111 -

    -
    - - -
    -

    -   /www/docs
          /intellectual_property
          /arbitration
          /contract_law

      /www/docs
          /ralph
          /sisu -

    -
    -

    -   112 -

    -
    - - -
    -

    - all are placed in their own directories within the directory structure created. Similar rules are used in the creation of sql type databases (though they can be overridden). -

    -
    -

    -   113 -

    -
    - - -
    -

    - There are a couple of further associations with these directories. -

    -
    -

    -   114 -

    -
    - - -
    -

    - Directory wide skins. -

    -
    -

    -   115 -

    -
    - - -
    -

    - Directory specific images. -

    -
    -

    -   116 -

    -
    - - -
    -

    - 2. If there is a "directory skin", that is a skin of the same name as the directory, it is used in the generation of the documents within it, rather than the default skin, unless the document has a specific skin associated with it. -

    -
    -

    -   117 -

    -
    - - -
    -

    - a. default skin (always available) -

    -
    -

    -   118 -

    -
    - - -
    -

    - b. directory skin (precedence over default if exists) -

    -
    -

    -   119 -

    -
    - - -
    -

    - c. document skin (takes precedence wherever document requests a specific skin) -

    -
    -

    -   120 -

    -
    - - -
    -

    - Skins are defined in the document skin directory and if a directory association is desired a softlink made to the relevant skin. Skins (directory association auto load) auto load skin if a directory skin exists of same name as directory stub, (and there is no specific doc skin) -

    -
    -

    -   121 -

    -
    - - -
    -

    - 3. If the working directory has within it a sub-directory called image_local, the images within that directory are used for references to images, that are not part of the default site build. -

    -
    -

    -   122 -

    -
    - - -
    - -

    - 1.10 Document Version Control Information -

    -
    -

    -   123 -

    -
    - - -
    -

    - The possibility of citing an exact document version. -

    -
    -

    -   124 -

    -
    - - -
    -

    - Permits the inclusion of document version control information to the document body and metatags.  39  This provides a much more certain method of referring to the exact version of a particular document, (assuming that the document is from a trusted source, that will retain earlier versions of a document).  40  -

    -
    -

    -   125 -

    -
    - - -
    -

    - This information (where available) is provided under the section of the document titled "Document Information - MetaData", near the end of a document, for example in the segmented html version of this text at: <http://www.jus.uio.no/sisu/SiSU/metadata.html> -

    -
    -

    -   126 -

    -
    - - -
    - -

    - 1.11 Table of contents -

    -
    -

    -   127 -

    -
    - - -
    -

    - SiSU produces a rudimentary a table of contents based on document headings. -

    -
    -

    -   128 -

    -
    - - -
    - -

    - 1.12 Auto-numbering of headings -

    -
    -

    -   129 -

    -
    - - -
    -

    - Headings can be automatically numbered, (and automatically named for hyper-linking) -

    -
    -

    -   130 -

    -
    - - -
    - -

    - 1.13 Numbering and cross-hyperlinking of endnotes -

    -
    -

    -   131 -

    -
    - - -
    -

    - SiSU can automatically number footnotes/endnotes. This is the default operation where no number is provided. -

    -
    -

    -   132 -

    -
    - - -
    -

    - Footnotes/endnotes may also be manually numbered. Where a number, or numbers are provided for a footnote/endnote, this does not increment the automatic footnote/endnote number counter. -

    -
    -

    -   133 -

    -
    - - -
    -

    - In the html output footnotes/endnotes are cross-hyper-linked (to their reference point and vice versa). In th pdf output footnotes are linked from their reference point only. -

    -
    -

    -   134 -

    -
    - - -
    - -

    - 1.14 "Skinnable" -

    -
    -

    -   135 -

    -
    - - -
    -

    - SiSU is skinnable, on a site-wide, directory-wide and per document basis, so different looking versions of things may be produced with little difficulty. There is a default skin which may be modified, as the background site skin, and each working directory may have a skin associated with it, as may each individual document. The hierarchy of application is document, directory, then site... ie if a document skin exists it gets precedence. -

    -
    -

    -   136 -

    -
    - - -
    -

    - Whilst it is skinnable, the default output styles are selected to work across the widest possible range of document types. -

    -
    -

    -   137 -

    -
    - - -
    - -

    - 1.15 Multiple Outputs -

    -
    -

    -   138 -

    -
    - - -
    -

    - From markup that is simpler and more sparse than html you get: -

    -
    -

    -   139 -

    -
    - - -
    -
  • - far greater output possibilities, including multiple html types, XML (different structured types), LaTeX (pdf landscape, portrait), and SQL (Postgresql or SQLite or other); -
  • -
    -

    -   140 -

    -
    - - -
    -
  • - the advantages implicit in these very different output possibilities;  41  -
  • -
    -

    -   141 -

    -
    - - -
    -
  • - a common citation system -
  • -
    -

    -   142 -

    -
    - - -
    -

    - As many output formats/presentations as one cares to write modules for - several types of html (e.g. structure based on css, or structure based on tables); LaTeX/pdf and Lout/pdf; pgsql other databases easily added; yaml... -

    -
    -

    -   143 -

    -
    - - -
    - -

    - 1.15.1 html - several presentations: full length & segmented; css & table based -

    -
    -

    -   144 -

    -
    - - -
    -

    - Most documents are produced in single and segmented html versions, described below: -

    -
    -

    -   145 -

    -
    - - -
    -

    - The Scroll (full length text presentations) -

    -
    -

    -   146 -

    -
    - - -
    -

    - The full length of the text in a single scrollable document.  42  As a rule the files they are saved in are named: doc or more precisely doc.html -

    -
    -

    -   147 -

    -
    - - -
    -

    - For various reasons texts may only be provided in this form (such as this one which is short), though most are also provided as segmented texts. -

    -
    -

    -   148 -

    -
    - - -
    -

    - "Scroll" is a reference to the historical scroll, a single long document/ parchment, and also no doubt to what you will have to do to get to the bottom of the text.  43  -

    -
    -

    -   149 -

    -
    - - -
    -

    - The Segmented Text -

    -
    -

    -   150 -

    -
    - - -
    -

    - The text divided into segments (such as articles or chapters depending on the text)  44  As a rule the files they are saved in are named: toc and index or more precisely toc.html and index.html -

    -
    -

    -   151 -

    -
    - - -
    -

    - If you know exactly what you are looking for, loading a segment of text is faster (the segments being smaller). Occasionally longer documents such as the WTA 1994 <http://www.jus.uio.no/lm/wta.1994/toc> are only provided in segmented form. -

    -
    -

    -   152 -

    -
    - - -
    -

    - Cascading Style Sheet, and Table based html -

    -
    -

    -   153 -

    -
    - - -
    -

    - SiSU outputs html, two current standard forms available are: -

    -
    -

    -   154 -

    -
    - - -
    -

    - css based -

    -
    -

    -   155 -

    -
    - - -
    -

    - and -

    -
    -

    -   156 -

    -
    - - -
    -

    - table based [largely discontinued ]  45  -

    -
    -

    -   157 -

    -
    - - -
    -

    - The html is tested across several browsers -

    -
    -

    -   158 -

    -
    - - -
    -

    - I like to remind you that there are other excellent browsers out there, many of which have long supported practical features like tabbing. -

    -
    -

    -   159 -

    -
    - - -
    -

    - The html is tested across several browsers, including: -

    -
    -

    -   160 -

    -
    - - -
    -
  • - Firefox (Mozilla-Firefox)   46  -
  • -
    -

    -   161 -

    -
    - - -
    -
  • - Kazehakase   47  -
  • -
    -

    -   162 -

    -
    - - -
    -
  • - Konqueror   48  -
  • -
    -

    -   163 -

    -
    - - -
    -
  • - Mozilla   49  -
  • -
    -

    -   164 -

    -
    - - -
    -
  • - MS Internet Explorer   50  -
  • -
    -

    -   165 -

    -
    - - -
    -
  • - Netscape   51  -
  • -
    -

    -   166 -

    -
    - - -
    -
  • - Opera   52  -
  • -
    -

    -   167 -

    -
    - - -
    -

    - Also lighter weight graphical browsers: -

    -
    -

    -   168 -

    -
    - - -
    -
  • - Dillo   53  -
  • -
    -

    -   169 -

    -
    - - -
    -
  • - Epiphany   54  -
  • -
    -

    -   170 -

    -
    - - -
    -
  • - Galeon   55  -
  • -
    -

    -   171 -

    -
    - - -
    -

    - And for console/text browsing: -

    -
    -

    -   172 -

    -
    - - -
    -
  • - elinks   56  -
  • -
    -

    -   173 -

    -
    - - -
    -
  • - links2   57  -
  • -
    -

    -   174 -

    -
    - - -
    -
  • - w3m   58  -
  • -
    -

    -   175 -

    -
    - - -
    -

    - The html tables output is rendered more accurately across a wider variety set and older versions of browsers (than the html css output). -

    -
    -

    -   176 -

    -
    - - -
    - -

    - 1.15.2 XML -

    -
    -

    -   177 -

    -
    - - -
    -

    - SiSU generates well formed XML, and multiple versions. An XML SAX version with a flat/shallow structure, and XML DOM version with a deeper (embedded) structure. There is also a released working xhtml module. Examples of SAX and DOM versions are provided within this document. -

    -
    -

    -   178 -

    -
    - - -
    - -

    - 1.15.3 ODT:ODF, Open Document Format - ISO/IEC 26300:2006 -

    -
    -

    -   179 -

    -
    - - -
    -

    - SiSU generates Open Document Output format. -

    -
    -

    -   180 -

    -
    - - -
    - -

    - 1.15.4 PDF - portrait and landscape, (through the generation of LaTeX output which is then transformed to pdf) -

    -
    -

    -   181 -

    -
    - - -
    -

    - SiSU outputs LaTeX if required which is easily transformed to PDF.  59  PDF documents are generated on the site from the same source files and Ruby program that produce html. Landscape oriented pdf introduced, providing easier screen viewing, they are also (paper saving, being currently) formatted to have fewer pages than their portrait equivalents. -

    -
    -

    -   182 -

    -
    - - -
    -
  • - Adobe Reader   60  -
  • -
    -

    -   183 -

    -
    - - -
    -
  • - Evince   61  -
  • -
    -

    -   184 -

    -
    - - -
    -
  • - xpdf   62  -
  • -
    -

    -   185 -

    -
    - - -
    - -

    - 1.15.5 Search - loading/populating of relational database while retaining document structure information, object citation numbering and other features (currently PostgreSQL and/or SQLite) -

    -
    -

    -   186 -

    -
    - - -
    -

    - SiSU (from the same markup input file) automatically feeds into PostgreSQL  63  and/or SQLite  64  database (could be any other of the better relational databases)  65  - together with all additional information related to document structure, and the alternative ways in which it is generated on the site retained. As regards scaling of the database, it is as scalable as the database (here Postgresql or SQLite) and hardware allow. I will prune the images later. -

    -
    -

    -   187 -

    -
    - - -
    -

    - This is one of the more interesting output forms, as all the structural data for the documents are retained (though can be ignored by the user of the database should they so choose). All site texts/documents are (currently) streamed to four pgsql database tables: -

    -
    -

    -   188 -

    -
    - - -
    -
  • - one containing semantic (and other) headers, including, title, author, subject, (the Dublin Core...); -
  • -
    -

    -   189 -

    -
    - - -
    -
  • - another the substantive texts by individual "paragraph" (or object) - along with structural information, each paragraph being identifiable by its paragraph number (if it has one which almost all of them do), and the substantive text of each paragraph quite naturally being searchable (both in formatted and clean text versions for searching); and -
  • -
    -

    -   190 -

    -
    - - -
    -
  • - a third containing endnotes cross-referenced back to the paragraph from which they are referenced (both in formatted and clean text versions for searching). -
  • -
    -

    -   191 -

    -
    - - -
    -
  • - a fourth table with a one to one relation with the headers table contains full text versions of output, eg. pdf, html, xml, and ascii. -
  • -
    -

    -   192 -

    -
    - - -
    -

    - There is of course the possibility to add further structures. -

    -
    -

    -   193 -

    -
    - - -
    -

    - At this level SiSU loads a relational database with documents broken in to their smallest logical structurally constituent parts, as text objects, with their object citation number and all other structural information needed to construct the structured document. Text is stored (at this text object level) with and without elementary markup tagging, the stripped version being so as to facilitate ease of searching. -

    -
    -

    -   194 -

    -
    - - -
    -

    - Because the document structure of sites created is clearly defined, and the text object citation system is available for all forms of output, it is possible to search the sql database, and either read results from that database, or just as simply map the results to the html output, which has richer text markup. -

    -
    -

    -   195 -

    -
    - - -
    -

    - The combination of the SiSU citation system with a relational database is pretty powerful, giving rise to several possibilities. As individual text objects of a document stored (and indexed) together with object numbers, and all versions of the document have the same numbering, complex searches can be tailored to return just the locations of the search results relevant for all available output formats, with live links to the precise locations in the database or in html/xml documents; or, the structural information provided makes it possible to search the full contents of the database and have headings in which search content appears, or to search only headings etc. (as the Dublin Core is incorporated it is easy to make use of that as well). -

    -
    -

    -   196 -

    -
    - - -
    -

    - This is a larger scale project, (with little development on the front end largely ignored), though the "infrastructure" has been in place since 2002. -

    -
    -

    -   197 -

    -
    - - -
    - -

    - 1.15.6 Search - database frontend sample, utilising database and SiSU features, including object citation numbering (backend currently PostgreSQL) -

    -
    -

    -   198 -

    -
    - - -
    -

    - Sample search frontend   66  A small database and sample query front-end (search from) that makes use of the citation system, object citation numbering to demonstrates functionality.  67  -

    -
    -

    -   199 -

    -
    - - -
    -

    - SiSU can provide information on which documents are matched and at what locations within each document the matches are found. These results are relevant across all outputs using object citation numbering, which includes html, XML, LaTeX, PDF and indeed the SQL database. You can then refer to one of the other outputs or in the SQL database expand the text within the matched objects (paragraphs) in the documents matched. -

    -
    -

    -   200 -

    -
    - - -
    -

    - (further work needs to be done on the sample search form, which is rudimentary and only passes simple booleans correctly at present to the SQL engine) -

    -
    -

    -   201 -

    -
    - - -
    -

    - A few canned searches, showing object numbers. Search for: -

    -
    -

    -   202 -

    -
    - - -
    -

    - English documents matching Linux OR Debian -

    -
    -

    -   203 -

    -
    - - -
    -

    - GPL OR Richard Stallman -

    -
    -

    -   204 -

    -
    - - -
    -

    - invention OR innovation in English language -

    -
    -

    -   205 -

    -
    - - -
    -

    - copyright in English language documents -

    -
    -

    -   206 -

    -
    - - -
    -

    - Note that the searches done in this form are case sensitive. -

    -
    -

    -   207 -

    -
    - - -
    -

    - Expand those same searches, showing the matching text in each document: -

    -
    -

    -   208 -

    -
    - - -
    -

    - English documents matching Linux OR Debian -

    -
    -

    -   209 -

    -
    - - -
    -

    - GPL OR Richard Stallman -

    -
    -

    -   210 -

    -
    - - -
    -

    - invention OR innovation in English language -

    -
    -

    -   211 -

    -
    - - -
    -

    - copyright in English language documents -

    -
    -

    -   212 -

    -
    - - -
    -

    - Note you may set results either for documents matched and object number locations within each matched document meeting the search criteria; or display the names of the documents matched along with the objects (paragraphs) that meet the search criteria.  68  -

    -
    -

    -   213 -

    -
    - - -
    -

    - OCN index mode, (object citation number) the numbers displayed are relevant (and may be used to reference the match) in any sisu generated rendition of the text  69  the links provided are to the locations of matches within the html generated by SiSU. -

    -
    -

    -   214 -

    -
    - - -
    -

    - Paragraph mode, you may alternatively display the text of each paragraph in which the match was made, again the object/paragraph numbers are relevant to any SiSU generated/published text. -

    -
    -

    -   215 -

    -
    - - -
    -

    - Several options for output - select database to search, show results in index view (links to locations within text), show results with text, echo search in form, show what was searched, create and show a "canned url" for search, show available search fields. Also shows counters number of documents in which found and number of locations within documents where found. [could consider sorting by document with most occurrences of the search result]. -

    -
    -

    -   216 -

    -
    - - -
    -

    - Earlier version of the search frontend - Simple search, results with files in which search found, and locations where found within files. -

    -
    -

    -   217 -

    -
    - - -
    -

    - Simple search, results with files in which search found, and text object (paragraph or endnote) where found within files. -

    -
    -

    -   218 -

    -
    - - -
    - -

    - 1.15.7 Other forms -

    -
    -

    -   219 -

    -
    - - -
    -

    - There are other forms as well, YAML file, Ruby Marshal dumps, document pre-processing (processing of documents prior to the steps described here, to produce input suitable for the program) snap in a new module as required/desired, well formed XML, no problem. -

    -
    -

    -   220 -

    -
    - - -
    - -

    - 1.16 Concordance / Word Map or rudimentary index -

    -
    -

    -   221 -

    -
    - - -
    -

    - Concordance /WordMaps:  70  SiSU produces a rudimentary index based on the words within the text, making use of paragraph numbers to identify text locations. This is generated in html and hyper-linked but identifies these words locations in the other document formats. Though it is possible to search using a search engine, this is a means for browsing an alphabetical list of words which may suggest other useful content. -

    -
    -

    -   222 -

    -
    - - -
    - -

    - 1.17 Managed (document) directory, database, or site structure -

    -
    -

    -   223 -

    -
    - - -
    -

    - SiSU builds the web site (or more generically provides a suitable directory structure) - placing various output texts in the hierarchy of the web-site (or db), which (for directories) is a sub-directory with the name of the text file. -

    -
    -

    -   224 -

    -
    - - -
    - -

    - 1.18 Batch processing -

    -
    -

    -   225 -

    -
    - - -
    -

    - SiSU is a batch processing tool, handling and transforming multiple (or individual) documents (in many ways) with a single instruction. -

    -
    -

    -   226 -

    -
    - - -
    - -

    - 1.19 Integration to superior Gnu/Linux and Unix tools -

    -
    -

    -   227 -

    -
    - - -
    -

    - As should have been noted by the above description of SiSU, it makes use of existing programs found on Gnu /Linux and Unix, amongst those already mentioned include the LaTeX to pdf converters and the database PostgreSQL or SQLite. -

    -
    -

    -   228 -

    -
    - - -
    - -

    - 1.19.1 Backup and version control -

    -
    -

    -   229 -

    -
    - - -
    -

    - Unix provides many tools for version control. For documents Subversion, CVS and even the old RCS are useful for the per-document histories they provide. -

    -
    -

    -   230 -

    -
    - - -
    -

    - For writing code superior (more recent) version control system exist. These can also be used for documents though they tend to take stamps of changes across the repository as a whole, rather than for each individual file that is tracked, (as CVS and RCS do). My personal preference is for distributed systems such as Git, Mercurial or Darcs, of which I use Git for both code and documents. -

    -
    -

    -   231 -

    -
    - - -
    -

    - Several backup tools exist. At the base level I tend to use rdiff. -

    -
    -

    -   232 -

    -
    - - -
    - -

    - 1.19.2 Editor support -

    -
    -

    -   233 -

    -
    - - -
    -

    - SiSU documents are prepared / marked up in utf-8 text you are free to use the text editor of your choice. -

    -
    -

    -   234 -

    -
    - - -
    -

    - Syntax highlighting for a number of editors are provided. Amongst them Vim, Kwrite, Kate, Gedit and diakonos. These may be found with configuration instructions at <http://www.jus.uio.no/sisu/syntax_highlight>. Vim   71  as of version 7 has built in sytax highlighting for SiSU. -

    -
    -

    -   235 -

    -
    - - -
    - -

    - 1.20 Modular design, need something new add a module -

    -
    -

    -   236 -

    -
    - - -
    -

    - Need a new output format that does not already exist, write a new module. -

    -
    -

    -   237 -

    -
    - - -
    -

    - Prefer a new input syntax, you could write a new syntax matching the existing design, though my personal preference is some uniformity in entry appearance. If necessary has been fairly easy to extend the design parameters. It is intended to incorporate some additional basic semantic tagging, (book, article, author etc.) However, keeping the requirements for input minimal, and relatively simple has been a design goal. -

    -
    -

    -   238 -

    -

    -^

    - - -
    -

    -  1. This information was first placed on the web 12 November 2002; with predating material taken from <http://www.jus.uio.no/lm/lm.information/toc.html> part of a site started and developed since 1993. See document metadata section <http://www.jus.uio.no/sisu/SiSU/metadata.html> for information on this version. Dates related to the development of SiSU are mostly contained within the Chronology section of this document, e.g. <http://www.jus.uio.no/sisu/sisu_chronology> -

    -
      -
    - - -
    -

    -  2. also chosen for the meaning of the Finnish term "sisu". -

    -
      -
    - - -
    -

    -  3. objects include: headings, paragraphs, verse, tables, images, but not footnotes/endnotes which are numbered separately and tied to the object from which they are referenced. -

    -
      -
    - - -
    -

    -  4. pdf via LaTeX or lout -

    -
      -
    - - -
    -

    -  5. currently html (two forms of html presentation one based on css the other on tables), and PHP; potentially structured XML -

    -
      -
    - - -
    -

    -  6. any SQL - currently PostgreSQL and sqlite (for portability, testing and development) -

    -
      -
    - - -
    -

    -  7. previously called "text object numbering" -

    -
      -
    - - -
    -

    -  8. SiSU Sabaki, release version. Pre-release version SiSU Scribe, and version prior to that SiSU nicknamed Scribbler. Pre-release versions go back several years. Both Scribbler and Scribe (still maintained) made system calls to SiSU's various parts, instead of using libraries. -

    -
      -
    - - -
    -

    -  9. A little universe it may be, but semantic you may have a hard time getting away with, given the meaning the word has taken on with markup. On a document wide basis semantic information may be provided, which can be really useful, (and meaningful, especially) if you have a large document set, and use this with rss feeds or in an sql database etc. On a markup level, I have little inclination to add semantic markup formally beyond references, title, author [Dublin Core entities? addresses?] etc. Actually this deserves a bit of thought possibly use letter tags (including letter alias/synonyms for font faces) to create a small set of default semantic tags, with the possibility for per document adjustments. Will seek to permit XML entity tagging, within SiSU markup and have that ignored/removed by the parts of the program that have no use for it. -

    -
      -
    - - -
    -

    -  10. "Sisu refers not to the courage of optimism, but to a concept of life that says, 'I may not win, but I will gladly give my life for what I believe.'" Aini Rajanen, Of Finnish Ways, 1981, p. 10. -

    -
      -
    - - -
    -

    - <http://www.humanlanguages.com/finnishenglish/rlfs.htm> -

    -
      -
    - - -
    -

    - "Every Finn has his own pet definition. To me, sisu means patience without passion. But there are many varieties of sisu. Sisu can be a sudden outburst or it can be the kind that lasts. A man can have both kinds. It is outside reason. It is something in the soul. It comes from oneself. For instance, it makes a soldier do things because he himself must, not because he has been told." Paavo Nurmi -

    -
      -
    - - -
    -

    - <http://personalweb.smcvt.edu/tmatikainen/finnishtraditions.htm> -

    -
      -
    - - -
    -

    -  11. <http://www.perl.org/> -

    -
      -
    - - -
    -

    -  12. <http://www.ruby-lang.org/en/> -

    -
      -
    - - -
    -

    -  13. <http://www.debian.org/> -

    -
      -
    - - -
    -

    -  14. where explicit structure is provided through the use of tagging headings, it could be reduced (still) further, for example by reducing the number of characters used to identify heading levels; but in many cases even that information is not required as regular expressions can be used to extract the implicit structure. -

    -
      -
    - - -
    -

    -  15. This proved to be the easiest way to develop syntax, changes could be made, or alternatives provided for the markup syntax whilst the intermediate markup syntax was largely held constant. There is actually an optional second intermediate markup format in YAML <http://www.yaml.org/> -

    -
      -
    - - -
    -

    -  16. objects include: headings, paragraphs, verse, tables, images, but not footnotes/endnotes which are numbered separately and tied to the object from which they are referenced. -

    -
      -
    - - -
    -

    -  17. where explicit structure is provided through the use of tagging headings, it could be reduced (still) further, for example by reducing the number of characters used to identify heading levels; but in many cases even that information is not required as regular expressions can be used to extract the implicit structure. -

    -
      -
    - - -
    -

    -  18. <http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst> output provided as example in the next section -

    -
      -
    - - -
    -

    -  19. <http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html> as it would appear with syntax highlighting (by vim) -

    -
      -
    - - -
    -

    -  20. seems there are several "smart ASCIIs" available, primarily for ascii to html conversion, that make this, and reasonable looking ascii their goal -

    -
      -
    - - -
    -

    - <http://webseitz.fluxent.com/wiki/SmartAscii> -

    -
      -
    - - -
    -

    - <http://daringfireball.net/projects/markdown/> -

    -
      -
    - - -
    -

    - <http://www.textism.com/tools/textile/> -

    -
      -
    - - -
    -

    -  21. These include richly laid out and linked html (table or css variants), PHP, LaTeX (from which pdf portrait and landscape documents are produced), texinfo (for info files etc.), and PostgreSQL and/or SQLite. And the opportunity to fairly easily build additional modules, such as XML. See the examples provided in this document. -

    -
      -
    - - -
    -

    -  22. <http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst> -

    -
      -
    - - -
    -

    -  23. <http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html> -

    -
      -
    - - -
    -

    -  24. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/toc.html> -

    -
      -
    - - -
    -

    -  25. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/doc.html> -

    -
      -
    - - -
    -

    -  26. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/landscape.pdf> -

    -
      -
    - - -
    -

    -  27. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/portrait.pdf> -

    -
      -
    - - -
    -

    -  28. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/plain.txt> -

    -
      -
    - - -
    -

    -  29. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/sax.xml> -

    -
      -
    - - -
    -

    -  30. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/dom.xml> -

    -
      -
    - - -
    -

    -  31. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/concordance.html> -

    -
      -
    - - -
    -

    -  32. discontinued for the time being -

    -
      -
    - - -
    -

    -  33. SiSU markup syntax, an incomplete summary: <http://www.jus.uio.no/sisu/sisu_markup_table/doc.html#h200306> -

    -
      -
    - - -
    -

    - Visual check of elementary font face modifiers: bold bold emphasis italics underscore strikethrough superscript subscript -

    -
      -
    - - -
    -

    -  34. the program that generates the documents -

    -
      -
    - - -
    -

    -  35. <http://www.jus.uio.no/sisu/SiSU/1.html#search> -

    -
      -
    - - -
    -

    -  36. <http://dublincore.org/> -

    -
      -
    - - -
    -

    -  37. <http://www.w3.org/RDF/> -

    -
      -
    - - -
    -

    -  38. The previous way was directory associations for file output were set up in the configuration file. The present system is a more natural way to work requireing less configuration. -

    -
      -
    - - -
    -

    -  39. from a version control system such as CVS -

    -
      -
    - - -
    -

    -  40. The version control system must be run, so the version number is obtained, prior to the SiSU document generation, and subsequent posting of the document. -

    -
      -
    - - -
    -

    -  41. e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), SQL (e.g. document set searches; representation of the constituent parts of documents based on their structure, headings, chapters, paragraphs as desired; control of use) -

    -
      -
    - - -
    -

    -  42. CISG <http://www.jus.uio.no/lm/un_contracts_international_sale_of_goods_convention_1980/doc> -

    -
      -
    - - -
    -

    - The Unidroit Contract Principles <http://www.jus.uio.no/lm/unidroit.contract.principles.1994/doc> or -

    -
      -
    - - -
    -

    - The Autonomous Contract <http://www.jus.uio.no/lm/autonomous.contract.2000.amissah/doc> -

    -
      -
    - - -
    -

    -  43. Scrolling is not however necessarily confined to full length documents as you will have to scroll to get to the bottom of any long segment (eg. chapter) of a segmented text. -

    -
      -
    - - -
    -

    -  44. CISG <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980> -

    -
      -
    - - -
    -

    - The Unidroit Principles <http://www.jus.uio.no/lm/unidroit.contract.principles.1994> -

    -
      -
    - - -
    -

    - The Autonomous Contract <http://www.jus.uio.no/sisu/the.autonomous.contract.2000.amissah> or -

    -
      -
    - - -
    -

    - WTA 1994 <http://www.jus.uio.no/lm/wta.1994> -

    -
      -
    - - -
    -

    -  45. formatting possibility still exists in code tree but maintenance has been largely discontinuted. -

    -
      -
    - - -
    -

    -  46. <http://www.mozilla.org/products/firefox/> -

    -
      -
    - - -
    -

    -  47. <http://kazehakase.sourceforge.jp/> -

    -
      -
    - - -
    -

    -  48. <http://www.konqueror.org/> -

    -
      -
    - - -
    -

    -  49. <http://www.mozilla.org/> -

    -
      -
    - - -
    -

    -  50. <http://www.microsoft.com/windows/ie/default.asp> -

    -
      -
    - - -
    -

    -  51. <http://home.netscape.com/comprod/mirror/client_download.html> -

    -
      -
    - - -
    -

    -  52. <http://www.opera.com/> -

    -
      -
    - - -
    -

    -  53. <http://www.dillo.org/> -

    -
      -
    - - -
    -

    -  54. <http://www.gnome.org/projects/epiphany/> -

    -
      -
    - - -
    -

    -  55. <http://galeon.sourceforge.net/> -

    -
      -
    - - -
    -

    -  56. <http://elinks.or.cz/> -

    -
      -
    - - -
    -

    -  57. <http://links.twibright.com/> -

    -
      -
    - - -
    -

    -  58. <http://w3m.sourceforge.net/> -

    -
      -
    - - -
    -

    -  59. LaTeX and pdf features introduced 18th June 2001, Landscape and portrait pdfs introduced 7th October 2001., Lout is a more recent addition 22th April 2003 -

    -
      -
    - - -
    -

    -  60. <http://www.adobe.com/products/acrobat/readstep2.html> -

    -
      -
    - - -
    -

    -  61. <http://www.gnome.org/projects/evince/> -

    -
      -
    - - -
    -

    -  62. <http://www.foolabs.com/xpdf/> -

    -
      -
    - - -
    -

    -  63. <http://www.postgresql.org/> -

    -
      -
    - - -
    -

    - <http://advocacy.postgresql.org/> -

    -
      -
    - - -
    -

    - <http://en.wikipedia.org/wiki/Postgresql> -

    -
      -
    - - -
    -

    -  64. <http://www.hwaci.com/sw/sqlite/> -

    -
      -
    - - -
    -

    - <http://en.wikipedia.org/wiki/Sqlite> -

    -
      -
    - - -
    -

    -  65. Relational database features retaining document structure and citation introduced 15th July 2002 -

    -
      -
    - - -
    -

    -  66. <http://search.sisudoc.org> -

    -
      -
    - - -
    -

    -  67. (which could be extended further with current back-end). As regards scaling of the database, it is as scalable as the database (here Postgresql) and hardware allow. -

    -
      -
    - - -
    -

    -  68. of this feature when demonstrated to an IBM software innovations evaluator in 2004 he said to paraphrase: this could be of interest to us. We have large document management systems, you can search hundreds of thousands of documents and we can tell you which documents meet your search criteria, but there is no way we can tell you without opening each document where within each your matches are found. -

    -
      -
    - - -
    -

    -  69. OCN are provided for HTML, XML, pdf ... though currently omitted in plain-text and opendocument format output -

    -
      -
    - - -
    -

    -  70. Concordance/ WordMaps introduced 15th August 2002 -

    -
      -
    - - -
    -

    -  71. <http://www.vim.org/> -

    -
      -
     
    - - - -
    - -
    SiSU -->
    -
    - - - - - - - - - - - - - - - - - -
    - - - TOC linked  toc  - - - - - - Full Text  scroll  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
    -
    - - - - - - -
    - - TOC - - - - next >> - - -
    -
    - - - - -
    - - < - - - - ^ - - - - > - -

    - - - - - - - - -
    - -
    -

    - - SiSU - -

    -
    - -

    - Output generated by - - SiSU - - 0.59.1 2007-09-25 (2007w39/2) -
    - - SiSU - - Copyright © Ralph Amissah - 1997, current 2007. - All Rights Reserved. -
    - - SiSU - - is software for document structuring, publishing and search, -
    - - www.jus.uio.no/sisu - - and - - www.sisudoc.org - -
    - w3 since October 3 1993 - - ralph@amissah.com - -

    -
    -

    - SiSU using: -
    Standard SiSU markup syntax, -
    Standard SiSU meta-markup syntax, and the -
    Standard SiSU object citation numbering and system, (object/text positioning system) -
    - Copyright © Ralph Amissah 1997, current 2007. - All Rights Reserved. -

    -
    -

    - - GPLv3 - -

    -
    -

    - - SiSU - - is released under - GPLv3 - or later, - <http://www.gnu.org/licenses/gpl.html> -

    -
    -

    - SiSU, developed using - - Ruby - - on - - Debian/Gnu/Linux - - software infrastructure, - with the usual GPL (or OSS) suspects. -
    - Better - "performance, reliability, scalability, security & total cost of ownership" - [not to mention flexibility & choice] use of and adherence to open standards (where practical and fair) and it is software libre. -
    - Get With the Future - - Way Better! - -

    -
    - - - - - -
    -
    - - -
    - - -
    - - - - idx - txt - - -
    -
    - -
    -

    - - -SiSU manual - -

    - - - - - - - - - - - -
    -

    - - -SiSU - -

    - - - - - - - - - - - - - - - - -
    - \ No newline at end of file diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/concordance.html b/data/doc/manuals_generated/sisu_manual/sisu_description/concordance.html deleted file mode 100644 index b82e2ea0..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/concordance.html +++ /dev/null @@ -1,2819 +0,0 @@ - - - - - SiSU created WordIndex for: SiSU - Description - - - - - - - - - - - - - - - - -
    - -
    SiSU -->
    -
    - - - - - - - - - - - - - - -
    - - - TOC linked  toc  - - - - - - Full Text  scroll  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
    -
    -   - TOC -   -
    -

    -
     

    SiSU - Description

    Ralph Amissah

    - -

    Word index links are to html versions of the text the segmented version followed by the scroll (single document) version.
    [For segmented text references [T1], [T2] or [T3] appearing without a link, indicates that the word appears in a title (or subtitle) of the text (that is identifiable by the appended object citation number).]

    -

    (The word listing/index is Case sensitive: Capitalized words appear before lower case)

    -

    - word (number of occurences)
    linked references to word within document
    - [if number of occurences exceed number of references - word occurs more than once in at least one reference. Footnote/endnotes are either assigned to the paragraph from which they are referenced or ignored, so it is relevant to check the footnotes referenced from within a paragraph as well.] -

    -

    - (After the page is fully loaded) you can jump directly to a word by appending a hash (#) and the word to the url for this text, (do not forget that words are case sensitive, and may be listed twice (starting with and without an upper case letter)), #your_word # [ http://[web host]/sisu_description/concordance.html#your_word ] -

    - -

    Able

    (3)

    -

    8, 57, 106,

    -

    About

    (4)

    -

    57, 98, 107,

    -

    Above

    (1)

    -

    228,

    -

    Abstraction

    (2)

    -

    57,

    -

    Academic

    (1)

    -

    23,

    -

    Acceptable

    (2)

    -

    15, 65,

    -

    Accepted

    (1)

    -

    31,

    -

    Accurate

    (1)

    -

    23,

    -

    Accurately

    (3)

    -

    97, 98, 176,

    -

    Across

    (19)

    -

    8, 9, 15, 32, 33, 53, 84, 85, 91, 92, 94, 97, 98, 137, 158, 160, 176, 200, 231,

    -

    Actually

    (2)

    -

    14, 22,

    -

    Add

    (4)

    -

    14, 44, 193, 236,

    -

    Added

    (3)

    -

    43, 44, 143,

    -

    Adding

    (1)

    -

    36,

    -

    Addition

    (7)

    -

    23, 34, 41, 81, 90, 99, 182,

    -

    Additional

    (4)

    -

    68, 100, 187, 238,

    -

    Addressed

    (1)

    -

    8,

    -

    Addresses

    (1)

    -

    14,

    -

    Adjustments

    (1)

    -

    14,

    -

    Adobe

    (1)

    -

    183,

    -

    Advance

    (1)

    -

    35,

    -

    Advantage

    (6)

    -

    15, 16, 31, 57, 59, 101,

    -

    Advantages

    (2)

    -

    19, 141,

    -

    Again

    (1)

    -

    215,

    -

    Against

    (3)

    -

    44, 50, 57,

    -

    Age

    (1)

    -

    8,

    -

    Aini

    (1)

    -

    14,

    -

    Alias

    (1)

    -

    14,

    -

    All

    (34)

    -

    11, 16, 20, 33, 35, 36, 40, 59, 66, 68, 91, 92, 94, 96, 97, 98, 102, 103, 113, 187, 188, 190, 194, 195, 196, 200,

    -

    Allow

    (3)

    -

    8, 187, 199,

    -

    Almost

    (3)

    -

    11, 28, 190,

    -

    Along

    (3)

    -

    14, 190, 213,

    -

    Alphabetical

    (1)

    -

    222,

    -

    Already

    (2)

    -

    228, 237,

    -

    Also

    (17)

    -

    6, 14, 23, 29, 30, 31, 33, 98, 133, 148, 149, 168, 178, 182, 216, 231,

    -

    Alternative

    (3)

    -

    44, 100, 187,

    -

    Alternatively

    (1)

    -

    215,

    -

    Alternatives

    (1)

    -

    22,

    -

    Always

    (1)

    -

    118,

    -

    Amissah

    (1)

    -

    [H]1,

    -

    Amongst

    (4)

    -

    31, 104, 228, 235,

    -

    Amount

    (1)

    -

    65,

    -

    Amounts

    (1)

    -

    61,

    -

    Anagram

    (1)

    -

    14,

    -

    And

    (225)

    -

    5, 6, 7, 8, 9, 11, 13, 14, 15, 16, 17, 18, 22, 23, 25, 28, 29, 30, 31, 32, 33, 35, 36, 37, 42, 43, 44, 45, 49, 50, 51, 52, 53, 54, 55, 57, 59, 65, 66, 68, 81, 82, 83, 84, 85, 88, 90, 91, 92, 93, 96, 97, 98, 99, 103, 106, 107, 108, 110, 121, 125, 130, 131, 134, 136, 139, 140, 143, 145, 149, 151, 153, 156, 172, 176, 178, 181, 182, 186, 187, 189, 190, 191, 192, 194, 195, 196, 198, 199, 200, 201, 213, 214, 216, 217, 218, 222, 226, 227, 228, 229, 230, 231, 235, 238,

    -

    Another

    (5)

    -

    16, 44, 59, 190,

    -

    Any

    (7)

    -

    8, 11, 57, 149, 187, 214, 215,

    -

    Appear

    (1)

    -

    64,

    -

    Appearance

    (3)

    -

    39, 90, 238,

    -

    Appeared

    (1)

    -

    15,

    -

    Appears

    (2)

    -

    93, 196,

    -

    Application

    (1)

    -

    136,

    -

    Applied

    (1)

    -

    7,

    -

    April

    (1)

    -

    182,

    -

    Arbitration

    (1)

    -

    112,

    -

    Are

    (78)

    -

    5, 7, 8, 14, 15, 20, 28, 33, 34, 41, 43, 44, 55, 57, 65, 68, 84, 85, 90, 96, 98, 99, 113, 114, 121, 122, 133, 134, 137, 145, 147, 148, 151, 152, 154, 159, 178, 182, 188, 191, 200, 207, 213, 214, 215, 220, 230, 234, 235,

    -

    Article

    (1)

    -

    238,

    -

    Articles

    (1)

    -

    151,

    -

    Ascii

    (4)

    -

    65, 77, 192,

    -

    ASCIIs

    (1)

    -

    65,

    -

    Assigned

    (2)

    -

    57, 99,

    -

    Associate

    (1)

    -

    111,

    -

    Associated

    (3)

    -

    50, 117, 136,

    -

    Association

    (3)

    -

    110, 121,

    -

    Associations

    (3)

    -

    35, 110, 114,

    -

    Assuming

    (1)

    -

    125,

    -

    Attachments

    (1)

    -

    50,

    -

    Attempt

    (1)

    -

    [H]2,

    -

    Attractive

    (1)

    -

    98,

    -

    Attributes

    (1)

    -

    30,

    -

    August

    (1)

    -

    222,

    -

    Author

    (3)

    -

    14, 189, 238,

    -

    Authored

    (1)

    -

    57,

    -

    Authors

    (1)

    -

    98,

    -

    Auto

    (3)

    -

    121, 129,

    -

    Automated

    (2)

    -

    92, 98,

    -

    Automatic

    (1)

    -

    133,

    -

    Automatically

    (7)

    -

    50, 98, 130, 132, 187,

    -

    Autonomous

    (2)

    -

    147, 151,

    -

    Available

    (20)

    -

    9, 15, 23, 34, 48, 65, 66, 68, 84, 85, 92, 103, 108, 118, 126, 154, 195, 196, 216,

    -

    Away

    (2)

    -

    14,

    -

    Back

    (3)

    -

    14, 191, 199,

    -

    Backed

    (1)

    -

    46,

    -

    Backend

    (1)

    -

    198,

    -

    Background

    (1)

    -

    136,

    -

    Backup

    (2)

    -

    229, 232,

    -

    Base

    (1)

    -

    232,

    -

    Based

    (18)

    -

    8, 16, 37, 57, 59, 92, 106, 128, 141, 143, 144, 153, 155, 157, 222,

    -

    Basic

    (10)

    -

    11, 29, 30, 51, 62, 63, 64, 96, 238,

    -

    Basis

    (3)

    -

    14, 99, 136,

    -

    Batch

    (4)

    -

    12, 41, 225, 226,

    -

    Because

    (3)

    -

    14, 195,

    -

    Become

    (1)

    -

    33,

    -

    Been

    (15)

    -

    8, 14, 15, 22, 54, 65, 99, 157, 197, 228, 238,

    -

    Behaviour

    (1)

    -

    104,

    -

    Being

    (11)

    -

    55, 81, 93, 94, 96, 111, 152, 182, 190, 194,

    -

    Believe

    (1)

    -

    14,

    -

    Belong

    (2)

    -

    96, 98,

    -

    Below

    (3)

    -

    8, 21, 145,

    -

    Benefit

    (1)

    -

    9,

    -

    Best

    (4)

    -

    9, 16, 55, 59,

    -

    Better

    (1)

    -

    187,

    -

    Beyond

    (2)

    -

    14, 30,

    -

    Bit

    (2)

    -

    14, 35,

    -

    Blocks

    (1)

    -

    92,

    -

    Body

    (1)

    -

    125,

    -

    Bold

    (4)

    -

    30, 83, 85,

    -

    Book

    (4)

    -

    9, 16, 59, 238,

    -

    Booleans

    (1)

    -

    201,

    -

    Born

    (1)

    -

    15,

    -

    Both

    (10)

    -

    14, 57, 98, 107, 108, 190, 191, 231,

    -

    Bottom

    (2)

    -

    149,

    -

    Break

    (2)

    -

    16, 59,

    -

    Breaks

    (2)

    -

    16, 59,

    -

    Broken

    (1)

    -

    194,

    -

    Browser

    (2)

    -

    93, 97,

    -

    Browsers

    (5)

    -

    158, 159, 160, 168, 176,

    -

    Browsing

    (2)

    -

    172, 222,

    -

    Bug

    (1)

    -

    104,

    -

    Build

    (2)

    -

    68, 122,

    -

    Building

    (2)

    -

    23, 42,

    -

    Builds

    (1)

    -

    224,

    -

    Built

    (1)

    -

    235,

    -

    Bundled

    (1)

    -

    50,

    -

    But

    (15)

    -

    7, 14, 16, 41, 53, 55, 57, 59, 97, 157, 213, 222,

    -

    Called

    (4)

    -

    9, 50, 98, 122,

    -

    Calls

    (1)

    -

    14,

    -

    Can

    (30)

    -

    7, 8, 11, 14, 16, 32, 44, 59, 95, 100, 113, 130, 132, 188, 196, 200, 213, 231,

    -

    Canned

    (2)

    -

    202, 216,

    -

    Cares

    (1)

    -

    143,

    -

    Cascading

    (1)

    -

    153,

    -

    Case

    (5)

    -

    13, 16, 59, 141, 207,

    -

    Cases

    (5)

    -

    11, 16, 40, 59, 85,

    -

    Certain

    (2)

    -

    104, 125,

    -

    Certificates

    (1)

    -

    31,

    -

    Certification

    (2)

    -

    37, 57,

    -

    Change

    (3)

    -

    93, 96,

    -

    Changed

    (4)

    -

    7, 39, 93, 104,

    -

    Changes

    (4)

    -

    22, 90, 231,

    -

    Chapter

    (3)

    -

    16, 59, 149,

    -

    Chapters

    (4)

    -

    16, 59, 141, 151,

    -

    Characters

    (2)

    -

    16, 59,

    -

    Check

    (2)

    -

    37, 83,

    -

    Checks

    (1)

    -

    37,

    -

    Cheek

    (1)

    -

    14,

    -

    Choice

    (1)

    -

    234,

    -

    Choose

    (2)

    -

    98, 188,

    -

    Chose

    (1)

    -

    49,

    -

    Chosen

    (1)

    -

    6,

    -

    Chronology

    (1)

    -

    5,

    -

    Chunks

    (1)

    -

    7,

    -

    CISG

    (2)

    -

    147, 151,

    -

    Citation

    (29)

    -

    6, 9, 20, 23, 32, 91, 92, 95, 97, 98, 99, 100, 104, 142, 186, 187, 194, 195, 196, 198, 199, 200, 214,

    -

    Cite

    (2)

    -

    8, 88,

    -

    Citeable

    (1)

    -

    93,

    -

    Citing

    (2)

    -

    92, 124,

    -

    Cition

    (1)

    -

    92,

    -

    Claim

    (2)

    -

    92, 96,

    -

    Classification

    (1)

    -

    107,

    -

    Clean

    (3)

    -

    77, 190, 191,

    -

    Clearly

    (2)

    -

    35, 195,

    -

    Code

    (7)

    -

    89, 90, 157, 231,

    -

    Combination

    (2)

    -

    57, 196,

    -

    Comes

    (1)

    -

    14,

    -

    Command

    (1)

    -

    6,

    -

    Comment

    (2)

    -

    99,

    -

    Common

    (8)

    -

    6, 7, 8, 20, 32, 94, 98, 142,

    -

    Comparison

    (1)

    -

    37,

    -

    Complex

    (1)

    -

    196,

    -

    Composite

    (1)

    -

    42,

    -

    Comprising

    (1)

    -

    57,

    -

    Computer

    (1)

    -

    107,

    -

    Concept

    (2)

    -

    14, 55,

    -

    Concordance

    (7)

    -

    31, 36, 57, 80, 221, 222,

    -

    Configuration

    (3)

    -

    110, 235,

    -

    Confined

    (1)

    -

    149,

    -

    Consider

    (1)

    -

    216,

    -

    Considerable

    (4)

    -

    23, 43, 44, 57,

    -

    Considerably

    (1)

    -

    66,

    -

    Considerations

    (1)

    -

    37,

    -

    Consisting

    (1)

    -

    36,

    -

    Consists

    (1)

    -

    57,

    -

    Console

    (1)

    -

    172,

    -

    Constant

    (1)

    -

    22,

    -

    Constituent

    (4)

    -

    16, 59, 141, 194,

    -

    Construct

    (1)

    -

    194,

    -

    Constructing

    (1)

    -

    23,

    -

    Construction

    (1)

    -

    23,

    -

    Contain

    (3)

    -

    30, 100, 104,

    -

    Contained

    (1)

    -

    5,

    -

    Containing

    (2)

    -

    189, 191,

    -

    Contains

    (1)

    -

    192,

    -

    Content

    (17)

    -

    7, 23, 31, 32, 37, 38, 41, 57, 89, 90, 196, 222,

    -

    Contents

    (4)

    -

    85, 127, 128, 196,

    -

    Contract

    (3)

    -

    147, 151,

    -

    Control

    (12)

    -

    16, 37, 59, 123, 125, 141, 229, 230, 231,

    -

    Controlled

    (1)

    -

    39,

    -

    Convenient

    (2)

    -

    15, 23,

    -

    Convention

    (2)

    -

    63, 70,

    -

    Conversion

    (4)

    -

    16, 59, 65, 141,

    -

    Convert

    (1)

    -

    51,

    -

    Converted

    (3)

    -

    15, 22, 29,

    -

    Converters

    (1)

    -

    228,

    -

    Copyright

    (4)

    -

    92, 96, 206, 212,

    -

    Core

    (6)

    -

    14, 30, 105, 106, 189, 196,

    -

    Corrected

    (1)

    -

    104,

    -

    Correctly

    (1)

    -

    201,

    -

    Could

    (9)

    -

    16, 22, 59, 90, 187, 199, 213, 216, 238,

    -

    Counter

    (1)

    -

    133,

    -

    Counters

    (1)

    -

    216,

    -

    Couple

    (1)

    -

    114,

    -

    Courage

    (1)

    -

    14,

    -

    Course

    (2)

    -

    51, 193,

    -

    Cover

    (1)

    -

    54,

    -

    Covers

    (1)

    -

    51,

    -

    Create

    (3)

    -

    14, 57, 216,

    -

    Created

    (5)

    -

    16, 59, 111, 113, 195,

    -

    Creates

    (1)

    -

    35,

    -

    Creation

    (2)

    -

    42, 113,

    -

    Criteria

    (4)

    -

    7, 213,

    -

    Cross

    (3)

    -

    131, 134, 191,

    -

    Css

    (6)

    -

    8, 68, 143, 144, 155, 176,

    -

    Current

    (3)

    -

    15, 154, 199,

    -

    Currently

    (10)

    -

    8, 13, 31, 45, 182, 186, 188, 198, 214,

    -

    CVS

    (4)

    -

    37, 125, 230, 231,

    -

    Darcs

    (1)

    -

    231,

    -

    Data

    (4)

    -

    8, 90, 188,

    -

    Database

    (38)

    -

    7, 8, 13, 14, 20, 22, 23, 32, 33, 35, 45, 51, 97, 103, 104, 186, 187, 188, 194, 195, 196, 198, 199, 200, 216, 223, 228,

    -

    Databases

    (7)

    -

    9, 31, 33, 113, 143, 187,

    -

    Dates

    (1)

    -

    5,

    -

    Dealing

    (1)

    -

    12,

    -

    Debian

    (3)

    -

    15, 203, 209,

    -

    Deeper

    (1)

    -

    178,

    -

    Default

    (9)

    -

    14, 37, 117, 118, 119, 122, 132, 136, 137,

    -

    Define

    (1)

    -

    40,

    -

    Defined

    (3)

    -

    35, 121, 195,

    -

    Defines

    (1)

    -

    30,

    -

    Definition

    (1)

    -

    14,

    -

    Degree

    (4)

    -

    7, 33, 43, 44,

    -

    Deliberately

    (1)

    -

    85,

    -

    Demonstrated

    (1)

    -

    213,

    -

    Demonstrates

    (1)

    -

    199,

    -

    Dependency

    (1)

    -

    51,

    -

    Dependent

    (2)

    -

    13, 45,

    -

    Depending

    (3)

    -

    97, 98, 151,

    -

    Describe

    (1)

    -

    [H]2,

    -

    Described

    (5)

    -

    8, 14, 98, 145, 220,

    -

    Description

    (6)

    -

    [H]1, 3, 30, 105, 106, 228,

    -

    Descriptive

    (2)

    -

    14,

    -

    Deserves

    (1)

    -

    14,

    -

    Design

    (5)

    -

    65, 236, 238,

    -

    Designed

    (2)

    -

    87, 88,

    -

    Desired

    (8)

    -

    41, 42, 50, 81, 90, 121, 141, 220,

    -

    Detail

    (1)

    -

    100,

    -

    Determine

    (3)

    -

    16, 35, 59,

    -

    Determined

    (2)

    -

    65, 96,

    -

    Determining

    (1)

    -

    111,

    -

    Develop

    (1)

    -

    22,

    -

    Developed

    (3)

    -

    5, 53, 54,

    -

    Developers

    (1)

    -

    52,

    -

    Development

    (3)

    -

    5, 8, 197,

    -

    Diakonos

    (1)

    -

    235,

    -

    Different

    (15)

    -

    9, 16, 19, 31, 49, 57, 59, 85, 93, 97, 136, 140, 141,

    -

    Difficulty

    (1)

    -

    136,

    -

    Diffing

    (1)

    -

    38,

    -

    Digests

    (1)

    -

    31,

    -

    Digital

    (1)

    -

    32,

    -

    Dillo

    (1)

    -

    169,

    -

    Directed

    (1)

    -

    14,

    -

    Directories

    (3)

    -

    113, 114, 224,

    -

    Directory

    (31)

    -

    35, 39, 109, 110, 111, 113, 115, 116, 117, 119, 121, 122, 136, 223, 224,

    -

    Discern

    (1)

    -

    57,

    -

    Discontinued

    (2)

    -

    81, 157,

    -

    Discontinuted

    (1)

    -

    157,

    -

    Display

    (2)

    -

    213, 215,

    -

    Displayed

    (1)

    -

    214,

    -

    Distributed

    (1)

    -

    231,

    -

    Divided

    (1)

    -

    151,

    -

    Doc

    (3)

    -

    121, 147,

    -

    Docs

    (2)

    -

    112,

    -

    Document

    (142)

    -

    5, 6, 7, 8, 9, 14, 15, 16, 22, 29, 30, 31, 35, 36, 37, 39, 40, 41, 42, 43, 44, 46, 47, 49, 50, 51, 55, 57, 59, 68, 73, 74, 75, 76, 77, 78, 79, 84, 92, 98, 100, 101, 102, 103, 107, 108, 111, 117, 120, 121, 123, 124, 125, 126, 128, 136, 137, 141, 147, 149, 178, 179, 180, 186, 187, 194, 195, 196, 200, 208, 213, 216, 220, 222, 223, 230,

    -

    Documents

    (62)

    -

    7, 8, 14, 15, 16, 28, 31, 33, 34, 37, 40, 41, 42, 49, 50, 53, 54, 57, 59, 68, 90, 95, 97, 98, 104, 111, 117, 141, 145, 149, 152, 182, 188, 194, 196, 200, 203, 206, 209, 212, 213, 216, 220, 226, 230, 231, 234,

    -

    Does

    (3)

    -

    98, 133, 237,

    -

    Doing

    (2)

    -

    16, 59,

    -

    Dom

    (1)

    -

    79,

    -

    DOM

    (2)

    -

    178,

    -

    Domain

    (1)

    -

    54,

    -

    Done

    (8)

    -

    10, 30, 41, 57, 98, 111, 201, 207,

    -

    Doubt

    (1)

    -

    149,

    -

    Downloaded

    (1)

    -

    50,

    -

    Dubbed

    (1)

    -

    32,

    -

    Dublin

    (6)

    -

    14, 30, 105, 106, 189, 196,

    -

    Dumps

    (1)

    -

    220,

    -

    Each

    (20)

    -

    7, 16, 35, 37, 57, 59, 68, 98, 104, 136, 190, 200, 208, 213, 215, 231,

    -

    Earlier

    (2)

    -

    125, 217,

    -

    Early

    (1)

    -

    8,

    -

    Ease

    (1)

    -

    194,

    -

    Easier

    (1)

    -

    182,

    -

    Easiest

    (1)

    -

    22,

    -

    Easily

    (13)

    -

    8, 25, 29, 30, 34, 35, 37, 39, 68, 93, 143, 182,

    -

    Easy

    (14)

    -

    16, 35, 42, 44, 59, 82, 88, 90, 109, 111, 141, 196, 238,

    -

    Echo

    (1)

    -

    216,

    -

    Editor

    (4)

    -

    64, 71, 233, 234,

    -

    Editors

    (2)

    -

    48, 235,

    -

    Effective

    (1)

    -

    97,

    -

    Effectively

    (1)

    -

    15,

    -

    Efficiently

    (1)

    -

    15,

    -

    Effort

    (1)

    -

    15,

    -

    Either

    (5)

    -

    50, 57, 98, 195, 213,

    -

    Electronic

    (1)

    -

    9,

    -

    Elementary

    (2)

    -

    83, 194,

    -

    Elinks

    (1)

    -

    173,

    -

    Email

    (1)

    -

    50,

    -

    Embedded

    (1)

    -

    178,

    -

    Emphasis

    (2)

    -

    65, 83,

    -

    End

    (6)

    -

    98, 108, 126, 197, 199,

    -

    Endnote

    (5)

    -

    15, 98, 133, 218,

    -

    Endnotes

    (15)

    -

    7, 8, 57, 85, 96, 98, 131, 132, 133, 134, 191,

    -

    Engine

    (2)

    -

    201, 222,

    -

    English

    (6)

    -

    203, 205, 206, 209, 211, 212,

    -

    Entities

    (1)

    -

    14,

    -

    Entity

    (1)

    -

    14,

    -

    Entry

    (1)

    -

    238,

    -

    Epiphany

    (1)

    -

    170,

    -

    Equating

    (1)

    -

    7,

    -

    Equivalent

    (1)

    -

    66,

    -

    Equivalents

    (1)

    -

    182,

    -

    Especially

    (1)

    -

    14,

    -

    Established

    (2)

    -

    31, 57,

    -

    Etc

    (14)

    -

    11, 14, 30, 31, 35, 57, 68, 92, 97, 100, 102, 196, 238,

    -

    Evaluator

    (1)

    -

    213,

    -

    Even

    (3)

    -

    16, 59, 230,

    -

    Event

    (1)

    -

    11,

    -

    Every

    (1)

    -

    14,

    -

    Evince

    (1)

    -

    184,

    -

    Exact

    (2)

    -

    124, 125,

    -

    Exactly

    (1)

    -

    152,

    -

    Example

    (11)

    -

    16, 59, 62, 63, 69, 97, 98, 108, 126,

    -

    Examples

    (3)

    -

    68, 95, 178,

    -

    Excellent

    (1)

    -

    159,

    -

    Exceptionally

    (1)

    -

    99,

    -

    Exist

    (3)

    -

    231, 232, 237,

    -

    Existence

    (1)

    -

    15,

    -

    Existing

    (3)

    -

    104, 228, 238,

    -

    Exists

    (4)

    -

    119, 121, 136, 157,

    -

    Exp

    (1)

    -

    37,

    -

    Expand

    (2)

    -

    200, 208,

    -

    Expanded

    (1)

    -

    86,

    -

    Experimental

    (1)

    -

    16,

    -

    Explicit

    (2)

    -

    16, 59,

    -

    Explored

    (1)

    -

    54,

    -

    Explorer

    (1)

    -

    165,

    -

    Expression

    (1)

    -

    40,

    -

    Expressions

    (2)

    -

    16, 59,

    -

    Ext3

    (1)

    -

    45,

    -

    Extend

    (1)

    -

    238,

    -

    Extended

    (4)

    -

    30, 34, 199,

    -

    Extensible

    (1)

    -

    52,

    -

    Extract

    (2)

    -

    16, 59,

    -

    Eye

    (1)

    -

    29,

    -

    Face

    (1)

    -

    83,

    -

    Faces

    (1)

    -

    14,

    -

    Facilitate

    (1)

    -

    194,

    -

    Fairly

    (3)

    -

    57, 68, 238,

    -

    Far

    (2)

    -

    18, 140,

    -

    Faster

    (1)

    -

    152,

    -

    Feature

    (6)

    -

    84, 85, 86, 98, 99, 213,

    -

    Features

    (11)

    -

    8, 15, 21, 23, 24, 84, 159, 182, 186, 187, 198,

    -

    Featuring

    (1)

    -

    6,

    -

    Feed

    (1)

    -

    11,

    -

    Feeds

    (2)

    -

    14, 187,

    -

    Few

    (3)

    -

    8, 51, 202,

    -

    Fewer

    (1)

    -

    182,

    -

    Fields

    (2)

    -

    34, 216,

    -

    File

    (20)

    -

    13, 23, 26, 28, 35, 36, 45, 50, 63, 67, 68, 70, 110, 187, 220, 224, 231,

    -

    Filename

    (1)

    -

    50,

    -

    Files

    (17)

    -

    7, 12, 22, 31, 38, 41, 46, 68, 111, 147, 151, 182, 217, 218,

    -

    Finalized

    (1)

    -

    7,

    -

    Find

    (4)

    -

    15, 35, 98,

    -

    Finn

    (1)

    -

    14,

    -

    Finnish

    (3)

    -

    6, 14,

    -

    Firefox

    (2)

    -

    161,

    -

    First

    (2)

    -

    5, 9,

    -

    Fixed

    (1)

    -

    7,

    -

    Flat

    (2)

    -

    23, 178,

    -

    Flexibility

    (1)

    -

    57,

    -

    Flexible

    (2)

    -

    5, 52,

    -

    Following

    (2)

    -

    72, 96,

    -

    Font

    (4)

    -

    14, 83, 93, 97,

    -

    Footnote

    (2)

    -

    133,

    -

    Footnotes

    (11)

    -

    7, 8, 57, 96, 98, 132, 133, 134,

    -

    For

    (97)

    -

    5, 6, 7, 8, 9, 10, 11, 14, 15, 16, 20, 21, 22, 23, 26, 30, 35, 38, 41, 44, 48, 51, 53, 54, 57, 59, 61, 65, 66, 68, 81, 92, 97, 100, 102, 104, 107, 108, 110, 111, 122, 126, 130, 133, 143, 148, 152, 172, 188, 190, 191, 195, 196, 202, 213, 214, 216, 220, 222, 224, 230, 231, 235, 238,

    -

    Form

    (8)

    -

    15, 42, 98, 148, 152, 201, 207, 216,

    -

    Formally

    (1)

    -

    14,

    -

    Format

    (9)

    -

    9, 22, 44, 97, 179, 180, 214, 237,

    -

    Formats

    (25)

    -

    7, 8, 10, 15, 23, 31, 41, 43, 44, 66, 67, 85, 91, 92, 94, 97, 98, 103, 143, 196, 222,

    -

    Formatted

    (3)

    -

    182, 190, 191,

    -

    Formatting

    (1)

    -

    157,

    -

    Formed

    (3)

    -

    55, 178, 220,

    -

    Forms

    (8)

    -

    8, 97, 154, 188, 195, 219, 220,

    -

    Formulae

    (2)

    -

    23, 53,

    -

    Found

    (13)

    -

    35, 95, 97, 200, 213, 216, 217, 218, 228, 235,

    -

    Four

    (1)

    -

    188,

    -

    Fourth

    (1)

    -

    192,

    -

    Fraction

    (1)

    -

    23,

    -

    Framework

    (3)

    -

    6, 105, 106,

    -

    Free

    (1)

    -

    234,

    -

    Frequently

    (1)

    -

    7,

    -

    From

    (31)

    -

    5, 7, 8, 9, 14, 17, 23, 35, 42, 57, 68, 89, 90, 96, 98, 111, 125, 134, 139, 182, 187, 191, 195, 199,

    -

    Front

    (2)

    -

    197, 199,

    -

    Frontend

    (3)

    -

    198, 199, 217,

    -

    Full

    (7)

    -

    74, 144, 146, 147, 149, 192, 196,

    -

    Functionality

    (1)

    -

    199,

    -

    Functions

    (1)

    -

    55,

    -

    Further

    (7)

    -

    16, 30, 59, 114, 193, 199, 201,

    -

    Future

    (3)

    -

    43, 44, 104,

    -

    Galeon

    (1)

    -

    171,

    -

    Gedit

    (1)

    -

    235,

    -

    General

    (1)

    -

    92,

    -

    Generally

    (1)

    -

    7,

    -

    Generate

    (4)

    -

    11, 12, 61, 111,

    -

    Generated

    (11)

    -

    33, 49, 51, 68, 98, 182, 187, 214, 215, 222,

    -

    Generates

    (3)

    -

    90, 178, 180,

    -

    Generating

    (2)

    -

    16, 59,

    -

    Generation

    (6)

    -

    5, 23, 51, 117, 125, 181,

    -

    Generator

    (1)

    -

    9,

    -

    Generically

    (1)

    -

    224,

    -

    Get

    (7)

    -

    14, 17, 60, 66, 139, 149,

    -

    Gets

    (2)

    -

    104, 136,

    -

    Getting

    (1)

    -

    14,

    -

    Git

    (2)

    -

    231,

    -

    Give

    (1)

    -

    14,

    -

    Given

    (5)

    -

    14, 23, 97, 111,

    -

    Gives

    (2)

    -

    9, 66,

    -

    Giving

    (1)

    -

    196,

    -

    Gladly

    (1)

    -

    14,

    -

    Gnu

    (3)

    -

    15, 227, 228,

    -

    Goal

    (2)

    -

    65, 238,

    -

    GPL

    (2)

    -

    204, 210,

    -

    Granting

    (1)

    -

    57,

    -

    Granular

    (3)

    -

    6, 23, 31,

    -

    Granularity

    (2)

    -

    7, 33,

    -

    Graphical

    (1)

    -

    168,

    -

    Greater

    (2)

    -

    18, 140,

    -

    Guarantee

    (1)

    -

    37,

    -

    Guess

    (1)

    -

    14,

    -

    Guts

    (1)

    -

    14,

    -

    Handles

    (1)

    -

    53,

    -

    Handling

    (4)

    -

    15, 105, 226,

    -

    Hard

    (1)

    -

    14,

    -

    Hardware

    (4)

    -

    13, 45, 187, 199,

    -

    Has

    (21)

    -

    8, 14, 15, 54, 65, 93, 97, 117, 122, 157, 190, 195, 197, 235, 238,

    -

    Hash

    (3)

    -

    37, 57,

    -

    Have

    (24)

    -

    8, 11, 14, 22, 23, 44, 54, 96, 99, 103, 136, 149, 159, 182, 196, 213, 228,

    -

    Having

    (2)

    -

    49, 84,

    -

    Header

    (7)

    -

    16, 30, 40, 57, 59,

    -

    Headers

    (5)

    -

    28, 30, 34, 189, 192,

    -

    Heading

    (7)

    -

    16, 30, 59, 99, 100,

    -

    Headings

    (20)

    -

    7, 16, 31, 33, 57, 59, 92, 96, 98, 99, 128, 129, 130, 141, 196,

    -

    Held

    (1)

    -

    22,

    -

    Helps

    (1)

    -

    14,

    -

    Here

    (5)

    -

    14, 97, 187, 199, 220,

    -

    Hierarchy

    (2)

    -

    136, 224,

    -

    High

    (1)

    -

    15,

    -

    Highlighted

    (2)

    -

    64, 71,

    -

    Highlighting

    (4)

    -

    48, 64, 235,

    -

    Himself

    (1)

    -

    14,

    -

    His

    (1)

    -

    14,

    -

    Historical

    (1)

    -

    149,

    -

    Histories

    (1)

    -

    230,

    -

    How

    (2)

    -

    15, 56,

    -

    However

    (3)

    -

    23, 149, 238,

    -

    Html

    (55)

    -

    8, 11, 17, 18, 22, 23, 25, 29, 31, 32, 35, 44, 51, 55, 65, 66, 68, 73, 74, 93, 94, 97, 98, 102, 108, 126, 134, 139, 140, 143, 144, 145, 147, 151, 153, 154, 158, 160, 176, 182, 192, 195, 196, 200, 214, 222,

    -

    HTML

    (2)

    -

    7, 214,

    -

    Human

    (6)

    -

    8, 11, 25, 29, 92, 107,

    -

    Hundreds

    (1)

    -

    213,

    -

    Hyper

    (4)

    -

    98, 130, 134, 222,

    -

    Hyperesteier

    (1)

    -

    33,

    -

    Hyperlinking

    (1)

    -

    131,

    -

    IBM

    (1)

    -

    213,

    -

    Idea

    (1)

    -

    9,

    -

    Identical

    (1)

    -

    98,

    -

    Identifiable

    (1)

    -

    190,

    -

    Identified

    (2)

    -

    49, 84,

    -

    Identifies

    (1)

    -

    222,

    -

    Identify

    (3)

    -

    16, 59, 222,

    -

    IEC

    (1)

    -

    179,

    -

    Ignored

    (3)

    -

    14, 188, 197,

    -

    Image

    (3)

    -

    95, 100, 110,

    -

    Images

    (13)

    -

    7, 31, 50, 53, 57, 85, 92, 96, 99, 116, 122, 187,

    -

    Implicit

    (5)

    -

    16, 19, 31, 59, 141,

    -

    Import

    (1)

    -

    42,

    -

    Important

    (1)

    -

    84,

    -

    Impressive

    (1)

    -

    15,

    -

    Inclination

    (1)

    -

    14,

    -

    Include

    (5)

    -

    7, 31, 57, 68, 228,

    -

    Includes

    (1)

    -

    200,

    -

    Including

    (12)

    -

    8, 9, 14, 18, 20, 22, 85, 100, 140, 160, 189, 198,

    -

    Inclusion

    (1)

    -

    125,

    -

    Inclusions

    (1)

    -

    50,

    -

    Incomplete

    (1)

    -

    83,

    -

    Incorporate

    (1)

    -

    238,

    -

    Incorporated

    (1)

    -

    196,

    -

    Increment

    (1)

    -

    133,

    -

    Incremental

    (1)

    -

    99,

    -

    Indeed

    (2)

    -

    57, 200,

    -

    Indent

    (1)

    -

    85,

    -

    Independent

    (2)

    -

    98,

    -

    Independently

    (1)

    -

    98,

    -

    Index

    (7)

    -

    95, 151, 214, 216, 221, 222,

    -

    Indexed

    (2)

    -

    103, 196,

    -

    Indexing

    (2)

    -

    33,

    -

    Individual

    (7)

    -

    31, 103, 136, 190, 196, 226, 231,

    -

    Individually

    (1)

    -

    30,

    -

    Industry

    (1)

    -

    31,

    -

    Influenced

    (1)

    -

    25,

    -

    Info

    (1)

    -

    68,

    -

    Information

    (43)

    -

    5, 6, 8, 14, 16, 22, 30, 33, 34, 57, 59, 100, 107, 108, 123, 125, 126, 186, 187, 190, 194, 196, 200,

    -

    Infrastructure

    (1)

    -

    197,

    -

    Initial

    (1)

    -

    61,

    -

    Inner

    (1)

    -

    14,

    -

    Innovation

    (2)

    -

    205, 211,

    -

    Innovations

    (1)

    -

    213,

    -

    Input

    (7)

    -

    29, 44, 68, 187, 220, 238,

    -

    Installed

    (1)

    -

    49,

    -

    Instance

    (2)

    -

    14, 39,

    -

    Instead

    (2)

    -

    14, 96,

    -

    Institutionally

    (1)

    -

    31,

    -

    Instruction

    (2)

    -

    57, 226,

    -

    Instructions

    (3)

    -

    30, 57, 235,

    -

    Integrated

    (1)

    -

    37,

    -

    Integration

    (1)

    -

    227,

    -

    Intended

    (2)

    -

    98, 238,

    -

    Interest

    (2)

    -

    8, 213,

    -

    Interesting

    (1)

    -

    188,

    -

    Interfere

    (1)

    -

    98,

    -

    Intermediate

    (3)

    -

    22,

    -

    International

    (2)

    -

    63, 70,

    -

    Internet

    (1)

    -

    165,

    -

    Into

    (7)

    -

    16, 22, 23, 42, 59, 151, 187,

    -

    Introduced

    (6)

    -

    104, 182, 187, 222,

    -

    Invaluable

    (1)

    -

    92,

    -

    Invention

    (2)

    -

    205, 211,

    -

    Investigation

    (1)

    -

    15,

    -

    ISO

    (1)

    -

    179,

    -

    Italics

    (3)

    -

    30, 83, 85,

    -

    Its

    (13)

    -

    15, 16, 35, 49, 54, 57, 59, 61, 97, 190,

    -

    July

    (1)

    -

    187,

    -

    June

    (1)

    -

    182,

    -

    Just

    (8)

    -

    14, 33, 66, 95, 103, 195, 196,

    -

    Kate

    (1)

    -

    235,

    -

    Kazehakase

    (1)

    -

    162,

    -

    Keep

    (3)

    -

    8, 65, 85,

    -

    Keeping

    (1)

    -

    238,

    -

    Kept

    (3)

    -

    83, 84, 99,

    -

    Kind

    (1)

    -

    14,

    -

    Kinds

    (1)

    -

    14,

    -

    Know

    (3)

    -

    35, 152,

    -

    Knowledge

    (2)

    -

    9, 55,

    -

    Known

    (1)

    -

    15,

    -

    Konqueror

    (1)

    -

    163,

    -

    Kwrite

    (1)

    -

    235,

    -

    Laid

    (1)

    -

    68,

    -

    Landscape

    (8)

    -

    68, 75, 97, 140, 181, 182,

    -

    Language

    (5)

    -

    50, 205, 206, 211, 212,

    -

    Languages

    (1)

    -

    15,

    -

    Large

    (4)

    -

    14, 15, 90, 213,

    -

    Largely

    (6)

    -

    22, 55, 57, 157, 197,

    -

    Larger

    (2)

    -

    46, 197,

    -

    Last

    (1)

    -

    111,

    -

    Lasts

    (1)

    -

    14,

    -

    Later

    (1)

    -

    187,

    -

    Latex

    (24)

    -

    7, 8, 11, 16, 18, 22, 31, 44, 51, 55, 59, 68, 140, 141, 143, 181, 182, 200, 228,

    -

    Law

    (4)

    -

    8, 15, 23, 53,

    -

    Layout

    (1)

    -

    57,

    -

    Leading

    (1)

    -

    15,

    -

    Legal

    (3)

    -

    8, 23, 53,

    -

    Length

    (5)

    -

    74, 144, 146, 147, 149,

    -

    Less

    (2)

    -

    11, 110,

    -

    Letter

    (2)

    -

    14,

    -

    Level

    (14)

    -

    14, 16, 23, 31, 33, 39, 59, 194, 232,

    -

    Levels

    (5)

    -

    16, 30, 57, 59, 85,

    -

    Lex

    (4)

    -

    15, 23, 97,

    -

    Libraries

    (1)

    -

    14,

    -

    Life

    (2)

    -

    14,

    -

    Lighter

    (1)

    -

    168,

    -

    Like

    (6)

    -

    28, 33, 51, 95, 159,

    -

    Limitations

    (1)

    -

    23,

    -

    Limited

    (2)

    -

    84, 85,

    -

    Line

    (1)

    -

    6,

    -

    Lines

    (1)

    -

    14,

    -

    Linked

    (5)

    -

    68, 98, 134, 222,

    -

    Linking

    (1)

    -

    130,

    -

    Links

    (3)

    -

    196, 214, 216,

    -

    Links2

    (1)

    -

    174,

    -

    Linux

    (5)

    -

    15, 203, 209, 227, 228,

    -

    List

    (1)

    -

    222,

    -

    Listed

    (1)

    -

    8,

    -

    Literature

    (2)

    -

    28, 53,

    -

    Little

    (7)

    -

    11, 14, 16, 59, 136, 197,

    -

    Live

    (2)

    -

    95, 196,

    -

    Load

    (2)

    -

    121,

    -

    Loading

    (2)

    -

    152, 186,

    -

    Loads

    (2)

    -

    22, 194,

    -

    Locally

    (3)

    -

    49, 50,

    -

    Located

    (1)

    -

    97,

    -

    Locating

    (1)

    -

    7,

    -

    Location

    (1)

    -

    103,

    -

    Locations

    (12)

    -

    7, 36, 196, 200, 213, 214, 216, 217, 222,

    -

    Logical

    (1)

    -

    194,

    -

    Long

    (3)

    -

    149, 159,

    -

    Longer

    (1)

    -

    152,

    -

    Look

    (4)

    -

    16, 33, 59, 98,

    -

    Looking

    (3)

    -

    65, 136, 152,

    -

    Loose

    (1)

    -

    14,

    -

    Lout

    (5)

    -

    8, 11, 22, 143, 182,

    -

    Low

    (1)

    -

    65,

    -

    Machine

    (4)

    -

    32, 49, 91, 92,

    -

    Made

    (8)

    -

    14, 16, 22, 57, 59, 110, 121, 215,

    -

    Mail

    (1)

    -

    25,

    -

    Main

    (1)

    -

    42,

    -

    Maintained

    (1)

    -

    14,

    -

    Maintenance

    (1)

    -

    157,

    -

    Make

    (3)

    -

    8, 65, 196,

    -

    Makes

    (9)

    -

    8, 9, 14, 38, 57, 90, 196, 199, 228,

    -

    Making

    (5)

    -

    8, 31, 37, 105, 222,

    -

    Man

    (2)

    -

    14, 32,

    -

    Managed

    (1)

    -

    223,

    -

    Management

    (5)

    -

    35, 47, 109, 110, 213,

    -

    Manifests

    (1)

    -

    31,

    -

    Manner

    (1)

    -

    92,

    -

    Manually

    (2)

    -

    99, 133,

    -

    Many

    (10)

    -

    12, 14, 15, 16, 40, 59, 143, 159, 226, 230,

    -

    Map

    (2)

    -

    195, 221,

    -

    Mapped

    (2)

    -

    35, 99,

    -

    Mapping

    (1)

    -

    92,

    -

    Mappings

    (1)

    -

    92,

    -

    Margins

    (1)

    -

    98,

    -

    Marked

    (7)

    -

    26, 42, 46, 68, 90, 234,

    -

    Markers

    (1)

    -

    100,

    -

    Markup

    (54)

    -

    6, 7, 8, 10, 11, 14, 16, 17, 22, 25, 26, 28, 29, 30, 38, 40, 48, 49, 57, 58, 59, 60, 62, 63, 64, 65, 66, 67, 69, 83, 85, 111, 139, 187, 194, 195,

    -

    Marshal

    (1)

    -

    220,

    -

    Master

    (2)

    -

    42, 50,

    -

    Match

    (3)

    -

    30, 214, 215,

    -

    Matched

    (6)

    -

    200, 213,

    -

    Matches

    (4)

    -

    7, 200, 213, 214,

    -

    Matching

    (5)

    -

    33, 203, 208, 209, 238,

    -

    Material

    (2)

    -

    5, 9,

    -

    Mature

    (2)

    -

    55,

    -

    May

    (36)

    -

    7, 14, 29, 30, 35, 40, 41, 43, 44, 49, 50, 86, 98, 99, 103, 107, 133, 136, 148, 213, 214, 215, 222, 235,

    -

    Md5

    (2)

    -

    31, 37,

    -

    Meaning

    (3)

    -

    6, 14,

    -

    Meaningful

    (3)

    -

    14, 32, 38,

    -

    Means

    (6)

    -

    7, 14, 90, 97, 107, 222,

    -

    Meet

    (2)

    -

    213,

    -

    Meeting

    (1)

    -

    213,

    -

    Mentioned

    (1)

    -

    228,

    -

    Mercatoria

    (4)

    -

    15, 23, 97,

    -

    Mercurial

    (1)

    -

    231,

    -

    Messaging

    (1)

    -

    25,

    -

    Met

    (1)

    -

    7,

    -

    Meta

    (6)

    -

    8, 34, 105, 106, 107,

    -

    Metadata

    (3)

    -

    5, 108, 126,

    -

    Metatags

    (2)

    -

    108, 125,

    -

    Method

    (1)

    -

    125,

    -

    Mind

    (1)

    -

    87,

    -

    Minimal

    (4)

    -

    15, 22, 26, 238,

    -

    Minimalism

    (1)

    -

    65,

    -

    Minimalist

    (1)

    -

    38,

    -

    Minimalistic

    (4)

    -

    6, 28, 57, 66,

    -

    Minimalistically

    (1)

    -

    68,

    -

    Minimum

    (3)

    -

    8, 16, 59,

    -

    Minor

    (1)

    -

    99,

    -

    Mnemonic

    (3)

    -

    25, 28, 83,

    -

    Mode

    (2)

    -

    214, 215,

    -

    Modified

    (2)

    -

    90, 136,

    -

    Modifiers

    (1)

    -

    83,

    -

    Modify

    (1)

    -

    44,

    -

    Modular

    (2)

    -

    44, 236,

    -

    Module

    (5)

    -

    44, 178, 220, 236, 237,

    -

    Modules

    (4)

    -

    55, 68, 90, 143,

    -

    More

    (20)

    -

    14, 17, 21, 29, 35, 55, 66, 98, 110, 125, 139, 147, 151, 176, 182, 188, 224, 231,

    -

    Most

    (12)

    -

    15, 28, 29, 55, 60, 84, 85, 96, 145, 148, 216,

    -

    Mostly

    (1)

    -

    5,

    -

    Mozilla

    (2)

    -

    161, 164,

    -

    Much

    (6)

    -

    9, 11, 16, 59, 95, 125,

    -

    Multiple

    (13)

    -

    6, 9, 10, 26, 31, 50, 67, 92, 94, 138, 140, 178, 226,

    -

    Must

    (3)

    -

    14, 57, 125,

    -

    Name

    (7)

    -

    14, 111, 117, 121, 224,

    -

    Named

    (3)

    -

    130, 147, 151,

    -

    Names

    (1)

    -

    213,

    -

    Natural

    (1)

    -

    110,

    -

    Naturally

    (1)

    -

    190,

    -

    Navigate

    (1)

    -

    88,

    -

    Nbsp

    (34)

    -

    112,

    -

    Near

    (2)

    -

    108, 126,

    -

    Necessarily

    (2)

    -

    98, 149,

    -

    Necessary

    (2)

    -

    15, 238,

    -

    Need

    (9)

    -

    8, 12, 15, 40, 44, 46, 51, 236, 237,

    -

    Needed

    (3)

    -

    16, 59, 194,

    -

    Needs

    (2)

    -

    41, 201,

    -

    Net

    (2)

    -

    49, 50,

    -

    Netscape

    (1)

    -

    166,

    -

    New

    (11)

    -

    41, 43, 44, 57, 90, 220, 236, 237, 238,

    -

    Next

    (1)

    -

    63,

    -

    Nicknamed

    (1)

    -

    14,

    -

    Not

    (21)

    -

    7, 14, 16, 23, 53, 57, 59, 90, 93, 96, 98, 99, 104, 122, 133, 149, 237,

    -

    Note

    (3)

    -

    104, 207, 213,

    -

    Noted

    (1)

    -

    228,

    -

    Notes

    (1)

    -

    27,

    -

    November

    (1)

    -

    5,

    -

    Now

    (1)

    -

    15,

    -

    Number

    (19)

    -

    8, 15, 16, 48, 59, 97, 100, 125, 132, 133, 190, 194, 213, 214, 216, 235,

    -

    Numbered

    (10)

    -

    7, 35, 57, 98, 100, 104, 130, 133,

    -

    Numbering

    (35)

    -

    7, 9, 15, 32, 91, 92, 96, 97, 98, 100, 101, 102, 103, 104, 129, 131, 186, 196, 198, 199, 200,

    -

    Numbers

    (12)

    -

    33, 57, 96, 98, 99, 103, 133, 196, 202, 214, 215, 222,

    -

    Nurmi

    (1)

    -

    14,

    -

    Object

    (40)

    -

    7, 9, 23, 31, 32, 33, 36, 37, 57, 91, 92, 98, 99, 100, 102, 103, 104, 186, 190, 194, 195, 196, 198, 199, 200, 202, 213, 214, 215, 218,

    -

    Objects

    (12)

    -

    7, 35, 57, 92, 103, 194, 196, 200, 213,

    -

    Obtained

    (1)

    -

    125,

    -

    Obviating

    (1)

    -

    40,

    -

    Occasionally

    (1)

    -

    152,

    -

    Occurrences

    (1)

    -

    216,

    -

    OCN

    (2)

    -

    214,

    -

    Ocn

    (3)

    -

    32, 55, 104,

    -

    October

    (1)

    -

    182,

    -

    ODF

    (7)

    -

    11, 16, 18, 31, 51, 59, 179,

    -

    ODT

    (1)

    -

    179,

    -

    Old

    (2)

    -

    8, 230,

    -

    Older

    (1)

    -

    176,

    -

    Oldest

    (2)

    -

    8, 23,

    -

    Omitted

    (1)

    -

    214,

    -

    Once

    (5)

    -

    10, 30, 40, 41, 97,

    -

    One

    (15)

    -

    8, 15, 23, 61, 68, 143, 148, 188, 189, 190, 192, 200,

    -

    Ones

    (1)

    -

    57,

    -

    Oneself

    (1)

    -

    14,

    -

    Only

    (13)

    -

    9, 14, 23, 41, 46, 51, 68, 96, 134, 148, 152, 196, 201,

    -

    Open

    (4)

    -

    31, 179, 180,

    -

    Opendocument

    (6)

    -

    7, 11, 16, 18, 59, 214,

    -

    Opening

    (1)

    -

    213,

    -

    Opera

    (1)

    -

    167,

    -

    Operation

    (1)

    -

    132,

    -

    Operations

    (1)

    -

    49,

    -

    Opportunity

    (1)

    -

    68,

    -

    Opposed

    (1)

    -

    7,

    -

    Optimism

    (1)

    -

    14,

    -

    Optional

    (3)

    -

    22, 28, 57,

    -

    Options

    (1)

    -

    216,

    -

    Order

    (1)

    -

    111,

    -

    Organised

    (1)

    -

    35,

    -

    Oriented

    (3)

    -

    6, 32, 182,

    -

    Original

    (1)

    -

    70,

    -

    Originally

    (1)

    -

    15,

    -

    Other

    (20)

    -

    8, 14, 23, 42, 57, 92, 104, 140, 143, 159, 186, 187, 189, 194, 200, 219, 220, 222,

    -

    Others

    (2)

    -

    31, 55,

    -

    Out

    (3)

    -

    60, 68, 159,

    -

    Outburst

    (1)

    -

    14,

    -

    Outline

    (1)

    -

    4,

    -

    Output

    (68)

    -

    7, 8, 15, 16, 18, 19, 23, 31, 33, 35, 41, 43, 44, 55, 59, 61, 63, 65, 66, 67, 68, 72, 85, 88, 90, 91, 92, 94, 97, 98, 102, 110, 111, 134, 137, 140, 141, 143, 176, 180, 181, 188, 192, 195, 196, 214, 216, 224, 237,

    -

    Outputs

    (18)

    -

    6, 20, 22, 23, 26, 31, 32, 33, 49, 50, 138, 154, 182, 200,

    -

    Outside

    (1)

    -

    14,

    -

    Over

    (1)

    -

    119,

    -

    Overridden

    (1)

    -

    113,

    -

    Override

    (1)

    -

    99,

    -

    Own

    (4)

    -

    14, 96, 98, 113,

    -

    Owner

    (1)

    -

    111,

    -

    Paavo

    (1)

    -

    14,

    -

    Page

    (2)

    -

    93, 97,

    -

    Pages

    (3)

    -

    97, 182,

    -

    Paper

    (3)

    -

    8, 32, 182,

    -

    Paragraph

    (27)

    -

    7, 9, 15, 16, 31, 33, 59, 91, 96, 97, 98, 100, 102, 190, 191, 215, 218, 222,

    -

    Paragraphs

    (15)

    -

    7, 16, 31, 33, 57, 59, 92, 96, 97, 98, 99, 141, 200, 213,

    -

    Parameters

    (1)

    -

    238,

    -

    Paraphrase

    (1)

    -

    213,

    -

    Parchment

    (1)

    -

    149,

    -

    Parsed

    (1)

    -

    29,

    -

    Part

    (7)

    -

    5, 40, 44, 98, 99, 111, 122,

    -

    Particular

    (3)

    -

    15, 23, 125,

    -

    Particularly

    (2)

    -

    7, 97,

    -

    Parts

    (10)

    -

    14, 16, 42, 59, 141, 194,

    -

    Passes

    (1)

    -

    201,

    -

    Passion

    (1)

    -

    14,

    -

    Patience

    (1)

    -

    14,

    -

    Pattern

    (1)

    -

    30,

    -

    PDF

    (10)

    -

    7, 31, 32, 35, 44, 51, 181, 182, 200,

    -

    Pdf

    (24)

    -

    8, 16, 18, 22, 55, 59, 68, 75, 76, 94, 97, 102, 134, 140, 141, 143, 181, 182, 192, 214, 228,

    -

    PDFs

    (1)

    -

    31,

    -

    Pdfs

    (1)

    -

    182,

    -

    Peace

    (1)

    -

    28,

    -

    People

    (1)

    -

    91,

    -

    Per

    (4)

    -

    14, 99, 136, 230,

    -

    Performed

    (1)

    -

    7,

    -

    Perl

    (1)

    -

    15,

    -

    Permit

    (2)

    -

    14, 34,

    -

    Permits

    (3)

    -

    42, 90, 125,

    -

    Perseverance

    (1)

    -

    14,

    -

    Personal

    (2)

    -

    231, 238,

    -

    Pet

    (1)

    -

    14,

    -

    Pgsql

    (3)

    -

    102, 143, 188,

    -

    Philosophy

    (1)

    -

    65,

    -

    PHP

    (2)

    -

    8, 68,

    -

    Pinpoint

    (1)

    -

    8,

    -

    Pinpointing

    (2)

    -

    9, 97,

    -

    Place

    (3)

    -

    9, 15, 197,

    -

    Placed

    (4)

    -

    5, 98, 113,

    -

    Placing

    (1)

    -

    224,

    -

    Plain

    (2)

    -

    7, 214,

    -

    Plaintext

    (3)

    -

    31, 49, 51,

    -

    Platform

    (1)

    -

    15,

    -

    Point

    (2)

    -

    134,

    -

    Points

    (1)

    -

    8,

    -

    Populate

    (1)

    -

    7,

    -

    Populated

    (2)

    -

    31, 33,

    -

    Populating

    (1)

    -

    186,

    -

    Portability

    (1)

    -

    8,

    -

    Portrait

    (8)

    -

    68, 76, 97, 140, 181, 182,

    -

    Position

    (1)

    -

    96,

    -

    Positioning

    (4)

    -

    9, 91, 92,

    -

    Possibilities

    (6)

    -

    18, 19, 31, 140, 141, 196,

    -

    Possibility

    (6)

    -

    9, 14, 36, 124, 157, 193,

    -

    Possible

    (13)

    -

    8, 9, 15, 31, 33, 37, 42, 57, 99, 137, 195, 196, 222,

    -

    Possibly

    (1)

    -

    14,

    -

    Posted

    (1)

    -

    49,

    -

    PostgreSQL

    (12)

    -

    8, 11, 22, 31, 55, 68, 81, 97, 186, 187, 198, 228,

    -

    Postgresql

    (6)

    -

    13, 32, 45, 140, 187, 199,

    -

    Posting

    (1)

    -

    125,

    -

    Postscript

    (3)

    -

    16, 59, 141,

    -

    Potentially

    (1)

    -

    8,

    -

    Powerful

    (1)

    -

    196,

    -

    Practical

    (1)

    -

    159,

    -

    Practice

    (1)

    -

    11,

    -

    Practices

    (1)

    -

    25,

    -

    Pre

    (4)

    -

    14, 42, 220,

    -

    Precedence

    (3)

    -

    119, 120, 136,

    -

    Precise

    (1)

    -

    196,

    -

    Precisely

    (2)

    -

    147, 151,

    -

    Predating

    (1)

    -

    5,

    -

    Prefer

    (1)

    -

    238,

    -

    Preference

    (2)

    -

    231, 238,

    -

    Preparation

    (3)

    -

    5, 35, 66,

    -

    Prepare

    (1)

    -

    49,

    -

    Prepared

    (5)

    -

    28, 35, 41, 49, 234,

    -

    Present

    (2)

    -

    110, 201,

    -

    Presentation

    (3)

    -

    8, 90,

    -

    Presentations

    (7)

    -

    8, 41, 92, 100, 143, 144, 146,

    -

    Pretty

    (1)

    -

    196,

    -

    Previous

    (1)

    -

    110,

    -

    Previously

    (1)

    -

    9,

    -

    Primarily

    (2)

    -

    31, 65,

    -

    Principles

    (2)

    -

    147, 151,

    -

    Printed

    (1)

    -

    97,

    -

    Prior

    (3)

    -

    14, 125, 220,

    -

    Problem

    (1)

    -

    220,

    -

    Process

    (2)

    -

    41, 42,

    -

    Processable

    (1)

    -

    107,

    -

    Processed

    (3)

    -

    49, 50,

    -

    Processes

    (1)

    -

    22,

    -

    Processing

    (8)

    -

    22, 30, 57, 220, 225, 226,

    -

    Processor

    (2)

    -

    12, 51,

    -

    Produce

    (8)

    -

    7, 15, 22, 49, 50, 182, 220,

    -

    Produced

    (9)

    -

    10, 31, 41, 44, 46, 68, 98, 136, 145,

    -

    Produces

    (7)

    -

    22, 31, 72, 90, 97, 128, 222,

    -

    Producing

    (1)

    -

    23,

    -

    Professional

    (3)

    -

    16, 59, 141,

    -

    Program

    (8)

    -

    14, 15, 44, 57, 61, 90, 182, 220,

    -

    Programming

    (1)

    -

    15,

    -

    Programs

    (1)

    -

    228,

    -

    Project

    (4)

    -

    15, 39, 197,

    -

    Projects

    (2)

    -

    8, 15,

    -

    Proof

    (1)

    -

    55,

    -

    Proofing

    (2)

    -

    43, 44,

    -

    Proved

    (1)

    -

    22,

    -

    Provide

    (4)

    -

    40, 103, 200, 230,

    -

    Provided

    (22)

    -

    14, 16, 21, 22, 57, 59, 63, 68, 95, 98, 108, 126, 132, 133, 148, 152, 178, 196, 214, 235,

    -

    Provides

    (11)

    -

    7, 33, 57, 67, 85, 97, 107, 111, 125, 224, 230,

    -

    Providing

    (6)

    -

    16, 57, 59, 98, 107, 182,

    -

    Provisional

    (1)

    -

    14,

    -

    Prune

    (1)

    -

    187,

    -

    Publication

    (3)

    -

    8, 14, 23,

    -

    Publish

    (1)

    -

    8,

    -

    Published

    (3)

    -

    7, 57, 215,

    -

    Publisher

    (1)

    -

    99,

    -

    Publishing

    (8)

    -

    5, 6, 15, 57, 84, 85, 86, 92,

    -

    Purposes

    (2)

    -

    57, 107,

    -

    Quality

    (1)

    -

    15,

    -

    Query

    (1)

    -

    199,

    -

    Quite

    (1)

    -

    190,

    -

    Rajanen

    (1)

    -

    14,

    -

    Ralph

    (3)

    -

    [H]1, 37, 112,

    -

    Range

    (3)

    -

    15, 54, 137,

    -

    Rather

    (7)

    -

    9, 23, 98, 111, 117, 231,

    -

    RCS

    (3)

    -

    37, 230, 231,

    -

    Rdiff

    (1)

    -

    232,

    -

    Read

    (3)

    -

    82, 88, 195,

    -

    Readable

    (5)

    -

    8, 11, 25, 29, 107,

    -

    Reader

    (1)

    -

    183,

    -

    Really

    (1)

    -

    14,

    -

    Reason

    (1)

    -

    14,

    -

    Reasonable

    (1)

    -

    65,

    -

    Reasons

    (2)

    -

    41, 148,

    -

    Recent

    (2)

    -

    182, 231,

    -

    Records

    (1)

    -

    97,

    -

    Reduced

    (2)

    -

    16, 59,

    -

    Reducing

    (3)

    -

    16, 40, 59,

    -

    Refer

    (1)

    -

    200,

    -

    Reference

    (7)

    -

    7, 32, 97, 134, 149, 214,

    -

    Referenced

    (6)

    -

    7, 57, 96, 98, 191,

    -

    References

    (3)

    -

    14, 95, 122,

    -

    Referring

    (1)

    -

    125,

    -

    Refers

    (1)

    -

    14,

    -

    Regard

    (1)

    -

    100,

    -

    Regardless

    (3)

    -

    97, 98, 101,

    -

    Regards

    (2)

    -

    187, 199,

    -

    Regular

    (3)

    -

    16, 40, 59,

    -

    Reiserfs

    (2)

    -

    13, 45,

    -

    Related

    (5)

    -

    5, 30, 31, 57, 187,

    -

    Relation

    (4)

    -

    23, 53, 57, 192,

    -

    Relational

    (12)

    -

    8, 9, 20, 23, 45, 103, 104, 186, 187, 194, 196,

    -

    Relatively

    (2)

    -

    82, 238,

    -

    Release

    (4)

    -

    14, 104,

    -

    Released

    (2)

    -

    104, 178,

    -

    Relevant

    (9)

    -

    20, 23, 33, 92, 121, 196, 200, 214, 215,

    -

    Reliably

    (1)

    -

    8,

    -

    Remains

    (5)

    -

    9, 91, 97, 98, 101,

    -

    Remember

    (1)

    -

    82,

    -

    Remind

    (1)

    -

    159,

    -

    Remote

    (4)

    -

    49,

    -

    Removed

    (1)

    -

    14,

    -

    Rendered

    (1)

    -

    176,

    -

    Rendition

    (1)

    -

    214,

    -

    Repeated

    (2)

    -

    8, 41,

    -

    Repository

    (1)

    -

    231,

    -

    Representation

    (5)

    -

    15, 16, 59, 141,

    -

    Representations

    (4)

    -

    29, 41, 43, 44,

    -

    Representing

    (4)

    -

    16, 57, 59,

    -

    Request

    (1)

    -

    66,

    -

    Requests

    (1)

    -

    120,

    -

    Require

    (1)

    -

    28,

    -

    Required

    (17)

    -

    8, 11, 16, 22, 30, 44, 59, 61, 65, 86, 104, 182, 220,

    -

    Requireing

    (1)

    -

    110,

    -

    Requirement

    (2)

    -

    26, 60,

    -

    Requirements

    (4)

    -

    8, 54, 65, 238,

    -

    Requiring

    (3)

    -

    16, 59, 92,

    -

    Resource

    (2)

    -

    105, 106,

    -

    Result

    (2)

    -

    103, 216,

    -

    Resulted

    (1)

    -

    8,

    -

    Results

    (12)

    -

    20, 95, 103, 195, 196, 200, 213, 216, 217, 218,

    -

    Retain

    (1)

    -

    125,

    -

    Retained

    (2)

    -

    187, 188,

    -

    Retaining

    (5)

    -

    8, 16, 59, 186, 187,

    -

    Return

    (1)

    -

    196,

    -

    Returned

    (1)

    -

    7,

    -

    Richard

    (2)

    -

    204, 210,

    -

    Richer

    (1)

    -

    195,

    -

    Richly

    (1)

    -

    68,

    -

    Rights

    (1)

    -

    57,

    -

    Rise

    (1)

    -

    196,

    -

    Roughly

    (3)

    -

    14, 33, 92,

    -

    Rss

    (1)

    -

    14,

    -

    Rsync

    (1)

    -

    49,

    -

    Ruby

    (5)

    -

    15, 44, 51, 182, 220,

    -

    Rudimentary

    (4)

    -

    128, 201, 221, 222,

    -

    Rule

    (2)

    -

    147, 151,

    -

    Rules

    (3)

    -

    44, 99, 113,

    -

    Run

    (5)

    -

    15, 44, 49, 111, 125,

    -

    Running

    (1)

    -

    50,

    -

    Sabaki

    (2)

    -

    14,

    -

    Said

    (1)

    -

    213,

    -

    Same

    (16)

    -

    29, 91, 97, 98, 101, 102, 103, 117, 121, 182, 187, 196, 208,

    -

    Sample

    (4)

    -

    198, 199, 201,

    -

    Saved

    (2)

    -

    147, 151,

    -

    Saving

    (1)

    -

    182,

    -

    Sax

    (1)

    -

    78,

    -

    SAX

    (2)

    -

    178,

    -

    Saying

    (2)

    -

    16, 59,

    -

    Says

    (1)

    -

    14,

    -

    Scalability

    (2)

    -

    13, 45,

    -

    Scalable

    (2)

    -

    187, 199,

    -

    Scale

    (2)

    -

    90, 197,

    -

    Scaling

    (2)

    -

    187, 199,

    -

    Scholarly

    (1)

    -

    8,

    -

    Scp

    (1)

    -

    49,

    -

    Screen

    (2)

    -

    32, 182,

    -

    Scribbler

    (2)

    -

    14,

    -

    Scribe

    (2)

    -

    14,

    -

    Scroll

    (4)

    -

    146, 149,

    -

    Scrollable

    (1)

    -

    147,

    -

    Scrolling

    (1)

    -

    149,

    -

    Search

    (41)

    -

    5, 6, 7, 16, 20, 31, 33, 35, 55, 57, 59, 95, 103, 186, 195, 196, 198, 199, 201, 202, 213, 216, 217, 218, 222,

    -

    Searchable

    (2)

    -

    33, 190,

    -

    Searched

    (1)

    -

    216,

    -

    Searches

    (7)

    -

    7, 103, 141, 196, 202, 207, 208,

    -

    Searching

    (3)

    -

    190, 191, 194,

    -

    Second

    (1)

    -

    22,

    -

    Section

    (7)

    -

    5, 63, 98, 108, 126,

    -

    Secure

    (1)

    -

    46,

    -

    See

    (4)

    -

    5, 21, 68, 95,

    -

    Seek

    (1)

    -

    14,

    -

    Seems

    (1)

    -

    65,

    -

    Segment

    (2)

    -

    149, 152,

    -

    Segmented

    (10)

    -

    73, 98, 108, 126, 144, 145, 148, 149, 150, 152,

    -

    Segments

    (2)

    -

    151, 152,

    -

    Select

    (1)

    -

    216,

    -

    Selected

    (1)

    -

    137,

    -

    Semantic

    (12)

    -

    8, 14, 30, 34, 107, 189, 238,

    -

    Sensitive

    (1)

    -

    207,

    -

    Sent

    (1)

    -

    50,

    -

    Separate

    (2)

    -

    89, 98,

    -

    Separated

    (1)

    -

    90,

    -

    Separately

    (2)

    -

    7, 57,

    -

    Separating

    (1)

    -

    90,

    -

    Sequence

    (1)

    -

    98,

    -

    Sequential

    (2)

    -

    92, 96,

    -

    Sequentially

    (3)

    -

    96, 98, 100,

    -

    Serialized

    (3)

    -

    6, 14,

    -

    Serializes

    (1)

    -

    57,

    -

    Server

    (3)

    -

    49,

    -

    Set

    (13)

    -

    14, 44, 46, 55, 84, 85, 86, 90, 110, 141, 176, 213,

    -

    Setting

    (1)

    -

    97,

    -

    Settings

    (1)

    -

    97,

    -

    Several

    (12)

    -

    8, 14, 54, 65, 84, 143, 144, 158, 160, 196, 216, 232,

    -

    Sha256

    (1)

    -

    31,

    -

    Shallow

    (1)

    -

    178,

    -

    Share

    (2)

    -

    7, 32,

    -

    Shared

    (3)

    -

    49, 50,

    -

    Sheet

    (1)

    -

    153,

    -

    Sheets

    (1)

    -

    42,

    -

    Short

    (3)

    -

    21, 24, 148,

    -

    Should

    (7)

    -

    42, 44, 99, 104, 188, 228,

    -

    Show

    (5)

    -

    216,

    -

    Showing

    (2)

    -

    202, 208,

    -

    Shows

    (1)

    -

    216,

    -

    Similar

    (1)

    -

    113,

    -

    Similarly

    (1)

    -

    97,

    -

    Simple

    (17)

    -

    8, 9, 11, 14, 15, 58, 83, 84, 85, 92, 99, 201, 217, 218, 238,

    -

    Simpler

    (8)

    -

    11, 17, 25, 29, 66, 110, 139,

    -

    Simplicity

    (1)

    -

    65,

    -

    Simply

    (1)

    -

    195,

    -

    Since

    (2)

    -

    5, 197,

    -

    Single

    (8)

    -

    26, 28, 67, 96, 145, 147, 149, 226,

    -

    Sisu

    (12)

    -

    6, 14, 42, 49, 50, 112, 214,

    -

    SiSU

    (73)

    -

    [H]1, [H]2, 5, 6, 7, 8, 9, 11, 12, 14, 15, 16, 22, 23, 37, 38, 48, 49, 53, 54, 57, 59, 66, 83, 85, 92, 97, 106, 111, 125, 128, 132, 136, 154, 178, 180, 182, 187, 194, 196, 198, 200, 214, 215, 222, 224, 226, 228, 234, 235,

    -

    Sisupod

    (1)

    -

    50,

    -

    Site

    (13)

    -

    5, 39, 97, 122, 136, 182, 187, 188, 223, 224,

    -

    Sitemaps

    (1)

    -

    31,

    -

    Sites

    (3)

    -

    23, 49, 195,

    -

    Size

    (2)

    -

    93, 97,

    -

    Sized

    (1)

    -

    7,

    -

    Skin

    (18)

    -

    35, 117, 118, 119, 120, 121, 136,

    -

    Skinnable

    (4)

    -

    39, 135, 136, 137,

    -

    Skins

    (4)

    -

    110, 115, 121,

    -

    Small

    (4)

    -

    14, 44, 61, 199,

    -

    Smaller

    (1)

    -

    152,

    -

    Smallest

    (3)

    -

    16, 59, 194,

    -

    Smart

    (1)

    -

    65,

    -

    Snap

    (1)

    -

    220,

    -

    Softlink

    (1)

    -

    121,

    -

    Software

    (3)

    -

    14, 51, 213,

    -

    Soldier

    (1)

    -

    14,

    -

    Solution

    (1)

    -

    49,

    -

    Some

    (4)

    -

    55, 57, 238,

    -

    Something

    (3)

    -

    14, 236,

    -

    Sorting

    (1)

    -

    216,

    -

    Soul

    (1)

    -

    14,

    -

    Source

    (6)

    -

    29, 37, 49, 50, 125, 182,

    -

    Sparse

    (5)

    -

    17, 29, 60, 85, 139,

    -

    Special

    (1)

    -

    110,

    -

    Specific

    (5)

    -

    93, 116, 117, 120, 121,

    -

    Specified

    (2)

    -

    16, 59,

    -

    Sql

    (5)

    -

    14, 94, 95, 113, 195,

    -

    SQL

    (16)

    -

    7, 8, 16, 18, 22, 23, 31, 33, 35, 59, 140, 141, 200, 201,

    -

    Sqlite

    (2)

    -

    8, 32,

    -

    SQLite

    (13)

    -

    11, 13, 22, 31, 45, 68, 81, 97, 140, 186, 187, 228,

    -

    Sst

    (1)

    -

    37,

    -

    Stallman

    (2)

    -

    204, 210,

    -

    Stamped

    (1)

    -

    37,

    -

    Stamps

    (1)

    -

    231,

    -

    Standard

    (5)

    -

    6, 31, 42, 51, 154,

    -

    Started

    (2)

    -

    5, 15,

    -

    Static

    (1)

    -

    41,

    -

    Statistics

    (1)

    -

    53,

    -

    Steps

    (1)

    -

    220,

    -

    Still

    (4)

    -

    14, 16, 59, 157,

    -

    Stored

    (4)

    -

    97, 103, 194, 196,

    -

    Streamed

    (1)

    -

    188,

    -

    Strength

    (1)

    -

    14,

    -

    Strengths

    (3)

    -

    31, 57, 61,

    -

    Strictly

    (1)

    -

    96,

    -

    Strike

    (1)

    -

    30,

    -

    Strikethrough

    (1)

    -

    83,

    -

    Stripped

    (1)

    -

    194,

    -

    Strong

    (1)

    -

    53,

    -

    Structural

    (10)

    -

    16, 40, 59, 141, 188, 190, 194, 196,

    -

    Structurally

    (1)

    -

    194,

    -

    Structure

    (36)

    -

    8, 14, 16, 30, 35, 40, 57, 59, 98, 100, 101, 113, 141, 143, 178, 186, 187, 195, 223, 224,

    -

    Structured

    (9)

    -

    6, 8, 11, 14, 22, 31, 140, 194,

    -

    Structures

    (2)

    -

    23, 193,

    -

    Structuring

    (8)

    -

    6, 8, 14,

    -

    Stub

    (1)

    -

    121,

    -

    Style

    (1)

    -

    153,

    -

    Styles

    (1)

    -

    137,

    -

    Sub

    (5)

    -

    35, 111, 122, 224,

    -

    Subject

    (2)

    -

    111, 189,

    -

    Subscript

    (2)

    -

    83, 85,

    -

    Subsequent

    (1)

    -

    125,

    -

    Substantive

    (5)

    -

    37, 38, 99, 190,

    -

    Subversion

    (1)

    -

    230,

    -

    Such

    (10)

    -

    15, 57, 68, 100, 125, 148, 151, 152, 231,

    -

    Sudden

    (1)

    -

    14,

    -

    Suggest

    (1)

    -

    222,

    -

    Suitable

    (6)

    -

    7, 8, 23, 53, 220, 224,

    -

    Sum

    (1)

    -

    57,

    -

    Summary

    (3)

    -

    21, 24, 83,

    -

    Sums

    (1)

    -

    57,

    -

    Superior

    (2)

    -

    227, 231,

    -

    Superscript

    (2)

    -

    83, 85,

    -

    Support

    (1)

    -

    233,

    -

    Supported

    (1)

    -

    159,

    -

    Synonyms

    (1)

    -

    14,

    -

    Syntax

    (15)

    -

    22, 25, 28, 44, 48, 64, 71, 82, 83, 235, 238,

    -

    System

    (31)

    -

    5, 6, 7, 9, 13, 14, 20, 32, 35, 37, 45, 91, 92, 96, 97, 98, 110, 125, 142, 195, 196, 199, 231,

    -

    Systems

    (2)

    -

    213, 231,

    -

    Sytax

    (1)

    -

    235,

    -

    Tabbing

    (1)

    -

    159,

    -

    Table

    (10)

    -

    68, 100, 104, 127, 128, 144, 153, 157, 192,

    -

    Tables

    (16)

    -

    7, 8, 15, 23, 33, 57, 85, 92, 96, 99, 104, 143, 176, 188,

    -

    Tag

    (1)

    -

    37,

    -

    Tagging

    (9)

    -

    8, 14, 16, 22, 59, 61, 98, 194, 238,

    -

    Tags

    (6)

    -

    14, 34, 105, 106, 107,

    -

    Tailored

    (2)

    -

    103, 196,

    -

    Take

    (3)

    -

    57, 98, 231,

    -

    Taken

    (4)

    -

    5, 14, 16, 59,

    -

    Takes

    (2)

    -

    31, 120,

    -

    Taking

    (1)

    -

    15,

    -

    Technical

    (1)

    -

    53,

    -

    Technology

    (1)

    -

    41,

    -

    Tell

    (2)

    -

    213,

    -

    Templates

    (1)

    -

    42,

    -

    Tend

    (2)

    -

    231, 232,

    -

    Term

    (2)

    -

    6, 42,

    -

    Tested

    (2)

    -

    158, 160,

    -

    Testing

    (1)

    -

    8,

    -

    Tetex

    (1)

    -

    51,

    -

    Tex

    (1)

    -

    77,

    -

    Texinfo

    (2)

    -

    68, 81,

    -

    Texlive

    (1)

    -

    51,

    -

    Text

    (78)

    -

    7, 8, 9, 23, 30, 31, 35, 36, 42, 48, 57, 63, 70, 90, 91, 92, 93, 95, 96, 97, 98, 99, 102, 103, 108, 126, 146, 147, 149, 150, 151, 152, 172, 190, 191, 192, 194, 195, 196, 200, 208, 214, 215, 216, 218, 222, 224, 234,

    -

    Texts

    (7)

    -

    7, 53, 148, 188, 190, 224,

    -

    Than

    (14)

    -

    11, 17, 23, 25, 29, 55, 66, 117, 139, 176, 182, 231,

    -

    Thanks

    (1)

    -

    44,

    -

    That

    (59)

    -

    7, 8, 9, 11, 14, 15, 16, 17, 23, 32, 33, 37, 49, 51, 59, 61, 65, 90, 91, 92, 97, 98, 99, 100, 101, 104, 107, 111, 117, 122, 125, 139, 159, 182, 195, 196, 199, 207, 213, 231, 237,

    -

    The

    (367)

    -

    5, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 19, 20, 21, 22, 23, 28, 29, 30, 31, 33, 34, 35, 36, 37, 38, 40, 42, 45, 46, 49, 50, 51, 55, 57, 59, 60, 61, 63, 65, 66, 68, 69, 72, 81, 85, 86, 90, 92, 93, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 110, 111, 113, 117, 121, 122, 124, 125, 126, 132, 133, 134, 136, 137, 141, 146, 147, 149, 150, 151, 152, 158, 160, 176, 181, 182, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 199, 200, 201, 207, 208, 213, 214, 215, 216, 217, 220, 222, 224, 228, 230, 231, 232, 234, 238,

    -

    Their

    (17)

    -

    16, 36, 57, 59, 65, 96, 98, 113, 134, 141, 182, 194,

    -

    Them

    (4)

    -

    50, 90, 190, 235,

    -

    Then

    (6)

    -

    8, 16, 59, 136, 181, 200,

    -

    There

    (21)

    -

    11, 14, 15, 22, 35, 43, 44, 65, 68, 99, 104, 114, 117, 121, 136, 159, 178, 193, 213, 220,

    -

    These

    (16)

    -

    7, 8, 31, 50, 68, 81, 98, 114, 141, 200, 222, 231, 235,

    -

    They

    (13)

    -

    7, 57, 96, 98, 113, 147, 151, 182, 188, 191, 230, 231,

    -

    Things

    (3)

    -

    14, 104, 136,

    -

    Third

    (1)

    -

    191,

    -

    This

    (59)

    -

    5, 7, 9, 14, 15, 16, 22, 29, 30, 32, 41, 42, 49, 51, 53, 57, 59, 62, 65, 66, 68, 90, 96, 97, 98, 99, 104, 107, 108, 111, 125, 126, 132, 133, 141, 148, 178, 188, 194, 197, 207, 213, 222,

    -

    Those

    (2)

    -

    208, 228,

    -

    Though

    (9)

    -

    99, 113, 148, 188, 197, 214, 222, 231, 238,

    -

    Thought

    (2)

    -

    14, 23,

    -

    Thousands

    (2)

    -

    90, 213,

    -

    Through

    (4)

    -

    16, 30, 59, 181,

    -

    Tied

    (2)

    -

    7, 57,

    -

    Time

    (6)

    -

    8, 12, 14, 37, 53, 81,

    -

    Times

    (1)

    -

    8,

    -

    Title

    (2)

    -

    14, 189,

    -

    Titled

    (2)

    -

    108, 126,

    -

    Toc

    (2)

    -

    151,

    -

    Today

    (2)

    -

    9, 23,

    -

    Together

    (7)

    -

    15, 50, 55, 103, 187, 196,

    -

    Told

    (1)

    -

    14,

    -

    Tongue

    (1)

    -

    14,

    -

    Too

    (2)

    -

    50, 97,

    -

    Tool

    (3)

    -

    8, 52, 226,

    -

    Tools

    (5)

    -

    33, 51, 227, 230, 232,

    -

    Towards

    (1)

    -

    14,

    -

    Tracked

    (1)

    -

    231,

    -

    Trade

    (1)

    -

    15,

    -

    Transformations

    (2)

    -

    16, 59,

    -

    Transformed

    (2)

    -

    181, 182,

    -

    Transforming

    (2)

    -

    8, 226,

    -

    Treated

    (1)

    -

    96,

    -

    Tree

    (1)

    -

    157,

    -

    Trivial

    (1)

    -

    90,

    -

    Trusted

    (1)

    -

    125,

    -

    Try

    (2)

    -

    60, 65,

    -

    Two

    (2)

    -

    8, 154,

    -

    Type

    (7)

    -

    16, 31, 35, 49, 59, 100, 113,

    -

    Typeface

    (1)

    -

    57,

    -

    Types

    (12)

    -

    9, 15, 16, 31, 59, 68, 84, 137, 140, 143,

    -

    Typesetting

    (4)

    -

    16, 31, 59, 141,

    -

    Typical

    (1)

    -

    28,

    -

    Unchanged

    (1)

    -

    37,

    -

    Under

    (2)

    -

    108, 126,

    -

    Underscore

    (2)

    -

    30, 83,

    -

    Unidroit

    (2)

    -

    147, 151,

    -

    Unified

    (1)

    -

    15,

    -

    Uniform

    (1)

    -

    88,

    -

    Uniformity

    (1)

    -

    238,

    -

    Unique

    (2)

    -

    55, 99,

    -

    Unit

    (3)

    -

    14,

    -

    Units

    (3)

    -

    6, 14,

    -

    Universe

    (4)

    -

    6, 14,

    -

    Unix

    (5)

    -

    6, 51, 227, 228, 230,

    -

    Unless

    (1)

    -

    117,

    -

    Update

    (1)

    -

    44,

    -

    Updated

    (4)

    -

    7, 41, 44,

    -

    Upgradeable

    (2)

    -

    43, 44,

    -

    Upon

    (1)

    -

    66,

    -

    Url

    (3)

    -

    49, 50, 216,

    -

    Usability

    (1)

    -

    87,

    -

    Usable

    (1)

    -

    91,

    -

    Use

    (24)

    -

    14, 16, 34, 37, 51, 54, 59, 92, 93, 96, 99, 105, 106, 141, 196, 199, 222, 228, 231, 232, 234,

    -

    Useable

    (2)

    -

    55, 92,

    -

    Used

    (21)

    -

    16, 22, 23, 32, 35, 40, 45, 59, 92, 99, 100, 111, 113, 117, 122, 214, 231,

    -

    Useful

    (4)

    -

    14, 97, 222, 230,

    -

    User

    (3)

    -

    16, 59, 188,

    -

    Uses

    (1)

    -

    23,

    -

    Using

    (9)

    -

    7, 14, 28, 31, 49, 50, 97, 200, 222,

    -

    UTF

    (2)

    -

    28, 31,

    -

    Utf

    (2)

    -

    49, 234,

    -

    Utilising

    (1)

    -

    198,

    -

    Value

    (1)

    -

    107,

    -

    Values

    (1)

    -

    57,

    -

    Variants

    (1)

    -

    68,

    -

    Varies

    (1)

    -

    97,

    -

    Varieties

    (1)

    -

    14,

    -

    Variety

    (4)

    -

    15, 23, 53, 176,

    -

    Various

    (7)

    -

    8, 14, 22, 32, 41, 148, 224,

    -

    Vary

    (1)

    -

    97,

    -

    Versa

    (1)

    -

    134,

    -

    Verse

    (5)

    -

    7, 33, 57, 92, 99,

    -

    Version

    (29)

    -

    5, 14, 37, 73, 75, 76, 77, 78, 79, 98, 108, 123, 124, 125, 126, 178, 194, 217, 229, 230, 231, 235,

    -

    Versions

    (17)

    -

    14, 50, 81, 97, 102, 103, 125, 136, 145, 176, 178, 190, 191, 192, 196,

    -

    Very

    (8)

    -

    11, 16, 19, 31, 59, 61, 99, 141,

    -

    Via

    (2)

    -

    8, 31,

    -

    Vice

    (1)

    -

    134,

    -

    View

    (3)

    -

    64, 71, 216,

    -

    Viewing

    (1)

    -

    182,

    -

    Vim

    (5)

    -

    64, 71, 235,

    -

    Vis

    (4)

    -

    23, 97, 102,

    -

    Visual

    (1)

    -

    83,

    -

    Vocabularies

    (1)

    -

    36,

    -

    W3m

    (1)

    -

    175,

    -

    War

    (1)

    -

    28,

    -

    Was

    (13)

    -

    5, 9, 15, 22, 53, 57, 104, 110, 215, 216,

    -

    Way

    (8)

    -

    15, 22, 98, 110, 111, 213,

    -

    Ways

    (5)

    -

    14, 57, 187, 226,

    -

    Web

    (11)

    -

    5, 8, 9, 14, 15, 23, 224,

    -

    Weight

    (1)

    -

    168,

    -

    Well

    (6)

    -

    33, 55, 178, 196, 220,

    -

    Went

    (1)

    -

    23,

    -

    Were

    (1)

    -

    110,

    -

    What

    (11)

    -

    14, 16, 57, 59, 98, 149, 152, 200, 216,

    -

    Whatever

    (3)

    -

    45, 49, 65,

    -

    When

    (4)

    -

    57, 90, 104, 213,

    -

    Where

    (16)

    -

    16, 35, 49, 59, 97, 99, 108, 111, 126, 132, 133, 213, 216, 217, 218,

    -

    Wherever

    (1)

    -

    120,

    -

    Whether

    (1)

    -

    32,

    -

    Which

    (49)

    -

    7, 9, 14, 22, 33, 37, 42, 57, 68, 85, 90, 92, 93, 95, 96, 97, 98, 111, 136, 148, 159, 181, 182, 187, 190, 191, 195, 196, 199, 200, 201, 213, 215, 216, 217, 218, 222, 224, 231,

    -

    While

    (3)

    -

    16, 59, 186,

    -

    Whilst

    (2)

    -

    22, 137,

    -

    Who

    (1)

    -

    57,

    -

    Whole

    (2)

    -

    44, 231,

    -

    Wide

    (10)

    -

    14, 15, 23, 39, 53, 54, 115, 136,

    -

    Wider

    (1)

    -

    176,

    -

    Widest

    (1)

    -

    137,

    -

    Wiki

    (1)

    -

    25,

    -

    Will

    (12)

    -

    8, 14, 35, 97, 98, 104, 125, 149, 187,

    -

    Win

    (1)

    -

    14,

    -

    With

    (61)

    -

    5, 7, 9, 11, 12, 14, 15, 22, 23, 31, 33, 35, 37, 50, 55, 64, 87, 96, 97, 98, 100, 103, 114, 117, 136, 178, 187, 190, 192, 194, 196, 197, 199, 213, 216, 217, 218, 224, 226, 235,

    -

    Within

    (26)

    -

    5, 7, 8, 14, 36, 37, 40, 54, 57, 92, 113, 117, 122, 178, 200, 213, 214, 216, 217, 218, 222,

    -

    Without

    (3)

    -

    14, 194, 213,

    -

    Wonderful

    (1)

    -

    15,

    -

    Word

    (7)

    -

    14, 16, 59, 221,

    -

    Wordmap

    (1)

    -

    36,

    -

    Wordmaps

    (2)

    -

    222,

    -

    Words

    (4)

    -

    36, 222,

    -

    Work

    (8)

    -

    8, 15, 33, 49, 110, 111, 137, 201,

    -

    Working

    (3)

    -

    122, 136, 178,

    -

    Works

    (3)

    -

    7, 56,

    -

    Worth

    (1)

    -

    8,

    -

    Would

    (7)

    -

    15, 23, 49, 64, 71, 96,

    -

    Writable

    (1)

    -

    25,

    -

    Write

    (4)

    -

    44, 143, 237, 238,

    -

    Writing

    (2)

    -

    53, 231,

    -

    Writings

    (1)

    -

    23,

    -

    Written

    (2)

    -

    15,

    -

    WTA

    (2)

    -

    151, 152,

    -

    Www

    (2)

    -

    112,

    -

    XFS

    (1)

    -

    45,

    -

    Xhtml

    (1)

    -

    178,

    -

    XHTML

    (2)

    -

    7, 44,

    -

    XML

    (22)

    -

    7, 8, 11, 14, 16, 18, 29, 31, 32, 51, 55, 59, 68, 140, 141, 177, 178, 200, 214, 220,

    -

    Xml

    (5)

    -

    78, 79, 94, 192, 196,

    -

    Xpdf

    (1)

    -

    185,

    -

    Xxi

    (1)

    -

    50,

    -

    Xxii

    (1)

    -

    51,

    -

    YAML

    (3)

    -

    22, 81, 220,

    -

    Yaml

    (2)

    -

    102, 143,

    -

    Years

    (2)

    -

    14, 54,

    -

    You

    (31)

    -

    9, 11, 12, 14, 17, 35, 44, 49, 51, 66, 98, 139, 149, 152, 159, 200, 213, 215, 234, 238,

    -

    Your

    (9)

    -

    7, 13, 45, 49, 213, 234,

    -

    Zip

    (1)

    -

    50,

    - - - - - - -
    - -
    -

    - - SiSU - -

    -
    - -

    - Output generated by - - SiSU - - 0.59.1 2007-09-25 (2007w39/2) -
    - - SiSU - - Copyright © Ralph Amissah - 1997, current 2007. - All Rights Reserved. -
    - - SiSU - - is software for document structuring, publishing and search, -
    - - www.jus.uio.no/sisu - - and - - www.sisudoc.org - -
    - w3 since October 3 1993 - - ralph@amissah.com - -

    -
    -

    - SiSU using: -
    Standard SiSU markup syntax, -
    Standard SiSU meta-markup syntax, and the -
    Standard SiSU object citation numbering and system, (object/text positioning system) -
    - Copyright © Ralph Amissah 1997, current 2007. - All Rights Reserved. -

    -
    -

    - - GPLv3 - -

    -
    -

    - - SiSU - - is released under - GPLv3 - or later, - <http://www.gnu.org/licenses/gpl.html> -

    -
    -

    - SiSU, developed using - - Ruby - - on - - Debian/Gnu/Linux - - software infrastructure, - with the usual GPL (or OSS) suspects. -
    - Better - "performance, reliability, scalability, security & total cost of ownership" - [not to mention flexibility & choice] use of and adherence to open standards (where practical and fair) and it is software libre. -
    - Get With the Future - - Way Better! - -

    -
    - \ No newline at end of file diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/digest.txt b/data/doc/manuals_generated/sisu_manual/sisu_description/digest.txt deleted file mode 100644 index 484b0826..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/digest.txt +++ /dev/null @@ -1,429 +0,0 @@ -SiSU - Description -Ralph Amissah -sisu_description.sst ----------------------------------------------- -SiSU Document Content Certificate (Digest/DCC) ----------------------------------------------- - md5 digests ------------- -Sourcefile digest: b89ccdad9f6d9c2260d8d383d6b35ccc - source filename: sisu_description.sst -available outputs: http://www.jus.uio.no/sisu/sisu_manual/sisu_description/sisu_manifest.html - time generated: Tue Sep 25 02:54:06 +0100 2007 - SiSU version used: SiSU 0.59.1 of 2007w39/2 (2007-09-25) - Ruby version used: ruby 1.8.6 (2007-06-07 patchlevel 36) [i486-linux] ------------- -Document Digest Tree (from dal): -level (if any), ocn: digest clean (no markup/notes), digest all (includes markup & endnotes) - [endnote number] endnote digest clean - - :A - 1 1a817535f7ea385a7a6934aa8b17cb6a 23b285e3bbc81587637f0f54034878b0 - :B - 2 994e36b0d838e2f3d2f466601b95972e 6d95cf1a669f9357c6dd47accb4b03ef - 1 - 3 08667d1a83dda20d779c7fd297609fe9 fca62c30c209e6a1ccec6d694aa43357 - 2 - 4 d281df9fc824bf30c64480cfae9cf789 0d684d8fb19cc547a8378490a9866682 - - 5 e4666ab26543650973a1533d1f87d61e 1291d44f154fae1c229fac785cf33cb3 - [1] 58fd56bb4cc814baf81dc45476499639 - - 6 e8c4abda4459824d015b74a7dd5a3a1c de0fcd1ef947b9c2167a576d95c40c33 - [2] 50b298592f4a216bc18349dca39cb1ca - - 7 d0c222abb52fd47615cbbca8ac4e0870 9c8cd70adc044746892cd7162cbef997 - [3] c693e995e4b337e1a0be8219aaf16767 - - 8 88972f56a2a13d8875d8ea10029020f7 e2327c3dc0f44896a9b2d22dd5ee2cfd - [4] 3c19bf673a6449f2b4c9729b3c3bc216 - [5] a80c86f1e05a5513298e167384635288 - [6] d9e8c362b6baa99d9a2f1db55e20544d - - 9 ed7d166bb7e6440d7441f8ef629a12e1 81d022fdfa0f089f6eb0357a6daa4f51 - [7] 38a2977f0880f6ac0c2eec89b13407d6 - - 10 6d2cd43f7b67d65278d33218fb3c0e16 0c2e562e7d7c4fc9f5f42bdada81b289 - - 11 1fb44441172889de42eb85a5954f8ea5 978ee4863654a57c43db93d5c896d015 - - 12 195320d691bf9ffbe6a9aa29ea2f5ef2 216c62fcfabf379da22e04197c3b5024 - - 13 cc7e973a437795e9648803f6874817f1 4d2663c38b8755e67a063ba49c4f2cb7 - - 14 b743e6466f9c6a0ee754c962bcecd1e5 0a598f7e350d0a29074543f6eaa6b86e - [8] d3a98b6fc44ba27d424e5f6b6c771bb3 - [9] 8aa18725e5bf88dd2ca71cefaafa8f80 - [10] c206f6a5a1466311e20a63fee5058c33 - - 15 43c4fda771f06b28496de398504ec8f5 a23627fb05f326e16c74d8974d096cde - [11] a212e6fd906aed6b62df1263bd750efa - [12] ceb3a0857807cec43df64be233eaa915 - [13] ca363c528efbd7c2db13f5e0da4a2b16 - - 16 adf75dcbe33257b55079ce90ffa630a2 f020cae58ee564a0310da836e7659f1b - [14] c00b9043f6a254c90796d0f883429c5a - - 17 26f9e779b126ad480a19fec1e6cff99c 5b54d644227a4020adadce4d640cf53a - - 18 848113e2286ab6407be7966029d34978 55bd0eeb6bc3b444643e3e27ccd467a3 - - 19 ed6bea9f2af822d92f981cd233d5e56f 9a98b3239f5a78617b5ddbd0b7297b57 - - 20 39d053e9ab688552b8bb98efa7e76ce1 64d651ad30709e5fd106da071381be0f - - 21 4255c3554b1571907acd6ad0126db5f0 e70d18f35d549508742d78a4a977086f - - 22 0f20125f3494ada62bd65c85b4c3f182 b314417b7dc29624d2eac77538123794 - [15] 8a6732de4e335eacc9ac412f53f8ad5a - - 23 e2267c68bb5656c404f7e431dc69bafc e6d5868fbb57b013976344d30afa0cbe - 2 - 24 5ff7130b7d6b0ef913cc8264cd22ed4a 0b8976ccda82a57f0735979d0e1cfb51 - - 25 5939ebdee50d234a06ef61c07ca261c8 7fd2f71f6dccc0fd5cb57f17157d7ea5 - - 26 81fccf43edfb45a5459bfd1399913dd0 c63145afeb97c36e42142be838a170a6 - - 27 0344c5998dc084e805b2ae6cf736e984 22d246668900d1cd134a940ba893ab21 - - 28 876ed1f1adec47cb9bfe66e21ce31a99 76698c051c71e28451d5101f13bef2fc - - 29 6147d87778ae6d588a5ccbb40e865a1a 756f49a8b71ed31f50036454f57b28dd - - 30 a1057af1444f047261bd9d55e22c6bf8 f5d7337d68d0a7ab43c4fea22e96d996 - - 31 9ed19e2ae5c819d7fd90881b3551a66f 9cd96a0fdcf0ef53f8fdcc70a5f8d33e - - 32 325ab45988a8e2b487937e9fef89a03b a031ac953a4a2244cc25d888fa11557d - - 33 4d583aa4cd76f8d312314a7451cde11c 70b35ed716d1af12f98e97fd2df51d42 - - 34 5330201856111ce30c14e263838210fe f0af35915bcff3c61935be2b3b05306c - - 35 66dacc00e5f6a9100a8e2e5125f844ba a3d16c75e46a518ae67628c541d1adcf - - 36 bb48aeccfabf98525c2156f1d35b80e5 a0cb35722c0ed919d35daebf904c82d8 - - 37 0c4b6f8b66a1398930b40373641f894f d1723efe147f5911bb0fddad12d49ff4 - - 38 ae966445de676cb433194823580ce44b 4604f803e63796cb802135c04a2cfc5d - - 39 35c90018a0e41cfd7f0868d7cd6a3cbe 4c8f136be88f2c76ebec2a59641c7f0d - - 40 e4e3970b8314b04846833dc32985cefa 51921e2dc7327e67c7d119af2844931c - - 41 a410303c8f69f01507e80a10fc64f20d 32875c277565db01a5ad63bff4dffacb - - 42 3e7497994de65305df5084adf82c7c1f f2ed0b5c88ac9217b8c56a737123d904 - - 43 5f4713d8954f69dc98bed3f5ccd81a5f e86e56edee32034b5981ecc4e8e0f001 - - 44 90a84670a604a2967577dcf3fb12bb9d f64bf31675166d8d43dcc8a84a29c8f9 - - 45 ae1202842ff50f46eca62d4df19a81df 9120ee4ae2e1af02e226fdfb7371d8ae - - 46 b4270a16bdcca29de577027015742fce 343ce6d2ce8b38086201587271a73def - - 47 c3ab7194f25762f8d571b9665adfe48c d2219c0b3a08b570f9b32d67b1526dc2 - - 48 67a5415df9306b2331276d355441fb9c 14a671b9084f3010674c6913f86c6e4d - - 49 585226d407b9a3f5c25f4f0ef31768b0 7b50d6d084a6382c945c6f86ddf76080 - - 50 f0c7b3c43b79bb898f4de78a18fdd2a7 77adc9bee70ad1a6bed7a65d5d1058d9 - - 51 d5153640ad696495853f994a1d4c4d59 455e8d746068b1b66b16275bcc883696 - - 52 db4f3c8db78b6bfcffc77e5e1b47aeb8 518e25221e34216c5f8159de2c3ee248 - - 53 d226b785f7c632e92af9dcbc9689e06e 4779020c0b92ee82708eaf9a4008e2fd - - 54 3e90f277af4aaddf635f98f74fdb50c8 6294811f2179138f91d157705b6e85e5 - - 55 4b6fe25ada17fde4eade27ebf26eb6a5 24ffc38c7c7672347a2ab83dda9516b5 - 2 - 56 e09158eb69b36a8e238470e801854419 4eba3463abcea945532cebcb016e4a3e - - 57 ed535ca91879b1cc955f9f01ea2b3da0 da0c22d02e604245fd6d06670c8eb175 - [16] f85efccd9f11a44e75a943131bad0bfa - 2 - 58 aca2243ec8a41c445eb0b29522c85aba 681a849b30f1b860bfc6ae2856736f6a - - 59 ae1f58fdb77c5e54c2090295639422d4 6dc227e07964668e6c3985d3ba32226d - [17] 4c7ffbe534780600dc869dee75d81baf - 3 - 60 47a80cb5032e9d6d174b48f994476935 3bcaee0468539d307b0239341a0fce89 - - 61 41ef19ed93fef468e941bdc10ca90ad1 e10297c833c5218b4c77803fa3855776 - - 62 c1cc129ce168c17542ce6c4952298f53 73c4a670ca7386e550c60ed1534bfcfc - - 63 460b0d38c942d4124718bd39efc13fd9 ad5a4d075c3a39453d951efac10b865e - [18] 4b5b4a38b2edfaad2afa05150b8f563a - - 64 5bfe57fbe4ad22604fe179e840d89f08 d4dd26a39e64cff0984ab1cbd9ab2732 - [19] ba3a3e09fa632108cae546c728ad4aea - - 65 71b54a2d3517a734d929ceae7d0cba36 b6d01d840e85c25118b0da3c10d43532 - [20] 4bc0d3b635d3544eea637415d30dc4b5 - - 66 d9e3df8a5de229753069de57eea2f0de c3bb109926f00a8c4a07dc465e634ec1 - 3 - 67 fb78be5b63d5b50ea27bce4b5bb8fd33 b93ea98bbc4ec0973d2a6d6460eec8ad - - 68 68697acff91e22f9ff4065da65de8aa8 9df6327cd6777516caf3e03681630835 - [21] b2f26f49ea19eb595e25acfd8fe2516b - - 69 377eba3b00ca8874ed6a53d278bf418b 2c8ac7dbb2327d2eb19bcef72ec9a3f8 - - 70 8a376aa3b3a2729aa1cf1e653b51e697 e0b4cdf18d476bd3e308334713dc6ec1 - [22] 56889585f8696c368e832027659f0e6f - - 71 e72d6984666b9e4bc17bf98c28404aec 0a8dca9b4c0044a838dbf71507c62392 - [23] 04c1a9ef5f260c099dee47c226075249 - - 72 9f92de880ddb1c321abb95a71301b6e1 1612a5b3131126162a63aeea01210465 - - 73 3de19221f126b1247aab473cf0615426 8ee144a3caa4cd8311e248654efeb9b3 - [24] cbfc890985750cebd192e180998ac7d3 - - 74 b68ff0cde136c9a6339d42b712bd8b6e 574202d0be9a4604d2ac8227ee71739b - [25] 7b0b31bfef4686c49cac42c23f948c5e - - 75 b49454f95fa91ee9b80c73c546632558 6eb4a96e2d63a2f453b59cda7e2cb321 - [26] 384bcb96fb09a3aa1693ba30aa078d12 - - 76 412c57cb5ae043701ac087f55b11704a bb46f2a8980af5d46acef3565fc0a5c5 - [27] 4edcb38257435ca433f92bc58f02a52d - - 77 30ffb6983a71fd8b5f946b193e05fd3a 90b4d75a3ccb3f03eb42e0da218ace78 - [28] 8670dd4936e698e1b48186cd84c798dc - - 78 b37c492f5d751fa0432dcc75cb1e3a4d 829cd7902770d276e2cb602c1bc834e0 - [29] aca4d751f1e6e0507fad9d008bd7fd4d - - 79 837c94b590e398014d0304bbeeef3497 1333f3bbd2565d896d4dc1f35e2c8e4d - [30] 7a040fbc1a8a0273c56774252266ae34 - - 80 03408b35c18744d32d4af15f76f68f47 7ae6e3c815f67b9a73946627f4a0aa32 - [31] fad12377c92253a87f5af57e0fa0f29c - - 81 7143125402e8ae11c41d4490fd21ca2d d9777f2fc911253299fa41303f3171f6 - [32] 2b796a72d7ab5143491e29403d685d55 - 3 - 82 fd9578e47b466f64aeb0205a98ab1f86 46fced2cb29e0ebfce4e5c2824cfe617 - - 83 983b0d5f5987875fcfead48e1f9bc8fc c7aa1d0d48cb1ed2160a71c356e09adf - [33] 54b176c24fc811ce9754f893b1f7c965 - 3 - 84 2f091cf4f13424e9b81483dfc584e1c1 c8635db678614823de2ef33d673be166 - - 85 51c7582fa9dbfb8aa168410b67a0fb37 f01866d559e1ecd69af5cadbe64df577 - - 86 d55e9a8364998d7c5a85e33c07a20a54 17545b3fbb75548397d305fb26d835ef - 2 - 87 05bd236649840ed531cee0db2d5e20ad 2c76401c61fc08d8a8f430b32a121f72 - - 88 ec2cef505ba684bbe9b8421a28a171aa add496a70d1edb8a980c07c1555722bf - 2 - 89 2b1576fde2823489eb293d85a22c6001 270f47bc58a41a123699c079a05f974e - - 90 6587264c121d43c02092d4a8fa3fe5de e3bdb412952e3779df075875256ea3a8 - [34] 02ed5e6c8f2d4e62a283b047237f18e8 - 2 - 91 9e2e300ae9124cf9f2c2780a847eee6f 5fe0525b1cb1e00d801e11cc0276b65e - - 92 b2e365ede3f802881ffba37eded80d9c 2ded1fd8c2f4cd9dcb95b32a7315745c - - 93 61bf53edbdb67d8b0d97170b30050255 e3916766537ec07b5cf0a062650f1e4b - - 94 b58bf73bd49002e654d42e0b4a9a6ae2 bf22d584b5d072bf7c5bd21a1e22493f - - 95 de4f9e7c26d6c4a5e7bf92e6720765de 641fea6f1687a85364c2562c35545c2f - [35] d475ec229ecca11a49a491b0c94f636f - - 96 01db1658c132d0795229fe4ba87b5569 f7b34d22c08a21d203f724919344b7b4 - - 97 bfa442e78fccdbc63466231018209f5b 8254f7836cd7b577e53323145113b12a - - 98 bd383b9439cd51f40643fc8a2ba22e7c 47295943a1ac15951089c2f3911a947a - - 99 8c5bb1998c01db5872b84207a228ce08 fa357eb854ccdbe59595a2b931fc651e - - 100 4057ff04b40b87db82f9681749c103dd 13c824941b1e40134ad593488a851d32 - - 101 f1c77b513117420d08df8f71718eefce 8ee0d74913d66f486ed24208bab434ea - - 102 466bd8dcf523b056c9964772da1e4f94 c2bbcb0a27f11542a3d28b173d8a0966 - - 103 46c4c83c2b74f08eaf342c86617b2093 39f011d7fceed3237ef303ac51263864 - - 104 62ecf16a470c30f40bcfad679c376311 748550f4cbff46f51d585cbfa213ccf3 - 2 - 105 b0e5f33f66cf17789d321ff52baac118 934d6367eeeea23845fbbd88217c46f2 - - 106 d7d04a399729f0a1d5c4cb6ecd97d169 a99ec3a14e2ecffbddd4c7df1cf91f00 - [36] bd3b47e804e6c962b2a629e64577b7bf - [37] 343958722cb0d4c994ab17a684465e58 - - 107 138d12a9f12e72c179c6f989bc172a2b 16dc955e4e3220c8b7103ed5c6096260 - - 108 15124affce1794339ff9dc1d74d80f10 a8f107014975fb72345b1d0451755907 - 2 - 109 a13ed5f3f5318af1469ec9ee17d63b06 8c6afdbb985c2997fcfc874c4ba8765c - - 110 17eedf182056fb9fc9dabce0d934d2bb ac1663e8fd63c79d57053dade405188a - [38] aea6d486459c53ffec73fc310cca33d5 - - 111 4c8ab9166f38767af26fc2c37249b26c 822b69c1aa088e3d1be399539e4d8ae9 - - 112 993295dc46872dd6c124b5d1660e653c 3647f2094954c647ffa53c82064a9a79 - - 113 6a2ba327215d85bb7a0167525e05b641 800859c3c70a9b1ea6d09bccddfeb880 - - 114 88231e6470256a925104a1af32e19746 55769370bf060e251e578b618e29dfc1 - - 115 fcf329737cdb1e8e5faf412afc9c0f28 929a603dd836b26f95acb8f138ac44d8 - - 116 ee93f80c9ebb207d470b70efc4a98a0c 65038b2b94dc2344ce0f0193cf5dd42f - - 117 765f4757898e670cec311ca316d4f0ab c412c3be2c5c18065a11eb41ff8c5d4f - - 118 eba0aaa27c88881b013b3bdfd6ef6419 17cddc21838030c94358cccff7c483af - - 119 ef72ac6a2d16313d18b5a4928451c1ef 83bd7f12ce7f19d247cf53d8142820a2 - - 120 50bce4c80795214fd6188a25b35e854b 54a4c74caf816131f4aa6d94b2110b6b - - 121 bb4bd00b648615c4f8fe3a62d1f7c8ee 664cfad150b5b613b58122c4af9efda3 - - 122 5ec02f520a98f86d6d74fc07041a411e f93b4fc8ca2731339579d1186297dbff - 2 - 123 8d04fd185f3105c6c9b91257877c292d a208c8642741c66215ee8de7c1568216 - - 124 dd17ee307bfe6522a60ab12bcef4bc20 38db40c00d42a7f27f5acbe488088c2f - - 125 6d3539e402883d1ba8b7ce9ee68225a1 03579a412c0fc67fa1409ea708c11be6 - [39] ea422ca9ec6f899b611ffbe3e09a685a - [40] ccf2092eba114a0b8822983bead96e6d - - 126 91b79319cacac240d62d9ced0d6f3f72 b92335b6e089f5ded46f08e345aebee9 - 2 - 127 4c739dc392d5f0b5bd700117656edcba 5c1ec6d9a1318f45043596aa658785fb - - 128 b404fdc11f93706ed4c4990521ef2c81 1e6d10a855caf059fb271f8e382a8841 - 2 - 129 f0c7c4e8d7dbdb7c8640a931a737a5ad 255413b4e9f19be520de9587f607401d - - 130 b2de509e8a0aa3e811ea3933e217ef97 85ec2e37b2e82412451509d514794c21 - 2 - 131 70654874cb4c3bafd1362ac37a1909eb d81ece25f12c60fede2dd8614712b5a6 - - 132 325d8b0e348a0b3eec87e24a52ecda9d ae0a63522ceb436e388457a5d294ae44 - - 133 2d20a332eb24a1cf6e413f0e0326f4ea 2dbaf7f41ea916240e428ed7f77abc35 - - 134 265d8eda282e3ca6d3436434be04e080 9523b28c7df90b2d8f881f26943d0ef5 - 2 - 135 c7ba14654c7d8a43c104972ec8ba4acf 3ba710cd243d7efb1981fad36b26885b - - 136 1fb539b02bf96c6dd5f37116e1e6b91c f2e667bad9d162734394c2cd338bef6f - - 137 d470b09e99c548752e83811e6fed2d6b 22661723e30f9ac1803085d22fcfaf4d - 2 - 138 9de1d9f35cde5281c1e20e9c6748bb92 9ee5a873acfbf716e3084886c614b45a - - 139 26f9e779b126ad480a19fec1e6cff99c 412fa19d540513a11d3aa05ce693d59f - - 140 64a3b041147d6fc022207813ce34e467 d65eb8c97beef5efa4b5a3075e251eb2 - - 141 85861eb47963a61a4a6441341c16755f a744e546abfc6771099308e40ac4908a - [41] 4be4409475751344bc10144995494e72 - - 142 507b503c210d031c2412c970d1cf9106 af8beb64ba67aac895006f295c87821f - - 143 f054e8849f72a9d3abd187f288996dd5 831fdad58ec5c3c8d3c8a6fd6cebc42a - 3 - 144 99c384b4025028923a555809e9267922 396b37d817a8be18b8ccf28d54bb5924 - - 145 93c4889155fd276ae4fbdc2206e191ce 5c5748f8ab44b4c5a2036c91dbc0712b - - 146 1efc652e0e12dc4d820447e77dd64b78 a0bf1ccf109db4cea71ec1d6630e421c - - 147 01f0bc02e9e6cc86635a3adc5ea82ec7 cb66162dbe83971f5049148a377ca92d - [42] 473283a66f9d480166da2f716225b0ac - - 148 4d447d1bd572a5a2a33ead303ca4c35f 8bec9200aaa63c2a3e70076cc6baf308 - - 149 961556de9403f19401dfeb59357b49b9 92cde58526bb9184540028b9b0ca603b - [43] 5c7f65b87e2ec834a1bdd4ed32cad784 - - 150 1599a2fb58e6fb95d4743e87f19fe45d 39513da6ca6bd74e208ccb36847411a0 - - 151 187e368dc7cb025a25fdf3ea31499bf7 2bb11e47bc77c5cd9fb8c8d4a2f5ec39 - [44] ef473262beed77ac803a18167a727ea8 - - 152 0ae89c8451931019172bf449c3c4204b 39af5d0496d7683f9bae2b406667a782 - - 153 8c368d56a702f08c0639b95e20e25aba 29053e581ff2b249efd736a3a190b572 - - 154 b6d15b393664cc4ae66338ea6628eed3 435432f847146a26ddfac98934a75c28 - - 155 04a3db5eec1713c8cfc90726af72b52d 755b2eb26cb3b50a4354d7eda15a1c54 - - 156 be5d5d37542d75f93a87094459f76678 f1b1ea3349e19d58e6708edcb01f5306 - - 157 bddef93eeb25ef95310a42ecf9e08aed dce5cda5a2a1bebba83ebdd0b6491a05 - [45] 0b5c185577a251c74bd10cb5f231dcf2 - - 158 3582ef5c58cb60b23c1fa82e8f39a264 9544a54a17ea74943fd3a5eb4178959c - - 159 beac5e2a80cf3bc41ff7eb64b6b16a0e 70ed3317b04a95357587c8d41adfcff4 - - 160 6d50ca709d5e09dd9b75d9456898cdf0 77e1a777e1e871170c014a4987ddb3cb - - 161 84c4deaf8a29c3e20cfaf49059cc5670 d84af1523ff1364a950e9f8eb73b0de1 - [46] 138a85559300283d8156a545b2046dee - - 162 5b234e4c04e4a9e661b3a1671a61f6fb 699e1dc4ff662c468555f46dcb668a34 - [47] e5b46d62aa211f9661a2feb3f02c79b7 - - 163 16d4cb6432d470baacbcc401de3b9eb3 461daa1c905f72f67ee9e51a75f48bee - [48] 2a3df67054cc563fffd25d049612664c - - 164 d81ab891e8d5e84586cdf5fc53ee687b 12d67af06667c23017f012939b5404c7 - [49] f545d63fc962b2ced9052d89ee17e7cc - - 165 f14ead6e33b9f064e6f9c8e5d0a18556 03fe0cad13b959bca04166e9136ae340 - [50] 496a607027d5257018443c6d0e9d3529 - - 166 f1ad3886a2e48a9ca72d81a93be02b2c 684e7726ef47907f893d5c2ad8d29b5b - [51] db0727c8878eff55ff3db13edcde8aca - - 167 111fa3ce10262667bcd7b36bc07eb336 b9808c481556d37c96f9a4fbd3b366c0 - [52] 7ea92ebe352c04d73397b1b8ed26d0fc - - 168 26d851fbf73067ce350965eff3d1cb4b 9b0f0d326fcdb1299f37855526e0fae1 - - 169 1a5b032379f88716b0053f89d8e434c4 2a162b6579218992f80e7cbbee50a69e - [53] a7185e59152e34073263318f3c893bd0 - - 170 683a9be6b2b8c64e21a1b02d6f3025c2 35d2c8e4214b5ee64699075fdf90830e - [54] fd3e9c6e7e252ca60c456fcb796c747c - - 171 1cc614fd69b6b805cb6d8c14cdf4bab3 53b19415cce5efc60a025f016dff1f11 - [55] a07e01d77c53f4d543aebd7c71c4ef63 - - 172 d858e3d4264a54c06a916a4afa65dd3c f8389b3ec2db88aaf39a1ed46b44c12e - - 173 90a6a134c3b3a15dcc06bd75199265c1 62132f208e29d1c4a98ef35dc9fd126e - [56] 9c4bd4870e45ca1acaf1f06b8b7a26a0 - - 174 91a97e69752003077dd4ea04512e2098 d6d3e43370e5e30917dcc66a3431cc59 - [57] ac0ab6f1b28e690ff8a46ec005c8a9d3 - - 175 991922287d2d432de9db1f96ba3525af fd28c9a3775da7fb6b8c3bff715bf3c7 - [58] 067af6bf609755005a1bfed38764c7f4 - - 176 b55fc12498add81eda5a5e44f95978bb fd980ff0e71d178c99c3349de361902d - 3 - 177 f5a6f5f7d32771bcebd8143a4aab3b63 5f9cb86e97bd99fadeaed05655b85491 - - 178 47b9ef75c31ca9f47954881cdd5f51e7 7367167ce01d7e74c9b19aa2a626ff1e - 3 - 179 6feb0ff199e7e560c213a0da859481ad 9fd5a3645425efeee3d010670968a214 - - 180 0e2500c5917ba70035b540705c3ef5b6 07dbb5f5bdcd3817878cca3555f2aaa8 - 3 - 181 093321fd426307a391ef5615ee1f9f0d 6d8dc5b941d5b6b40503915be9fc4bcf - - 182 ac931caa57567904388d73f185338ce9 38ef07ace64feec52be507f46cc47944 - [59] da8139f949ecfe35f8783701c8eb7962 - - 183 8b9ff7b9be5b18a32e17f68fa0221fd9 d57239e335a7f7cfbd879e1f0318fc8a - [60] 9f71102569088e4cd88b65479afdd3f1 - - 184 1cb8385210b7f84c28060d2bcacb6240 b78751f4bad5350aa94666d720891a00 - [61] 819c21ce731e3e7649b1d953a82020a6 - - 185 3d7a370c7e5fc6e345b5097b3ef84364 b4f86b768ee8142372ca9b836f6f15e3 - [62] 693a8858f4167151e5305d22f9efc4b1 - 3 - 186 dbf86e6a0d77da7560427294f3fd5ce6 fe8f06c0e21ae1653e664189417621ba - - 187 a8d334eeed9608cb061dd81a0fbb7d4f 4928b95bb2be395a5b1c5672bc54c1c7 - [63] 14689939c69ef685b66f0f61f8a4e734 - [64] b5999cfbed614726b6d47cb0603fedc0 - [65] b1b6d7beb8590922b04d92c27f4e115e - - 188 78fb7415ea11268fda446c54efd54a4a 740c1a5c248791fab1903f7a1318e75b - - 189 dfb31f935cb2f8c383a409bdb7858790 b14f2847c513b851b2a954f0a1ab55bb - - 190 dc775b479b9c40119ddddc322fea720d 6d780f91ab112fd0a5055df113b3e404 - - 191 e569de19867d22395eff154d152ea82e 8ba47ae86fa9d660c226b2128fe9053f - - 192 0c13bb4eb22b66d32a4516bc0fc23427 58245f2ee109516689939f105ceee60c - - 193 75151252ac158b73109361decb4ebc10 d948b48e598bd52f7e3938f632958ebe - - 194 29183ec0f3c82033b1b0f6a05cec0fda 3a15e2e58276050bf5de1977df145451 - - 195 eeb01d4f75ecd41cd341befde3fce2dd cbe6abce42128a5aa9dffebbbaaa45a5 - - 196 d41138656d29077df239d6c5fa8ab3c9 62c2e4efeb9743a79386d2e67acba76c - - 197 0b84010c4cc4ae31de4ce40541948dc9 9d2d900411957cb23cdee795315a150d - 3 - 198 38ed194bea89179f084ef83255b682ae 872a6dfd88deda0d637093f05fc65f3a - - 199 e067709cfce967d1061a880039493543 51fd487f18bc0e1a0fdd8d84699834dc - [66] f782401de2288c7f2291f02a002fec56 - [67] af4ee1e8851df6975fdf56485cedbb6e - - 200 7b386ac33fcf4a20e5d63e6cd4ec6afe 277519412ddb02a05da4abfbd45f3b5e - - 201 895d541a674ad678d19e93ebb783974d c8c9c749e89a5de23d320fdfd0353948 - - 202 7dfcb80b2288c074f6dfa12ee7fc87e9 e4ae2a86af435f1e2d719fd956c6ebf0 - - 203 0dd475d06ed7b4d352e6dc0dada8ae33 231649e4ee76101e56f72e0b8de42cf9 - - 204 200751ddbbec8f7c8793c95a7ba787fc 3ed7ee336f382034baf4da0e65d03579 - - 205 19b618dbc9200fd2464e49f4513bbac9 21d1e8ece36af3e3eac80f4035920487 - - 206 dcc3727dbdaa5806b5f0326926227682 9f2b35c2b33aef8258e96f5a9eb9f040 - - 207 8ad65191203ce820546239e2b2ddb347 e235106be2dba6f5b08ff56fd202a468 - - 208 b53b8d0e2a9872f919d866a8221012d2 49b4785c4b41769c4daa14e373ed8615 - - 209 4d92f58e3bbb879354c2f9c353d21621 95d51795f3e2b82b321c6a3ca5872cc6 - - 210 f1898ca2aaae6faad65f6babce5e2263 800335123fec0c4e7d2d94660e0e4932 - - 211 1de1a518f59ebf46441c04ca8db70c86 2948c9f943e6d763a459bf176b5d812a - - 212 e16a7881b4d3f3c29d2a4c6eefd89f69 9f51630dd952315bda7e63506536f6c8 - - 213 f17fbde688751b78b27caf13852eb5d4 6d9fd073dc54ea15908090bf96d578df - [68] 697e5b7dff101929c6fb646ce0aa9641 - - 214 c4be390c38e20ec726222069644cfe7b 8d3e05935a48d46eb4cbed3e293ec1c5 - [69] 4282e2cbeb92d07c8a0e42c329af0e83 - - 215 7beff870c55d7dedb93ece6ee4bc12d5 1d17ba6f8c87b017d5ccae42bedb606c - - 216 6eb1b7c6a0ee4df92feebf088bb979a0 1bcd2910be9ff05d2d2cb66384039892 - - 217 04b6575d0fadad390e7686501a6340aa 391277fb4ff7f0c43cb992fc5bbc4a5f - - 218 037d0ac64db7124770eec397bdbe6762 fdce049ed82cde02ebd42ed4fb40033d - 3 - 219 4e6128b56e0d769d814dcd4ef46c6f84 cefe2fbff7c9083a11728562d11b2a6b - - 220 a301a31a07271f4a927bd83b1e563347 8402ad041ff6f047268bf3c347ce4617 - 2 - 221 2b488ca1ee679a96d61942d6300fbe6e 429187456220d325f879064c2b427ccc - - 222 cdf28b09e7b6bc6d803b7e625145e602 2a49b1ae5ed3d59c6d95880b80153d3f - [70] 8aeb448f4c2a7a34c3500882192eb845 - 2 - 223 0d328ed82813c793f8ba6d631f3ef55b 340dc7d4d0dc84825a981dcf3bcd121d - - 224 7c40bb752203d516785cefa737669340 4fe791352603fb722c2467328a2cc953 - 2 - 225 0344743fe05572255bca230aa8a83f8f c695eeea803deb78dc65067fb2cdce3c - - 226 12567aa91a4e0c52be2994afa9ecc7c3 83517a23ae7f9470a0fc57f566990406 - 2 - 227 a57e6e198764199b5f75cfa1878e3aca 09b1d473b9299deea637fdab48fa1d19 - - 228 88b12228ef872cf38f0d1e57bf0c6f11 fd1f84104ed428bcf1633871d6393b9b - 3 - 229 408115dd685bf9fb9d99fdd42711db42 8b2c4fe87489ff13f667a8b755d7cecc - - 230 4470d753330cee8c9eb80177100c3b2c 2ed5e8175c0f46dde783eb742756ce3f - - 231 285cd37c9ee0bd01a62d50da9a166b12 dc448309d0afdd8d72dc5ec7642702e1 - - 232 c5d210867ab5f119c1b4e23af0cf557e 9c7f3d3721cdee60d98e18ec35d04ac8 - 3 - 233 cf9beacb509ec2de9d501f88d33ea037 1b990c69e132827e0e38ecb94fc73a12 - - 234 ab810e748f7342aee197ff15a21bde16 d7d317cf00fd78468c63685c90dbf40b - - 235 3e207d46f349566cbd12039df6087629 76922c8d6d8ae51d96c261ec742ac27e - [71] 04b6c5fd290d60a082a90bed3688de6b - 2 - 236 4ebe91808654f5994af1ac3aab4ffe51 be0dd54db31b22ecbe3e413b80c1c25a - - 237 e29f82c800c4eb1541e0d46692840a5a b7cc02f3fa15013e38db732ead75ff4b - - 238 9291eeb973b5dea1bb01d04855253b36 328e06479fe5de8ba3895dc183a4aa64 - 1 - 0 cc4947558ec26d67c755c76262f85131 c7a5392e3bd6d769d940d79b19a68e4c - :A - 0 5b23a6f796c80555992b9b68cfdbc2bb 5fbbfb7933004e25553d56929ed7e545 - 1 - 0 ce21470ab49d1d1976bc3dc72438c183 f2709633c4c07e46bf6aeef624e8a736 - - 0 cf49b2f6e553d2ba527eb834c3517ace 6403b404c90eb9bb9da94a5c478f9bb3 - - - 0 3bca2a5af45d3d764dd73c03690cd66e ac56056f12b17cdaf50a68e61c70c01e - - - 0 ebd0aa03a5f7fa6f0e2d38b9676fb155 6f43d92a036c9944794c8582322b5875 - -dc title - 0 a49457f89c66f174b128dbca3fd0425e 812080ca018db38eb8238a3a2d788099 - -dc creator - 0 0679b0c2bbba4dc8a8eea763236c903f e64313bcf0600cea90bc0935c74877cc - -dc rights - 0 0e230d9710d8294315f77233e2b11ef9 94f3688a12e01bb32bff91cf018e0aae - -dc type - 0 7ff1b8f5fc0daaa20ab922214e3a045c ff73c0e3a5d9de30b22e26cd2ad33690 - -dc date created - 0 8456da820291f3c79ab0a731d31c0ab0 05d06783110d7301f10255976adcd403 - -dc date issued - 0 2ae1f808bd0a7aece6f4b556409dae89 84e04b73bc0a73730da80dac2cea9c89 - -dc date available - 0 de063879f14b2f6a940ac64bb4ab05ec 3b6aed1e9b3556ae123bd845e505ce2d - -dc date modified - 0 a76d080f6129a423c6df9e494182464c 226f36d6861bb817387c603c5729a31e - -dc date - 0 813710616a3b633ca5d9096fa68a5cc8 d394bfc8003f9fa9e3260aad421295a2 - - - 0 9d11e23779d9604067dd20e359117573 3f883fb08147a237efaeb152bfaaa1ee - -sourcefile - 0 5cb6e562329ac330b9f52e75954e7e5a 20b427a149d1c6194b16d6da70314da4 - - - 0 794ce8cc422272c64a3d3821caf6c214 f2b54639d1bf563e60268bc3eb417571 - - - 0 9a680b4eda788b4a441a6e52414b7654 c1c11b4ceeca9d00dd57a13373646d9b - source b89ccdad9f6d9c2260d8d383d6b35ccc sisu_description.sst - - - 0 2b89344dabed7858bf918b451f4942f6 ba5182e9541159cc6a5b9b2c055b13e4 - skin 20fc43cf3eb6590bc3399a1aef65c5a9 skin_sisu_manual.rb - - - 0 5c5f06440fcb85cd5c8cfafe32e34b78 d8f5595da03dd0c38d62b56492f5eb0f - -document (metaverse) last generated - 0 59a1b267a5d403db2c2ff13c7bf51287 99146775ff3f96eb5b64f4c622d75525 - -generated by - 0 6c5edbc75adaa6dc22ef77d914d5b52e 857948399ac1b384fa1ec734c7b46782 - -ruby version - 0 37cf2ede6f4722230340c84e2753e412 4ccb32a8ff87fa8b6600884d16e2d685 ------------- -document structure[*] -:A - :B - 1 - 2 - 2 - 2 - 2 - 3 - 3 - 3 - 3 - 2 - 2 - 2 - 2 - 2 - 2 - 2 - 2 - 2 - 2 - 2 - 3 - 3 - 3 - 3 - 3 - 3 - 3 - 2 - 2 - 2 - 2 - 3 - 3 - 2 - 1 -:A - 1 - [*] heading levels ------------- -document structure[*] -@ = 15 -:A = 2 -:B = 1 -1 = 3 -2 = 20 -3 = 13 -objects (ocn) = 238 -endnotes = 71 - [*] number of headers (@) and of each heading level (:A to :C and 1 to 3) ------------- -Note: the time generated related fields (text and digests) will vary between otherwise identical document outputs diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/doc.html b/data/doc/manuals_generated/sisu_manual/sisu_description/doc.html deleted file mode 100644 index a9efd697..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/doc.html +++ /dev/null @@ -1,4760 +0,0 @@ - - - - - - SiSU - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    - - - - - -
    - -
    SiSU -->
    -
    - - - - - - - - - - - - - - - -
    - - Full Text  scroll  - - - - - TOC linked  toc  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
    -
    -   -
    -

    - -

    - -
    - - -
    - -

    SiSU -

    - -

    Description -

    - -

    Ralph Amissah -

    - -copy @ - - SiSU - - -
    - - - -

    Rights: Copyright ©  Ralph Amissah 2007, part of SiSU documentation, License GPL 3

    -

    - -
    - -

    - SiSU - Description,
    Ralph Amissah
    -

    - -

    - SiSU an attempt to describe -

    - -

    - 1. Description -

    - -
    - 1.1 Outline -
    - -
    - 1.2 Short summary of features -
    - -
    - 1.3 How it works -
    - -
    - 1.4 Simple markup -
    - -
    - 1.4.1 Sparse markup requirement, try to get the most out of markup -
    - -
    - 1.4.2 Single markup file provides multiple output formats -
    - -
    - 1.4.3 Syntax relatively easy to read and remember -
    - -
    - 1.4.4 Kept simple by having a limited publishing feature set, and features identified as most important, are available across several document types -
    - -
    - 1.5 Designed with usability in mind -
    - -
    - 1.6 Code separate from content -
    - -
    - 1.7 Object citation numbering, a text or object positioning / citation system - "paragraph" (or text object) numbering, that remains same and usable across all output formats by people and machine -
    - -
    - 1.8 Handling of Dublin Core meta-tags making use of the Resource Description Framework -
    - -
    - 1.9 Easy directory management -
    - -
    - 1.10 Document Version Control Information -
    - -
    - 1.11 Table of contents -
    - -
    - 1.12 Auto-numbering of headings -
    - -
    - 1.13 Numbering and cross-hyperlinking of endnotes -
    - -
    - 1.14 "Skinnable" -
    - -
    - 1.15 Multiple Outputs -
    - -
    - 1.15.1 html - several presentations: full length & segmented; css & table based -
    - -
    - 1.15.2 XML -
    - -
    - 1.15.3 ODT:ODF, Open Document Format - ISO/IEC 26300:2006 -
    - -
    - 1.15.4 PDF - portrait and landscape, (through the generation of LaTeX output which is then transformed to pdf) -
    - -
    - 1.15.5 Search - loading/populating of relational database while retaining document structure information, object citation numbering and other features (currently PostgreSQL and/or SQLite) -
    - -
    - 1.15.6 Search - database frontend sample, utilising database and SiSU features, including object citation numbering (backend currently PostgreSQL) -
    - -
    - 1.15.7 Other forms -
    - -
    - 1.16 Concordance / Word Map or rudimentary index -
    - -
    - 1.17 Managed (document) directory, database, or site structure -
    - -
    - 1.18 Batch processing -
    - -
    - 1.19 Integration to superior Gnu/Linux and Unix tools -
    - -
    - 1.19.1 Backup and version control -
    - -
    - 1.19.2 Editor support -
    - -
    - 1.20 Modular design, need something new add a module -
    - -

    - -

    - - - Document Information (metadata) - - - - - -
    -

    - - SiSU - Description,
    Ralph Amissah -

    -
    -

    -   1 -

    -
    - - - - -
    -

    - - SiSU an attempt to describe -

    -
    -

    -   2 -

    -
    - - - - -
    -

    - - 1. Description -

    -
    -

    -   3 -

    -
    - - - - -
    -
    - - 1.1 Outline -
    -
    -

    -   4 -

    -
    - - - - -
    -

    - SiSU is a flexible document preparation, generation publishing and search system.  1  -

    -
    -

    -   5 -

    -
    - - - - -
    -

    - SiSU ("SiSU information Structuring Universe" or "Structured information, Serialized Units"),  2  is a Unix command line oriented framework for document structuring, publishing and search. Featuring minimalistic markup, multiple standard outputs, a common citation system, and granular search. -

    -
    -

    -   6 -

    -
    - - - - -
    -

    - Using markup applied to a document, SiSU can produce plain text, HTML, XHTML, XML, OpenDocument, LaTeX or PDF files, and populate an SQL database with objects  3  (equating generally to paragraph-sized chunks) so searches may be performed and matches returned with that degree of granularity (e.g. your search criteria is met by these documents and at these locations within each document). Document output formats share a common object numbering system for locating content. This is particularly suitable for "published" works (finalized texts as opposed to works that are frequently changed or updated) for which it provides a fixed means of reference of content. -

    -
    -

    -   7 -

    -
    - - - - -
    -

    - SiSU is the data/information structuring and transforming tool, that has resulted from work on one of the oldest law web projects. It makes possible the one time, simple human readable markup of documents, that SiSU can then publish in various forms, suitable for paper  4  , web  5  and relational database  6  presentations, retaining common data-structure and meta-information across the output/presentation formats. Several requirements of legal and scholarly publication on the web have been addressed, including the age old need to be able to reliably cite/pinpoint text within a document, to easily make footnotes/endnotes, to allow for semantic document meta-tagging, and to keep required markup to a minimum. These and other features of interest are listed and described below. A few points are worth making early (and will be repeated a number of times): -

    -
    -

    -   8 -

    -
    - - - - -
    -

    - (i) The SiSU document generator was the first to place material on the web with a system that makes possible citation across different document types, with paragraph, or rather object citation numbering  7  a text positioning system, available for the pinpointing of text, 1997, a simple idea from which much benefit, and SiSU remains today, to the best of my knowledge, the only multiple format e-book/ electronic-document system on the web that gives you this possibility (including for relational databases). -

    -
    -

    -   9 -

    -
    - - - - -
    -

    - (ii) Markup is done once for the multiple formats produced. -

    -
    -

    -   10 -

    -
    - - - - -
    -

    - (iii) Markup is simple, and human readable (with a little practice), in almost all cases there is less and simpler markup required than basic html. In any event the markup required is very much simpler than the html, LaTeX, [lout], structured XML, ODF (OpenDocument), PostgreSQL or SQLite feed etc. that you can have SiSU generate for you. -

    -
    -

    -   11 -

    -
    - - - - -
    -

    - (iv) SiSU is a batch processor, dealing with as many files as you need to generate at a time. -

    -
    -

    -   12 -

    -
    - - - - -
    -

    - (v) Scalability is dependent on your file system (in my case Reiserfs), the database (currently Postgresql and/or SQLite) and your hardware. -

    -
    -

    -   13 -

    -
    - - - - -
    -

    - SiSU Sabaki  8  (or just SiSU) is the provisional name given to the software described here that helps structure documents for web and other publication. The name SiSU is a loose anagram for something along the lines of "SiSU is structuring unit", or "SiSU, information structuring unit" or the more descriptive "Structured information, Serialized Units" or "simple - information structuring unit" or the more descriptive "Structured information, Serialized Units" or what it may be directed towards "semantic and information structuring universe"  9  tongue in cheek, only just. Guess I'll get away with "Simple - information Structuring Universe". SiSU is also a Finnish word roughly meaning guts, inner strength and perseverance.  10  -

    -
    -

    -   14 -

    -
    - - - - -
    -

    - SiSU was born of the need to find a way, with minimal effort, and for as wide a range of document types as possible, to produce high quality publishing output in a variety of document formats. As such it was necessary to find a simple document representation that would work across a large number of document types, and the most convenient way(s) to produce acceptable output formats. The project leading to this program was started in 1993 (together with the trade law project now known as Lex Mercatoria) as an investigation of how to effectively/efficiently place documents on the web. The unified document handling, together with features such as paragraph numbering, endnote handling and tables... appeared in 1996/97. SiSU was originally written in Perl,  11  and converted to Ruby,   12  in 2000, one of the most impressive programming languages in existence! In its current form it has been written to run on the Gnu /Linux platform, and in particular on Debian,   13  taking advantage of many of the wonderful projects that are available there. -

    -
    -

    -   15 -

    -
    - - - - -
    -

    - SiSU markup is based on requiring the minimum markup needed to determine the structure of a document. (This can be as little as saying in a header to look for the word Book at a specified level and the word Chapter at another level). SiSU then breaks a document into its smallest parts (at a heading, and paragraph level) while retaining all structural information. This break up of the document and information on its structure is taken advantage of in the transformations made in generating the very different output types that can be created, and in providing as much as can be for what each output type is best at doing, e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), ODF (OpenDocument [experimental]), SQL (e.g. document search; representing constituent parts of documents based on their structure, headings, chapters, paragraphs as required; user control).  14  -

    -
    -

    -   16 -

    -
    - - - - -
    -

    - From markup that is simpler and more sparse than html you get: -

    -
    -

    -   17 -

    -
    - - - - -
    -
  • - far greater output possibilities, including html, XML, ODF (OpenDocument), LaTeX (pdf), and SQL; -
  • -
    -

    -   18 -

    -
    - - - - -
    -
  • - the advantages implicit in the very different output possibilities; -
  • -
    -

    -   19 -

    -
    - - - - -
    -
  • - a common citation system (for all outputs - including the relational database, search results are relevant for all outputs); -
  • -
    -

    -   20 -

    -
    - - - - -
    -

    - For more see the short summary of features provided below. -

    -
    -

    -   21 -

    -
    - - - - -
    -

    - SiSU processes files with minimal tagging to produce various document outputs including html, LaTeX or lout (which is converted to pdf) and if required loads the structured information into an SQL database (PostgreSQL and SQLite have been used for this). SiSU produces an intermediate processing format.  15  -

    -
    -

    -   22 -

    -
    - - - - -
    -

    - SiSU is used in constructing Lex Mercatoria <http://lexmercatoria.org/> or <http://www.jus.uio.no/lm/> (one of the oldest law web sites), and considerable thought went into producing output that would be suitable for legal and academic writings (that do not have formulae) given the limitations of html, and publication in a wide variety of "formats", in particular in relation to the convenient and accurate citation of text. However, the construction of Lex Mercatoria uses only a fraction of the features available from SiSU today, vis generation of flat file structures, rather than in addition the building of ("granular") SQL database content, (at an object level with relevant relational tables, and other outputs also available). -

    -
    -

    -   23 -

    -
    - - - - -
    -
    - - 1.2 Short summary of features -
    -
    -

    -   24 -

    -
    - - - - -
    -

    - (i) markup syntax: (a) simpler than html, (b) mnemonic, influenced by mail/messaging/wiki markup practices, (c) human readable, and easily writable, -

    -
    -

    -   25 -

    -
    - - - - -
    -

    - (ii) (a) minimal markup requirement, (b) single file marked up for multiple outputs, -

    -
    -

    -   26 -

    -
    - - - - -
    -

    - notes: -

    -
    -

    -   27 -

    -
    - - - - -
    -

    - * documents are prepared in a single UTF-8 file using a minimalistic mnemonic syntax. Typical literature, documents like "War and Peace" require almost no markup, and most of the headers are optional. -

    -
    -

    -   28 -

    -
    - - - - -
    -

    - * markup is easily readable/parsed by the human eye, (basic markup is simpler and more sparse than the most basic html), [this may also be converted to XML representations of the same input/source document]. -

    -
    -

    -   29 -

    -
    - - - - -
    -

    - * markup defines document structure (this may be done once in a header pattern-match description, or for heading levels individually); basic text attributes (bold, italics, underscore, strike-through etc.) as required; and semantic information related to the document (header information, extended beyond the Dublin core and easily further extended as required); the headers may also contain processing instructions. -

    -
    -

    -   30 -

    -
    - - - - -
    -

    - (iii) (a) multiple outputs primarily industry established and institutionally accepted open standard formats, include amongst others: plaintext (UTF-8); html; (structured) XML; ODF (Open Document text)l; LaTeX; PDF (via LaTeX); SQL type databases (currently PostgreSQL and SQLite). Also produces: concordance files; document content certificates (md5 or sha256 digests of headings, paragraphs, images etc.) and html manifests (and sitemaps of content). (b) takes advantage of the strengths implicit in these very different output types, (e.g. PDFs produced using typesetting of LaTeX, databases populated with documents at an individual object/paragraph level, making possible granular search (and related possibilities)) -

    -
    -

    -   31 -

    -
    - - - - -
    -

    - (iv) outputs share a common numbering system (dubbed "object citation numbering" (ocn)) that is meaningful (to man and machine) across various digital outputs whether paper, screen, or database oriented, (PDF, html, XML, sqlite, postgresql), this numbering system can be used to reference content. -

    -
    -

    -   32 -

    -
    - - - - -
    -

    - (v) SQL databases are populated at an object level (roughly headings, paragraphs, verse, tables) and become searchable with that degree of granularity, the output information provides the object/paragraph numbers which are relevant across all generated outputs; it is also possible to look at just the matching paragraphs of the documents in the database; [output indexing also work well with search indexing tools like hyperesteier]. -

    -
    -

    -   33 -

    -
    - - - - -
    -

    - (vi) use of semantic meta-tags in headers permit the addition of semantic information on documents, (the available fields are easily extended) -

    -
    -

    -   34 -

    -
    - - - - -
    -

    - (vii) creates organised directory/file structure for (file-system) output, easily mapped with its clearly defined structure, with all text objects numbered, you know in advance where in each document output type, a bit of text will be found (e.g. from an SQL search, you know where to go to find the prepared html output or PDF etc.)... there is more; easy directory management and document associations, the document preparation (sub-)directory may be used to determine output (sub-)directory, the skin used, and the SQL database used, -

    -
    -

    -   35 -

    -
    - - - - -
    -

    - (viii) "Concordance file" wordmap, consisting of all the words in a document and their (text/ object) locations within the text, (and the possibility of adding vocabularies), -

    -
    -

    -   36 -

    -
    - - - - -
    -

    - (ix) document content certification and comparison considerations: (a) the document and each object within it stamped with an md5 hash making it possible to easily check or guarantee that the substantive content of a document is unchanged, (b)version control, documents integrated with time based source control system, default RCS or CVS with use of $Id: sisu_description.sst,v 1.25 2007/08/23 12:22:36 ralph Exp $ tag, which SiSU checks -

    -
    -

    -   37 -

    -
    - - - - -
    -

    - (x) SiSU's minimalist markup makes for meaningful "diffing" of the substantive content of markup-files, -

    -
    -

    -   38 -

    -
    - - - - -
    -

    - (xi) easily skinnable, document appearance on a project/site wide, directory wide, or document instance level easily controlled/changed, -

    -
    -

    -   39 -

    -
    - - - - -
    -

    - (xii) in many cases a regular expression may be used (once in the document header) to define all or part of a documents structure obviating or reducing the need to provide structural markup within the document, -

    -
    -

    -   40 -

    -
    - - - - -
    -

    - (xiii) prepared files may be batch process, documents produced are static files so this needs to be done only once but may be repeated for various reasons as desired (updated content, addition of new output formats, updated technology document presentations/representations) -

    -
    -

    -   41 -

    -
    - - - - -
    -

    - (xiv) possible to pre-process, which permits: the easy creation of standard form documents, and templates/term-sheets, or; building of composite documents (master documents) from other sisu marked up documents, or marked up parts, i.e. import documents or parts of text into a main document should this be desired -

    -
    -

    -   42 -

    -
    - - - - -
    -

    - there is a considerable degree of future-proofing, output representations are "upgradeable", and new document formats may be added. -

    -
    -

    -   43 -

    -
    - - - - -
    -

    - (xv) there is a considerable degree of future-proofing, output representations are "upgradeable", and new document formats may be added: (a) modular, (thanks in no small part to Ruby) another output format required, write another module.... (b) easy to update output formats (eg html, XHTML, LaTeX/PDF produced can be updated in program and run against whole document set), (c) easy to add, modify, or have alternative syntax rules for input, should you need to, -

    -
    -

    -   44 -

    -
    - - - - -
    -

    - (xvi) scalability, dependent on your file-system (ext3, Reiserfs, XFS, whatever) and on the relational database used (currently Postgresql and SQLite), and your hardware, -

    -
    -

    -   45 -

    -
    - - - - -
    -

    - (xvii) only marked up files need be backed up, to secure the larger document set produced, -

    -
    -

    -   46 -

    -
    - - - - -
    -

    - (xviii) document management, -

    -
    -

    -   47 -

    -
    - - - - -
    -

    - (xix) Syntax highlighting for SiSU markup is available for a number of text editors. -

    -
    -

    -   48 -

    -
    - - - - -
    -

    - (xx) remote operations: (a) run SiSU on a remote server, (having prepared sisu markup documents locally or on that server, i.e. this solution where sisu is installed on the remote server, would work whatever type of machine you chose to prepare your markup documents on), (b) generated document outputs may be posted by sisu to remote sites (using rsync/scp) (c)document source (plaintext utf-8) if shared on the net may be identified by its url and processed locally to produce the different document outputs. -

    -
    -

    -   49 -

    -
    - - - - -
    -

    - (xxi) document source may be bundled together (automatically) with associated documents (multiple language versions or master document with inclusions) and images and sent as a zip file called a sisupod, if shared on the net these too may be processed locally to produce the desired document outputs, these may be downloaded, shared as email attachments, or processed by running sisu against them, either using a url or the filename. -

    -
    -

    -   50 -

    -
    - - - - -
    -

    - (xxii) for basic document generation, the only software dependency is Ruby, and a few standard Unix tools (this covers plaintext, html, XML, ODF, LaTeX). To use a database you of course need that, and to convert the LaTeX generated to PDF, a LaTeX processor like tetex or texlive. -

    -
    -

    -   51 -

    -
    - - - - -
    -

    - as a developers tool it is flexible and extensible -

    -
    -

    -   52 -

    -
    - - - - -
    -

    - SiSU was developed in relation to legal documents, and is strong across a wide variety of texts (law, literature...). SiSU handles images but is not suitable for formulae/ statistics, or for technical writing at this time. -

    -
    -

    -   53 -

    -
    - - - - -
    -

    - SiSU has been developed and has been in use for several years. Requirements to cover a wide range of documents within its use domain have been explored. -

    -
    -

    -   54 -

    -
    - - - - -
    -

    - Some modules are more mature than others, the most mature being Html and LaTeX / pdf. PostgreSQL and search functions are useable and together with ocn unique (to the best of my knowledge). The XML output document set is "well formed" but largely proof of concept. -

    -
    -

    -   55 -

    -
    - - - - -
    -
    - - 1.3 How it works -
    -
    -

    -   56 -

    -
    - - - - -
    -

    - SiSU markup is fairly minimalistic, it consists of: a (largely optional) document header, made up of information about the document (such as when it was published, who authored it, and granting what rights) and any processing instructions; and markup within text which is related to document structure and typeface. SiSU must be able to discern the structure of a document, (text headings and their levels in relation to each other), either from information provided in the instruction header or from markup within the text (or from a combination of both). Processing is done against an abstraction of the document comprising of information on the document's structure and its objects,  16  which the program serializes (providing the object numbers) and which are assigned hash sum values based on their content. This abstraction of information about document structure, objects, (and hash sums), provides considerable flexibility in representing documents different ways and for different purposes (e.g. search, document layout, publishing, content certification, concordance etc.), and makes it possible to take advantage of some of the strengths of established ways of representing documents, (or indeed to create new ones). -

    -
    -

    -   57 -

    -
    - - - - -
    -
    - - 1.4 Simple markup -
    -
    -

    -   58 -

    -
    - - - - -
    -

    - SiSU markup is based on requiring the minimum markup needed to determine the structure of a document. (This can be as little as saying in a header to look for the word Book at a specified level and the word Chapter at another level). SiSU then breaks a document into its smallest parts (at a heading, and paragraph level) while retaining all structural information. This break up of the document and information on its structure is taken advantage of in the transformations made in generating the very different output types that can be created, and in providing as much as can be for what each output type is best at doing, e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), ODF (OpenDocument), SQL (e.g. document search; representing constituent parts of documents based on their structure, headings, chapters, paragraphs as required; user control).  17  -

    -
    -

    -   59 -

    -
    - - - - -
    -
    - - 1.4.1 Sparse markup requirement, try to get the most out of markup -
    -
    -

    -   60 -

    -
    - - - - -
    -

    - One of its strengths is that very small amounts of initial tagging is required for the program to generate its output. -

    -
    -

    -   61 -

    -
    - - - - -
    -

    - This is a basic markup example: -

    -
    -

    -   62 -

    -
    - - - - -
    -
  • - basic markup example, text file - an international convention   18  -
  • -
    -

    -   63 -

    -
    - - - - -
    -
  • - view basic markup, as it would be highlighted by vim editor   19  -
  • -
    -

    -   64 -

    -
    - - - - -
    -

    - Emphasis has been on simplicity and minimalism in markup requirements. Design philosophy is to try keep the amount of markup required low, for whatever has been determined to be acceptable output.  20  -

    -
    -

    -   65 -

    -
    - - - - -
    -

    - SiSU's markup is more minimalistic and simpler than (the equivalent) html and for it, you get considerably more than just html, as this preparation gives you all available output formats, upon request. -

    -
    -

    -   66 -

    -
    - - - - -
    -
    - - 1.4.2 Single markup file provides multiple output formats -
    -
    -

    -   67 -

    -
    - - - - -
    -

    - For each document, there is only one (input, minimalistically marked up) file from which all the available output types are generated.  21  -

    -
    -

    -   68 -

    -
    - - - - -
    -

    - Eg. the markup example: -

    -
    -

    -   69 -

    -
    - - - - -
    -
  • - original text file - an international convention   22  -
  • -
    -

    -   70 -

    -
    - - - - -
    -
  • - view as syntax would be highlighted by vim editor   23  -
  • -
    -

    -   71 -

    -
    - - - - -
    -

    - Produces the following output: -

    -
    -

    -   72 -

    -
    - - - - -
    -
  • - Segmented html version of document   24  -
  • -
    -

    -   73 -

    -
    - - - - -
    -
  • - Full length html document   25  -
  • -
    -

    -   74 -

    -
    - - - - -
    -
  • - pdf landscape version of document   26  -
  • -
    -

    -   75 -

    -
    - - - - -
    -
  • - pdf portrait version of document   27  -
  • -
    -

    -   76 -

    -
    - - - - -
    -
  • - clean tex ascii version of document   28  -
  • -
    -

    -   77 -

    -
    - - - - -
    -
  • - xml sax version of document   29  -
  • -
    -

    -   78 -

    -
    - - - - -
    -
  • - xml dom version of document   30  -
  • -
    -

    -   79 -

    -
    - - - - -
    -
  • - Concordance   31  -
  • -
    -

    -   80 -

    -
    - - - - -
    -

    - (and in addition to these: PostgreSQL, SQLite, texinfo and YAML   32  versions if desired) -

    -
    -

    -   81 -

    -
    - - - - -
    -
    - - 1.4.3 Syntax relatively easy to read and remember -
    -
    -

    -   82 -

    -
    - - - - -
    -

    - Syntax is kept simple and mnemonic.  33  -

    -
    -

    -   83 -

    -
    - - - - -
    -
    - - 1.4.4 Kept simple by having a limited publishing feature set, and features identified as most important, are available across several document types -
    -
    -

    -   84 -

    -
    - - - - -
    -

    - To keep SiSU markup sparse and simple SiSU deliberately provides a limited publishing feature set, including: indent levels; bold; italics; superscript; subscript; simple tables; images; tables of contents and; endnotes. Which in most cases are available across the different output formats. -

    -
    -

    -   85 -

    -
    - - - - -
    -

    - The publishing feature set may be expanded as required. -

    -
    -

    -   86 -

    -
    - - - - -
    -
    - - 1.5 Designed with usability in mind -
    -
    -

    -   87 -

    -
    - - - - -
    -

    - Output is designed to be uniform, easy to read, navigate and cite. -

    -
    -

    -   88 -

    -
    - - - - -
    -
    - - 1.6 Code separate from content -
    -
    -

    -   89 -

    -
    - - - - -
    -

    - Code  34  is separated from content. This means that when changes are desired in the output presentation, the code that produces them, and not the marked up text data set (which could be thousands of documents) is modified. Separating code from content makes large scale changes to output appearance trivial, and permits the easy addition of new output modules. -

    -
    -

    -   90 -

    -
    - - - - -
    -
    - - 1.7 Object citation numbering, a text or object positioning / citation system - "paragraph" (or text object) numbering, that remains same and usable across all output formats by people and machine -
    -
    -

    -   91 -

    -
    - - - - -
    -

    - Object citation numbering is a simple object (text) positioning and cition system that is human relevant and machine useable, used by SiSU for all manner of presentations, and that is available for use in all text mappings. It is based on the automated sequential numbering of objects (roughly paragraphs, (headings, tables, verse) or other blocks of text or images etc.). The text positioning system (in which I claim copyright) is invaluable for publishing requiring the citing text across multiple output formats, and for the general mapping of text within a document: -

    -
    -

    -   92 -

    -
    - - - - -
    -
  • - in html, html not being easily citeable (change font size, or use a different browser and the page on which specific text appears has changed), and -
  • -
    -

    -   93 -

    -
    - - - - -
    -
  • - across multiple formats being common to all output formats html/xml/pdf/sql output, -
  • -
    -

    -   94 -

    -
    - - - - -
    -
  • - the results of an sql search can just be "live" citation references to the documents in which the text is found, much like an index (see image examples provided).   35  -
  • -
    -

    -   95 -

    -
    - - - - -
    -

    - I claim copyright on the system I use which is the most basic of all, numbering all text in headings and paragraphs sequentially (with tables and images being treated as a single paragraph) and only footnotes/endnotes not following this numbering, as their position in text is not strictly determined, (a change from footnotes to endnotes would change their numbering), footnotes instead "belong" to the paragraph from which they are referenced, and have sequential numbers of their own. -

    -
    -

    -   96 -

    -
    - - - - -
    -

    - SiSU has a paragraph numbering system, that remains the same regardless of the output format. This provides an effective means of citation, pinpointing text accurately in all output formats, using the same reference. This is particularly useful where text has to be located across different output formats - for example once html is printed the number of pages and pages on which given text is found will vary depending on the browser, its settings the font size setting etc. Similarly SiSU produces pdf in different forms, eg. on the example site Lex Mercatoria as portrait and landscape documents - here too page numbering varies, but paragraph numbering is the same, vis a vis all versions of the text (portrait and landscape pdf and the html versions of the text, and as stored (with "paragraphs" as records) to the PostgreSQL or SQLite database). -

    -
    -

    -   97 -

    -
    - - - - -
    -

    - These numbers are placed in the text margins and are intended to be independent of and not to interfere with authors tagging. [The citation system (object citation numbering system, automated "paragraph numbering") which is automatically generated and is common and identical across all document formats] The paragraph numbering system is more accurately described as an (text) object numbering system, as headings are also numbered... all headings and paragraphs are numbered sequentially. Endnotes are automatically numbered independently and rather "belong" to the paragraph from which they are referenced, as an endnote does not (necessarily) form a part of a documents sequence, (they may be produced as either endnotes or footnotes (or both depending on what output you choose to look at - if you take the segmented html version document provided as an example, you will find that the endnotes are placed both at the end of each section, and in a separate section of their own called endnotes, and these are hyper-linked)). An attractive feature of providing citation numbering in this way is that it is independent of the document structure... it remains the same regardless of what is done about the document structure. -

    -
    -

    -   98 -

    -
    - - - - -
    -

    - The rules have been kept very simple, unique incremental object citation numbers are assigned to headings, paragraphs, verse, tables and images. It is possible to manually override this feature on a per heading or comment basis though this should be used exceptionally, it may be of use where there a substantive text, and the addition of a minor comment by the publisher that should not be mapped as part of the text. -

    -
    -

    -   99 -

    -
    - - - - -
    -

    - The object citation number markers contain additional numbering information with regard to the document structure, that can be used for alternative presentations, including such detail as the type of object (heading, paragraph, table, image, etc.), numbered sequentially. -

    -
    -

    -   100 -

    -
    - - - - -
    -

    - An advantage is that the numbering remains the same regardless of document structure. -

    -
    -

    -   101 -

    -
    - - - - -
    -

    - Text object ("paragraph") numbering is the same for all output versions of the same document, vis html, pdf, pgsql, yaml etc. -

    -
    -

    -   102 -

    -
    - - - - -
    -

    - In the relational database, as individual text objects of a document stored (and indexed) together with object numbers, and all versions of the document have the same numbering, the results of searches may be tailored just to provide the location of the search result in all available document formats. -

    -
    -

    -   103 -

    -
    - - - - -
    -

    - Note: there is a bug in the released behaviour of object citation numbering, (not certain when it was introduced) tables should be numbered, ie each table gets an ocn, required amongst other things for relational database. This will be corrected in a future release. Citation numbering of existing documents that contain tables will changed. -

    -
    -

    -   104 -

    -
    - - - - -
    -
    - - 1.8 Handling of Dublin Core meta-tags making use of the Resource Description Framework -
    -
    -

    -   105 -

    -
    - - - - -
    -

    - SiSU is able to use meta tags based on the Dublin Core  36  and Resource Description Framework  37  -

    -
    -

    -   106 -

    -
    - - - - -
    -

    - This provides the means of providing semantic information about a document, both as computer processable meta-tags, and as human readable information that may be of value for classification purposes. -

    -
    -

    -   107 -

    -
    - - - - -
    -

    - This information is provided both in html metatags, and (where available) under the section titled "Document Information - MetaData", near the end of a document, for example in the segmented html version of this text at: <http://www.jus.uio.no/sisu/SiSU/metadata.html> -

    -
    -

    -   108 -

    -
    - - - - -
    -
    - - 1.9 Easy directory management -
    -
    -

    -   109 -

    -
    - - - - -
    -

    - 1. Directory file association, skins and special image management, made simpler.  38  -

    -
    -

    -   110 -

    -
    - - - - -
    -

    - The last part of the name of the work directory in which markup is being done, or rather from where SiSU is run in order to generate document output, is used in determining the sub-directory name for output files, that is created in the document output directory. This provides a rather easy way to associate documents e.g. of a given subject, or by owner. -

    -
    -

    -   111 -

    -
    - - - - -
    -

    - -  /www/docs
    -      /intellectual_property
    -      /arbitration
    -      /contract_law

    -  /www/docs
    -      /ralph
    -      /sisu - -

    -
    -

    -   112 -

    -
    - - - - -
    -

    - all are placed in their own directories within the directory structure created. Similar rules are used in the creation of sql type databases (though they can be overridden). -

    -
    -

    -   113 -

    -
    - - - - -
    -

    - There are a couple of further associations with these directories. -

    -
    -

    -   114 -

    -
    - - - - -
    -

    - Directory wide skins. -

    -
    -

    -   115 -

    -
    - - - - -
    -

    - Directory specific images. -

    -
    -

    -   116 -

    -
    - - - - -
    -

    - 2. If there is a "directory skin", that is a skin of the same name as the directory, it is used in the generation of the documents within it, rather than the default skin, unless the document has a specific skin associated with it. -

    -
    -

    -   117 -

    -
    - - - - -
    -

    - a. default skin (always available) -

    -
    -

    -   118 -

    -
    - - - - -
    -

    - b. directory skin (precedence over default if exists) -

    -
    -

    -   119 -

    -
    - - - - -
    -

    - c. document skin (takes precedence wherever document requests a specific skin) -

    -
    -

    -   120 -

    -
    - - - - -
    -

    - Skins are defined in the document skin directory and if a directory association is desired a softlink made to the relevant skin. Skins (directory association auto load) auto load skin if a directory skin exists of same name as directory stub, (and there is no specific doc skin) -

    -
    -

    -   121 -

    -
    - - - - -
    -

    - 3. If the working directory has within it a sub-directory called image_local, the images within that directory are used for references to images, that are not part of the default site build. -

    -
    -

    -   122 -

    -
    - - - - -
    -
    - - 1.10 Document Version Control Information -
    -
    -

    -   123 -

    -
    - - - - -
    -

    - The possibility of citing an exact document version. -

    -
    -

    -   124 -

    -
    - - - - -
    -

    - Permits the inclusion of document version control information to the document body and metatags.  39  This provides a much more certain method of referring to the exact version of a particular document, (assuming that the document is from a trusted source, that will retain earlier versions of a document).  40  -

    -
    -

    -   125 -

    -
    - - - - -
    -

    - This information (where available) is provided under the section of the document titled "Document Information - MetaData", near the end of a document, for example in the segmented html version of this text at: <http://www.jus.uio.no/sisu/SiSU/metadata.html> -

    -
    -

    -   126 -

    -
    - - - - -
    -
    - - 1.11 Table of contents -
    -
    -

    -   127 -

    -
    - - - - -
    -

    - SiSU produces a rudimentary a table of contents based on document headings. -

    -
    -

    -   128 -

    -
    - - - - -
    -
    - - 1.12 Auto-numbering of headings -
    -
    -

    -   129 -

    -
    - - - - -
    -

    - Headings can be automatically numbered, (and automatically named for hyper-linking) -

    -
    -

    -   130 -

    -
    - - - - -
    -
    - - 1.13 Numbering and cross-hyperlinking of endnotes -
    -
    -

    -   131 -

    -
    - - - - -
    -

    - SiSU can automatically number footnotes/endnotes. This is the default operation where no number is provided. -

    -
    -

    -   132 -

    -
    - - - - -
    -

    - Footnotes/endnotes may also be manually numbered. Where a number, or numbers are provided for a footnote/endnote, this does not increment the automatic footnote/endnote number counter. -

    -
    -

    -   133 -

    -
    - - - - -
    -

    - In the html output footnotes/endnotes are cross-hyper-linked (to their reference point and vice versa). In th pdf output footnotes are linked from their reference point only. -

    -
    -

    -   134 -

    -
    - - - - -
    -
    - - 1.14 "Skinnable" -
    -
    -

    -   135 -

    -
    - - - - -
    -

    - SiSU is skinnable, on a site-wide, directory-wide and per document basis, so different looking versions of things may be produced with little difficulty. There is a default skin which may be modified, as the background site skin, and each working directory may have a skin associated with it, as may each individual document. The hierarchy of application is document, directory, then site... ie if a document skin exists it gets precedence. -

    -
    -

    -   136 -

    -
    - - - - -
    -

    - Whilst it is skinnable, the default output styles are selected to work across the widest possible range of document types. -

    -
    -

    -   137 -

    -
    - - - - -
    -
    - - 1.15 Multiple Outputs -
    -
    -

    -   138 -

    -
    - - - - -
    -

    - From markup that is simpler and more sparse than html you get: -

    -
    -

    -   139 -

    -
    - - - - -
    -
  • - far greater output possibilities, including multiple html types, XML (different structured types), LaTeX (pdf landscape, portrait), and SQL (Postgresql or SQLite or other); -
  • -
    -

    -   140 -

    -
    - - - - -
    -
  • - the advantages implicit in these very different output possibilities;  41  -
  • -
    -

    -   141 -

    -
    - - - - -
    -
  • - a common citation system -
  • -
    -

    -   142 -

    -
    - - - - -
    -

    - As many output formats/presentations as one cares to write modules for - several types of html (e.g. structure based on css, or structure based on tables); LaTeX/pdf and Lout/pdf; pgsql other databases easily added; yaml... -

    -
    -

    -   143 -

    -
    - - - - -
    -
    - - 1.15.1 html - several presentations: full length & segmented; css & table based -
    -
    -

    -   144 -

    -
    - - - - -
    -

    - Most documents are produced in single and segmented html versions, described below: -

    -
    -

    -   145 -

    -
    - - - - -
    -

    - The Scroll (full length text presentations) -

    -
    -

    -   146 -

    -
    - - - - -
    -

    - The full length of the text in a single scrollable document.  42  As a rule the files they are saved in are named: doc or more precisely doc.html -

    -
    -

    -   147 -

    -
    - - - - -
    -

    - For various reasons texts may only be provided in this form (such as this one which is short), though most are also provided as segmented texts. -

    -
    -

    -   148 -

    -
    - - - - -
    -

    - "Scroll" is a reference to the historical scroll, a single long document/ parchment, and also no doubt to what you will have to do to get to the bottom of the text.  43  -

    -
    -

    -   149 -

    -
    - - - - -
    -

    - The Segmented Text -

    -
    -

    -   150 -

    -
    - - - - -
    -

    - The text divided into segments (such as articles or chapters depending on the text)  44  As a rule the files they are saved in are named: toc and index or more precisely toc.html and index.html -

    -
    -

    -   151 -

    -
    - - - - -
    -

    - If you know exactly what you are looking for, loading a segment of text is faster (the segments being smaller). Occasionally longer documents such as the WTA 1994 <http://www.jus.uio.no/lm/wta.1994/toc> are only provided in segmented form. -

    -
    -

    -   152 -

    -
    - - - - -
    -

    - Cascading Style Sheet, and Table based html -

    -
    -

    -   153 -

    -
    - - - - -
    -

    - SiSU outputs html, two current standard forms available are: -

    -
    -

    -   154 -

    -
    - - - - -
    -

    - css based -

    -
    -

    -   155 -

    -
    - - - - -
    -

    - and -

    -
    -

    -   156 -

    -
    - - - - -
    -

    - table based [largely discontinued ]  45  -

    -
    -

    -   157 -

    -
    - - - - -
    -

    - The html is tested across several browsers -

    -
    -

    -   158 -

    -
    - - - - -
    -

    - I like to remind you that there are other excellent browsers out there, many of which have long supported practical features like tabbing. -

    -
    -

    -   159 -

    -
    - - - - -
    -

    - The html is tested across several browsers, including: -

    -
    -

    -   160 -

    -
    - - - - -
    -
  • - Firefox (Mozilla-Firefox)   46  -
  • -
    -

    -   161 -

    -
    - - - - -
    -
  • - Kazehakase   47  -
  • -
    -

    -   162 -

    -
    - - - - -
    -
  • - Konqueror   48  -
  • -
    -

    -   163 -

    -
    - - - - -
    -
  • - Mozilla   49  -
  • -
    -

    -   164 -

    -
    - - - - -
    -
  • - MS Internet Explorer   50  -
  • -
    -

    -   165 -

    -
    - - - - -
    -
  • - Netscape   51  -
  • -
    -

    -   166 -

    -
    - - - - -
    -
  • - Opera   52  -
  • -
    -

    -   167 -

    -
    - - - - -
    -

    - Also lighter weight graphical browsers: -

    -
    -

    -   168 -

    -
    - - - - -
    -
  • - Dillo   53  -
  • -
    -

    -   169 -

    -
    - - - - -
    -
  • - Epiphany   54  -
  • -
    -

    -   170 -

    -
    - - - - -
    -
  • - Galeon   55  -
  • -
    -

    -   171 -

    -
    - - - - -
    -

    - And for console/text browsing: -

    -
    -

    -   172 -

    -
    - - - - -
    -
  • - elinks   56  -
  • -
    -

    -   173 -

    -
    - - - - -
    -
  • - links2   57  -
  • -
    -

    -   174 -

    -
    - - - - -
    -
  • - w3m   58  -
  • -
    -

    -   175 -

    -
    - - - - -
    -

    - The html tables output is rendered more accurately across a wider variety set and older versions of browsers (than the html css output). -

    -
    -

    -   176 -

    -
    - - - - -
    -
    - - 1.15.2 XML -
    -
    -

    -   177 -

    -
    - - - - -
    -

    - SiSU generates well formed XML, and multiple versions. An XML SAX version with a flat/shallow structure, and XML DOM version with a deeper (embedded) structure. There is also a released working xhtml module. Examples of SAX and DOM versions are provided within this document. -

    -
    -

    -   178 -

    -
    - - - - -
    -
    - - 1.15.3 ODT:ODF, Open Document Format - ISO/IEC 26300:2006 -
    -
    -

    -   179 -

    -
    - - - - -
    -

    - SiSU generates Open Document Output format. -

    -
    -

    -   180 -

    -
    - - - - -
    -
    - - 1.15.4 PDF - portrait and landscape, (through the generation of LaTeX output which is then transformed to pdf) -
    -
    -

    -   181 -

    -
    - - - - -
    -

    - SiSU outputs LaTeX if required which is easily transformed to PDF.  59  PDF documents are generated on the site from the same source files and Ruby program that produce html. Landscape oriented pdf introduced, providing easier screen viewing, they are also (paper saving, being currently) formatted to have fewer pages than their portrait equivalents. -

    -
    -

    -   182 -

    -
    - - - - -
    -
  • - Adobe Reader   60  -
  • -
    -

    -   183 -

    -
    - - - - -
    -
  • - Evince   61  -
  • -
    -

    -   184 -

    -
    - - - - -
    -
  • - xpdf   62  -
  • -
    -

    -   185 -

    -
    - - - - -
    -
    - - 1.15.5 Search - loading/populating of relational database while retaining document structure information, object citation numbering and other features (currently PostgreSQL and/or SQLite) -
    -
    -

    -   186 -

    -
    - - - - -
    -

    - SiSU (from the same markup input file) automatically feeds into PostgreSQL  63  and/or SQLite  64  database (could be any other of the better relational databases)  65  - together with all additional information related to document structure, and the alternative ways in which it is generated on the site retained. As regards scaling of the database, it is as scalable as the database (here Postgresql or SQLite) and hardware allow. I will prune the images later. -

    -
    -

    -   187 -

    -
    - - - - -
    -

    - This is one of the more interesting output forms, as all the structural data for the documents are retained (though can be ignored by the user of the database should they so choose). All site texts/documents are (currently) streamed to four pgsql database tables: -

    -
    -

    -   188 -

    -
    - - - - -
    -
  • - one containing semantic (and other) headers, including, title, author, subject, (the Dublin Core...); -
  • -
    -

    -   189 -

    -
    - - - - -
    -
  • - another the substantive texts by individual "paragraph" (or object) - along with structural information, each paragraph being identifiable by its paragraph number (if it has one which almost all of them do), and the substantive text of each paragraph quite naturally being searchable (both in formatted and clean text versions for searching); and -
  • -
    -

    -   190 -

    -
    - - - - -
    -
  • - a third containing endnotes cross-referenced back to the paragraph from which they are referenced (both in formatted and clean text versions for searching). -
  • -
    -

    -   191 -

    -
    - - - - -
    -
  • - a fourth table with a one to one relation with the headers table contains full text versions of output, eg. pdf, html, xml, and ascii. -
  • -
    -

    -   192 -

    -
    - - - - -
    -

    - There is of course the possibility to add further structures. -

    -
    -

    -   193 -

    -
    - - - - -
    -

    - At this level SiSU loads a relational database with documents broken in to their smallest logical structurally constituent parts, as text objects, with their object citation number and all other structural information needed to construct the structured document. Text is stored (at this text object level) with and without elementary markup tagging, the stripped version being so as to facilitate ease of searching. -

    -
    -

    -   194 -

    -
    - - - - -
    -

    - Because the document structure of sites created is clearly defined, and the text object citation system is available for all forms of output, it is possible to search the sql database, and either read results from that database, or just as simply map the results to the html output, which has richer text markup. -

    -
    -

    -   195 -

    -
    - - - - -
    -

    - The combination of the SiSU citation system with a relational database is pretty powerful, giving rise to several possibilities. As individual text objects of a document stored (and indexed) together with object numbers, and all versions of the document have the same numbering, complex searches can be tailored to return just the locations of the search results relevant for all available output formats, with live links to the precise locations in the database or in html/xml documents; or, the structural information provided makes it possible to search the full contents of the database and have headings in which search content appears, or to search only headings etc. (as the Dublin Core is incorporated it is easy to make use of that as well). -

    -
    -

    -   196 -

    -
    - - - - -
    -

    - This is a larger scale project, (with little development on the front end largely ignored), though the "infrastructure" has been in place since 2002. -

    -
    -

    -   197 -

    -
    - - - - -
    -
    - - 1.15.6 Search - database frontend sample, utilising database and SiSU features, including object citation numbering (backend currently PostgreSQL) -
    -
    -

    -   198 -

    -
    - - - - -
    -

    - Sample search frontend   66  A small database and sample query front-end (search from) that makes use of the citation system, object citation numbering to demonstrates functionality.  67  -

    -
    -

    -   199 -

    -
    - - - - -
    -

    - SiSU can provide information on which documents are matched and at what locations within each document the matches are found. These results are relevant across all outputs using object citation numbering, which includes html, XML, LaTeX, PDF and indeed the SQL database. You can then refer to one of the other outputs or in the SQL database expand the text within the matched objects (paragraphs) in the documents matched. -

    -
    -

    -   200 -

    -
    - - - - -
    -

    - (further work needs to be done on the sample search form, which is rudimentary and only passes simple booleans correctly at present to the SQL engine) -

    -
    -

    -   201 -

    -
    - - - - -
    -

    - A few canned searches, showing object numbers. Search for: -

    -
    -

    -   202 -

    -
    - - - - -
    -

    - English documents matching Linux OR Debian -

    -
    -

    -   203 -

    -
    - - - - -
    -

    - GPL OR Richard Stallman -

    -
    -

    -   204 -

    -
    - - - - -
    -

    - invention OR innovation in English language -

    -
    -

    -   205 -

    -
    - - - - -
    -

    - copyright in English language documents -

    -
    -

    -   206 -

    -
    - - - - -
    -

    - Note that the searches done in this form are case sensitive. -

    -
    -

    -   207 -

    -
    - - - - -
    -

    - Expand those same searches, showing the matching text in each document: -

    -
    -

    -   208 -

    -
    - - - - -
    -

    - English documents matching Linux OR Debian -

    -
    -

    -   209 -

    -
    - - - - -
    -

    - GPL OR Richard Stallman -

    -
    -

    -   210 -

    -
    - - - - -
    -

    - invention OR innovation in English language -

    -
    -

    -   211 -

    -
    - - - - -
    -

    - copyright in English language documents -

    -
    -

    -   212 -

    -
    - - - - -
    -

    - Note you may set results either for documents matched and object number locations within each matched document meeting the search criteria; or display the names of the documents matched along with the objects (paragraphs) that meet the search criteria.  68  -

    -
    -

    -   213 -

    -
    - - - - -
    -

    - OCN index mode, (object citation number) the numbers displayed are relevant (and may be used to reference the match) in any sisu generated rendition of the text  69  the links provided are to the locations of matches within the html generated by SiSU. -

    -
    -

    -   214 -

    -
    - - - - -
    -

    - Paragraph mode, you may alternatively display the text of each paragraph in which the match was made, again the object/paragraph numbers are relevant to any SiSU generated/published text. -

    -
    -

    -   215 -

    -
    - - - - -
    -

    - Several options for output - select database to search, show results in index view (links to locations within text), show results with text, echo search in form, show what was searched, create and show a "canned url" for search, show available search fields. Also shows counters number of documents in which found and number of locations within documents where found. [could consider sorting by document with most occurrences of the search result]. -

    -
    -

    -   216 -

    -
    - - - - -
    -

    - Earlier version of the search frontend - Simple search, results with files in which search found, and locations where found within files. -

    -
    -

    -   217 -

    -
    - - - - -
    -

    - Simple search, results with files in which search found, and text object (paragraph or endnote) where found within files. -

    -
    -

    -   218 -

    -
    - - - - -
    -
    - - 1.15.7 Other forms -
    -
    -

    -   219 -

    -
    - - - - -
    -

    - There are other forms as well, YAML file, Ruby Marshal dumps, document pre-processing (processing of documents prior to the steps described here, to produce input suitable for the program) snap in a new module as required/desired, well formed XML, no problem. -

    -
    -

    -   220 -

    -
    - - - - -
    -
    - - 1.16 Concordance / Word Map or rudimentary index -
    -
    -

    -   221 -

    -
    - - - - -
    -

    - Concordance /WordMaps:  70  SiSU produces a rudimentary index based on the words within the text, making use of paragraph numbers to identify text locations. This is generated in html and hyper-linked but identifies these words locations in the other document formats. Though it is possible to search using a search engine, this is a means for browsing an alphabetical list of words which may suggest other useful content. -

    -
    -

    -   222 -

    -
    - - - - -
    -
    - - 1.17 Managed (document) directory, database, or site structure -
    -
    -

    -   223 -

    -
    - - - - -
    -

    - SiSU builds the web site (or more generically provides a suitable directory structure) - placing various output texts in the hierarchy of the web-site (or db), which (for directories) is a sub-directory with the name of the text file. -

    -
    -

    -   224 -

    -
    - - - - -
    -
    - - 1.18 Batch processing -
    -
    -

    -   225 -

    -
    - - - - -
    -

    - SiSU is a batch processing tool, handling and transforming multiple (or individual) documents (in many ways) with a single instruction. -

    -
    -

    -   226 -

    -
    - - - - -
    -
    - - 1.19 Integration to superior Gnu/Linux and Unix tools -
    -
    -

    -   227 -

    -
    - - - - -
    -

    - As should have been noted by the above description of SiSU, it makes use of existing programs found on Gnu /Linux and Unix, amongst those already mentioned include the LaTeX to pdf converters and the database PostgreSQL or SQLite. -

    -
    -

    -   228 -

    -
    - - - - -
    -
    - - 1.19.1 Backup and version control -
    -
    -

    -   229 -

    -
    - - - - -
    -

    - Unix provides many tools for version control. For documents Subversion, CVS and even the old RCS are useful for the per-document histories they provide. -

    -
    -

    -   230 -

    -
    - - - - -
    -

    - For writing code superior (more recent) version control system exist. These can also be used for documents though they tend to take stamps of changes across the repository as a whole, rather than for each individual file that is tracked, (as CVS and RCS do). My personal preference is for distributed systems such as Git, Mercurial or Darcs, of which I use Git for both code and documents. -

    -
    -

    -   231 -

    -
    - - - - -
    -

    - Several backup tools exist. At the base level I tend to use rdiff. -

    -
    -

    -   232 -

    -
    - - - - -
    -
    - - 1.19.2 Editor support -
    -
    -

    -   233 -

    -
    - - - - -
    -

    - SiSU documents are prepared / marked up in utf-8 text you are free to use the text editor of your choice. -

    -
    -

    -   234 -

    -
    - - - - -
    -

    - Syntax highlighting for a number of editors are provided. Amongst them Vim, Kwrite, Kate, Gedit and diakonos. These may be found with configuration instructions at <http://www.jus.uio.no/sisu/syntax_highlight>. Vim   71  as of version 7 has built in sytax highlighting for SiSU. -

    -
    -

    -   235 -

    -
    - - - - -
    -
    - - 1.20 Modular design, need something new add a module -
    -
    -

    -   236 -

    -
    - - - - -
    -

    - Need a new output format that does not already exist, write a new module. -

    -
    -

    -   237 -

    -
    - - - - -
    -

    - Prefer a new input syntax, you could write a new syntax matching the existing design, though my personal preference is some uniformity in entry appearance. If necessary has been fairly easy to extend the design parameters. It is intended to incorporate some additional basic semantic tagging, (book, article, author etc.) However, keeping the requirements for input minimal, and relatively simple has been a design goal. -

    -
    -

    -   238 -

    -
    - - - - -
    -

    - - Endnotes -

    -
    -

    -   0 -

    -
    - - - -
    - -

    -


    -

    -   -
    - - - - -
    -

    -  1. This information was first placed on the web 12 November 2002; with predating material taken from <http://www.jus.uio.no/lm/lm.information/toc.html> part of a site started and developed since 1993. See document metadata section <http://www.jus.uio.no/sisu/SiSU/metadata.html> for information on this version. Dates related to the development of SiSU are mostly contained within the Chronology section of this document, e.g. <http://www.jus.uio.no/sisu/sisu_chronology> -

    -
    -   -
    - - - - -
    -

    -  2. also chosen for the meaning of the Finnish term "sisu". -

    -
    -   -
    - - - - -
    -

    -  3. objects include: headings, paragraphs, verse, tables, images, but not footnotes/endnotes which are numbered separately and tied to the object from which they are referenced. -

    -
    -   -
    - - - - -
    -

    -  4. pdf via LaTeX or lout -

    -
    -   -
    - - - - -
    -

    -  5. currently html (two forms of html presentation one based on css the other on tables), and PHP; potentially structured XML -

    -
    -   -
    - - - - -
    -

    -  6. any SQL - currently PostgreSQL and sqlite (for portability, testing and development) -

    -
    -   -
    - - - - -
    -

    -  7. previously called "text object numbering" -

    -
    -   -
    - - - - -
    -

    -  8. SiSU Sabaki, release version. Pre-release version SiSU Scribe, and version prior to that SiSU nicknamed Scribbler. Pre-release versions go back several years. Both Scribbler and Scribe (still maintained) made system calls to SiSU's various parts, instead of using libraries. -

    -
    -   -
    - - - - -
    -

    -  9. A little universe it may be, but semantic you may have a hard time getting away with, given the meaning the word has taken on with markup. On a document wide basis semantic information may be provided, which can be really useful, (and meaningful, especially) if you have a large document set, and use this with rss feeds or in an sql database etc. On a markup level, I have little inclination to add semantic markup formally beyond references, title, author [Dublin Core entities? addresses?] etc. Actually this deserves a bit of thought possibly use letter tags (including letter alias/synonyms for font faces) to create a small set of default semantic tags, with the possibility for per document adjustments. Will seek to permit XML entity tagging, within SiSU markup and have that ignored/removed by the parts of the program that have no use for it. -

    -
    -   -
    - - - - -
    -

    -  10. "Sisu refers not to the courage of optimism, but to a concept of life that says, 'I may not win, but I will gladly give my life for what I believe.'" Aini Rajanen, Of Finnish Ways, 1981, p. 10.
    <http://www.humanlanguages.com/finnishenglish/rlfs.htm>
    "Every Finn has his own pet definition. To me, sisu means patience without passion. But there are many varieties of sisu. Sisu can be a sudden outburst or it can be the kind that lasts. A man can have both kinds. It is outside reason. It is something in the soul. It comes from oneself. For instance, it makes a soldier do things because he himself must, not because he has been told." Paavo Nurmi
    <http://personalweb.smcvt.edu/tmatikainen/finnishtraditions.htm> -

    -
    -   -
    - - - - -
    -

    -  11. <http://www.perl.org/> -

    -
    -   -
    - - - - -
    -

    -  12. <http://www.ruby-lang.org/en/> -

    -
    -   -
    - - - - -
    -

    -  13. <http://www.debian.org/> -

    -
    -   -
    - - - - -
    -

    -  14. where explicit structure is provided through the use of tagging headings, it could be reduced (still) further, for example by reducing the number of characters used to identify heading levels; but in many cases even that information is not required as regular expressions can be used to extract the implicit structure. -

    -
    -   -
    - - - - -
    -

    -  15. This proved to be the easiest way to develop syntax, changes could be made, or alternatives provided for the markup syntax whilst the intermediate markup syntax was largely held constant. There is actually an optional second intermediate markup format in YAML <http://www.yaml.org/> -

    -
    -   -
    - - - - -
    -

    -  16. objects include: headings, paragraphs, verse, tables, images, but not footnotes/endnotes which are numbered separately and tied to the object from which they are referenced. -

    -
    -   -
    - - - - -
    -

    -  17. where explicit structure is provided through the use of tagging headings, it could be reduced (still) further, for example by reducing the number of characters used to identify heading levels; but in many cases even that information is not required as regular expressions can be used to extract the implicit structure. -

    -
    -   -
    - - - - -
    -

    -  18. <http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst> output provided as example in the next section -

    -
    -   -
    - - - - -
    -

    -  19. <http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html> as it would appear with syntax highlighting (by vim) -

    -
    -   -
    - - - - -
    -

    -  20. seems there are several "smart ASCIIs" available, primarily for ascii to html conversion, that make this, and reasonable looking ascii their goal
    <http://webseitz.fluxent.com/wiki/SmartAscii>
    <http://daringfireball.net/projects/markdown/>
    <http://www.textism.com/tools/textile/> -

    -
    -   -
    - - - - -
    -

    -  21. These include richly laid out and linked html (table or css variants), PHP, LaTeX (from which pdf portrait and landscape documents are produced), texinfo (for info files etc.), and PostgreSQL and/or SQLite. And the opportunity to fairly easily build additional modules, such as XML. See the examples provided in this document. -

    -
    -   -
    - - - - -
    -

    -  22. <http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst> -

    -
    -   -
    - - - - -
    -

    -  23. <http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html> -

    -
    -   -
    - - - - -
    -

    -  24. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/toc.html> -

    -
    -   -
    - - - - -
    -

    -  25. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/doc.html> -

    -
    -   -
    - - - - -
    -

    -  26. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/landscape.pdf> -

    -
    -   -
    - - - - -
    -

    -  27. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/portrait.pdf> -

    -
    -   -
    - - - - -
    -

    -  28. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/plain.txt> -

    -
    -   -
    - - - - -
    -

    -  29. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/sax.xml> -

    -
    -   -
    - - - - -
    -

    -  30. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/dom.xml> -

    -
    -   -
    - - - - -
    -

    -  31. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/concordance.html> -

    -
    -   -
    - - - - -
    -

    -  32. discontinued for the time being -

    -
    -   -
    - - - - -
    -

    -  33. SiSU markup syntax, an incomplete summary: <http://www.jus.uio.no/sisu/sisu_markup_table/doc.html#h200306>
    Visual check of elementary font face modifiers: bold bold emphasis italics underscore strikethrough superscript subscript -

    -
    -   -
    - - - - -
    -

    -  34. the program that generates the documents -

    -
    -   -
    - - - - -
    -

    -  35. <http://www.jus.uio.no/sisu/SiSU/1.html#search> -

    -
    -   -
    - - - - -
    -

    -  36. <http://dublincore.org/> -

    -
    -   -
    - - - - -
    -

    -  37. <http://www.w3.org/RDF/> -

    -
    -   -
    - - - - -
    -

    -  38. The previous way was directory associations for file output were set up in the configuration file. The present system is a more natural way to work requireing less configuration. -

    -
    -   -
    - - - - -
    -

    -  39. from a version control system such as CVS -

    -
    -   -
    - - - - -
    -

    -  40. The version control system must be run, so the version number is obtained, prior to the SiSU document generation, and subsequent posting of the document. -

    -
    -   -
    - - - - -
    -

    -  41. e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), SQL (e.g. document set searches; representation of the constituent parts of documents based on their structure, headings, chapters, paragraphs as desired; control of use) -

    -
    -   -
    - - - - -
    -

    -  42. CISG <http://www.jus.uio.no/lm/un_contracts_international_sale_of_goods_convention_1980/doc>
    The Unidroit Contract Principles <http://www.jus.uio.no/lm/unidroit.contract.principles.1994/doc> or
    The Autonomous Contract <http://www.jus.uio.no/lm/autonomous.contract.2000.amissah/doc> -

    -
    -   -
    - - - - -
    -

    -  43. Scrolling is not however necessarily confined to full length documents as you will have to scroll to get to the bottom of any long segment (eg. chapter) of a segmented text. -

    -
    -   -
    - - - - -
    -

    -  44. CISG <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980>
    The Unidroit Principles <http://www.jus.uio.no/lm/unidroit.contract.principles.1994>
    The Autonomous Contract <http://www.jus.uio.no/sisu/the.autonomous.contract.2000.amissah> or
    WTA 1994 <http://www.jus.uio.no/lm/wta.1994> -

    -
    -   -
    - - - - -
    -

    -  45. formatting possibility still exists in code tree but maintenance has been largely discontinuted. -

    -
    -   -
    - - - - -
    -

    -  46. <http://www.mozilla.org/products/firefox/> -

    -
    -   -
    - - - - -
    -

    -  47. <http://kazehakase.sourceforge.jp/> -

    -
    -   -
    - - - - -
    -

    -  48. <http://www.konqueror.org/> -

    -
    -   -
    - - - - -
    -

    -  49. <http://www.mozilla.org/> -

    -
    -   -
    - - - - -
    -

    -  50. <http://www.microsoft.com/windows/ie/default.asp> -

    -
    -   -
    - - - - -
    -

    -  51. <http://home.netscape.com/comprod/mirror/client_download.html> -

    -
    -   -
    - - - - -
    -

    -  52. <http://www.opera.com/> -

    -
    -   -
    - - - - -
    -

    -  53. <http://www.dillo.org/> -

    -
    -   -
    - - - - -
    -

    -  54. <http://www.gnome.org/projects/epiphany/> -

    -
    -   -
    - - - - -
    -

    -  55. <http://galeon.sourceforge.net/> -

    -
    -   -
    - - - - -
    -

    -  56. <http://elinks.or.cz/> -

    -
    -   -
    - - - - -
    -

    -  57. <http://links.twibright.com/> -

    -
    -   -
    - - - - -
    -

    -  58. <http://w3m.sourceforge.net/> -

    -
    -   -
    - - - - -
    -

    -  59. LaTeX and pdf features introduced 18th June 2001, Landscape and portrait pdfs introduced 7th October 2001., Lout is a more recent addition 22th April 2003 -

    -
    -   -
    - - - - -
    -

    -  60. <http://www.adobe.com/products/acrobat/readstep2.html> -

    -
    -   -
    - - - - -
    -

    -  61. <http://www.gnome.org/projects/evince/> -

    -
    -   -
    - - - - -
    -

    -  62. <http://www.foolabs.com/xpdf/> -

    -
    -   -
    - - - - -
    -

    -  63. <http://www.postgresql.org/>
    <http://advocacy.postgresql.org/>
    <http://en.wikipedia.org/wiki/Postgresql> -

    -
    -   -
    - - - - -
    -

    -  64. <http://www.hwaci.com/sw/sqlite/>
    <http://en.wikipedia.org/wiki/Sqlite> -

    -
    -   -
    - - - - -
    -

    -  65. Relational database features retaining document structure and citation introduced 15th July 2002 -

    -
    -   -
    - - - - -
    -

    -  66. <http://search.sisudoc.org> -

    -
    -   -
    - - - - -
    -

    -  67. (which could be extended further with current back-end). As regards scaling of the database, it is as scalable as the database (here Postgresql) and hardware allow. -

    -
    -   -
    - - - - -
    -

    -  68. of this feature when demonstrated to an IBM software innovations evaluator in 2004 he said to paraphrase: this could be of interest to us. We have large document management systems, you can search hundreds of thousands of documents and we can tell you which documents meet your search criteria, but there is no way we can tell you without opening each document where within each your matches are found. -

    -
    -   -
    - - - - -
    -

    -  69. OCN are provided for HTML, XML, pdf ... though currently omitted in plain-text and opendocument format output -

    -
    -   -
    - - - - -
    -

    -  70. Concordance/ WordMaps introduced 15th August 2002 -

    -
    -   -
    - - - - -
    -

    -  71. <http://www.vim.org/> -

    -
    -   -
    - - - - -
    -

    - - Document Information (metadata) -

    -
    -

    -   0 -

    -
    - -

    - Metadata -

    - - -     - - - - - -
    -

    - <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/sisu_manifest.html> -

    -
    - - - -
    -

    - Dublin Core (DC) -

    -
    - - - -
    -

    - DC tags included with this document are provided here. -

    -
    - - - -
    -

    - DC Title: SiSU - Description -

    -
    - - - -
    -

    - DC Creator: Ralph Amissah -

    -
    - - - -
    -

    - DC Rights: Copyright (C) Ralph Amissah 2007, part of SiSU documentation, License GPL 3 -

    -
    - - - -
    -

    - DC Type: information -

    -
    - - - -
    -

    - DC Date created: 2002-11-12 -

    -
    - - - -
    -

    - DC Date issued: 2002-11-12 -

    -
    - - - -
    -

    - DC Date available: 2002-11-12 -

    -
    - - - -
    -

    - DC Date modified: 2007-08-30 -

    -
    - - - -
    -

    - DC Date: 2007-08-30 -

    -
    - - - -
    -

    - Version Information -

    -
    - - - -
    -

    - Sourcefile: sisu_description.sst -

    -
    - - - -
    -

    - Filetype: SiSU text 0.57 -

    -
    - - - -
    -

    - Sourcefile Digest, MD5(sisu_description.sst)= b89ccdad9f6d9c2260d8d383d6b35ccc -

    -
    - - - -
    -

    - Skin_Digest: MD5(/home/ralph/grotto/theatre/dbld/builds/sisu/sisu/data/doc/sisu/sisu_markup_samples/sisu_manual/_sisu/skin/doc/skin_sisu_manual.rb)= 20fc43cf3eb6590bc3399a1aef65c5a9 -

    -
    - - - -
    -

    - Generated -

    -
    - - - -
    -

    - Document (metaverse) last generated: Tue Sep 25 02:54:06 +0100 2007 -

    -
    - - - -
    -

    - Generated by: SiSU 0.59.1 of 2007w39/2 (2007-09-25) -

    -
    - - - -
    -

    - Ruby version: ruby 1.8.6 (2007-06-07 patchlevel 36) [i486-linux] -

    -
    - - - - - -
    - -
    SiSU -->
    -
    - - - - - - - - - - - - - - - -
    - - Full Text  scroll  - - - - - TOC linked  toc  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
    -
    -   -
    -

    - - - - - - - - -
    - -
    -

    - - SiSU - -

    -
    - -

    - Output generated by - - SiSU - - 0.59.1 2007-09-25 (2007w39/2) -
    - - SiSU - - Copyright © Ralph Amissah - 1997, current 2007. - All Rights Reserved. -
    - - SiSU - - is software for document structuring, publishing and search, -
    - - www.jus.uio.no/sisu - - and - - www.sisudoc.org - -
    - w3 since October 3 1993 - - ralph@amissah.com - -

    -
    -

    - SiSU using: -
    Standard SiSU markup syntax, -
    Standard SiSU meta-markup syntax, and the -
    Standard SiSU object citation numbering and system, (object/text positioning system) -
    - Copyright © Ralph Amissah 1997, current 2007. - All Rights Reserved. -

    -
    -

    - - GPLv3 - -

    -
    -

    - - SiSU - - is released under - GPLv3 - or later, - <http://www.gnu.org/licenses/gpl.html> -

    -
    -

    - SiSU, developed using - - Ruby - - on - - Debian/Gnu/Linux - - software infrastructure, - with the usual GPL (or OSS) suspects. -
    - Better - "performance, reliability, scalability, security & total cost of ownership" - [not to mention flexibility & choice] use of and adherence to open standards (where practical and fair) and it is software libre. -
    - Get With the Future - - Way Better! - -

    -
    - - - - - - - -

    - -
    -
    - - -
    - - -
    - - - - idx - txt - - -
    -
    - -
    -

    - - -SiSU manual - -

    - - - - - - - - - - - -
    -

    - - -SiSU - -

    - - - - - - - - - - - - - - - - -
    - - - - diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/dom.xml b/data/doc/manuals_generated/sisu_manual/sisu_description/dom.xml deleted file mode 100644 index bfb6556f..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/dom.xml +++ /dev/null @@ -1,2867 +0,0 @@ - - - - - - - - -
    - Title: - - SiSU - Description - -
    - -
    - Creator: - - Ralph Amissah - -
    - -
    - Rights: - - Copyright (C) Ralph Amissah 2007, part of SiSU documentation, License GPL 3 - -
    - -
    - Type: - - information - -
    - -
    - Subject: - - ebook, epublishing, electronic book, electronic publishing, electronic document, electronic citation, data structure, citation systems, search - -
    - -
    - Date created: - - 2002-11-12 - -
    - -
    - Date issued: - - 2002-11-12 - -
    - -
    - Date available: - - 2002-11-12 - -
    - -
    - Date modified: - - 2007-08-30 - -
    - -
    - Date: - - 2007-08-30 - -
    - - - - - - - - - - - - - - 1 - SiSU - Description,
    Ralph Amissah
    -
    -
    - - - - - 2 - SiSU an attempt to describe - - - - - - - 3 - 1 - - 1. Description - - - - - - - - - - 4 - 1.1 - - 1.1 Outline - - - - - - - - 5 - - - SiSU is a flexible document preparation, generation publishing and search system.1This information was first placed on the web 12 November 2002; with predating material taken from <http://www.jus.uio.no/lm/lm.information/toc.html> part of a site started and developed since 1993. See document metadata section <http://www.jus.uio.no/sisu/SiSU/metadata.html> for information on this version. Dates related to the development of SiSU are mostly contained within the Chronology section of this document, e.g. <http://www.jus.uio.no/sisu/sisu_chronology> - - - - - - - - 6 - - - SiSU ("SiSU information Structuring Universe" or "Structured information, Serialized Units"),2also chosen for the meaning of the Finnish term "sisu". is a Unix command line oriented framework for document structuring, publishing and search. Featuring minimalistic markup, multiple standard outputs, a common citation system, and granular search. - - - - - - - - 7 - - - Using markup applied to a document, SiSU can produce plain text, HTML, XHTML, XML, OpenDocument, LaTeX or PDF files, and populate an SQL database with objects3objects include: headings, paragraphs, verse, tables, images, but not footnotes/endnotes which are numbered separately and tied to the object from which they are referenced. (equating generally to paragraph-sized chunks) so searches may be performed and matches returned with that degree of granularity (e.g. your search criteria is met by these documents and at these locations within each document). Document output formats share a common object numbering system for locating content. This is particularly suitable for "published" works (finalized texts as opposed to works that are frequently changed or updated) for which it provides a fixed means of reference of content. - - - - - - - - 8 - - - SiSU is the data/information structuring and transforming tool, that has resulted from work on one of the oldest law web projects. It makes possible the one time, simple human readable markup of documents, that SiSU can then publish in various forms, suitable for paper4pdf via LaTeX or lout , web5currently html (two forms of html presentation one based on css the other on tables), and PHP; potentially structured XML and relational database6any SQL - currently PostgreSQL and sqlite (for portability, testing and development) presentations, retaining common data-structure and meta-information across the output/presentation formats. Several requirements of legal and scholarly publication on the web have been addressed, including the age old need to be able to reliably cite/pinpoint text within a document, to easily make footnotes/endnotes, to allow for semantic document meta-tagging, and to keep required markup to a minimum. These and other features of interest are listed and described below. A few points are worth making early (and will be repeated a number of times): - - - - - - - - 9 - - - (i) The SiSU document generator was the first to place material on the web with a system that makes possible citation across different document types, with paragraph, or rather object citation numbering7previously called "text object numbering" a text positioning system, available for the pinpointing of text, 1997, a simple idea from which much benefit, and SiSU remains today, to the best of my knowledge, the only multiple format e-book/ electronic-document system on the web that gives you this possibility (including for relational databases). - - - - - - - - 10 - - - (ii) Markup is done once for the multiple formats produced. - - - - - - - - 11 - - - (iii) Markup is simple, and human readable (with a little practice), in almost all cases there is less and simpler markup required than basic html. In any event the markup required is very much simpler than the html, LaTeX, [lout], structured XML, ODF (OpenDocument), PostgreSQL or SQLite feed etc. that you can have SiSU generate for you. - - - - - - - - 12 - - - (iv) SiSU is a batch processor, dealing with as many files as you need to generate at a time. - - - - - - - - 13 - - - (v) Scalability is dependent on your file system (in my case Reiserfs), the database (currently Postgresql and/or SQLite) and your hardware. - - - - - - - - 14 - - - SiSU Sabaki8SiSU Sabaki, release version. Pre-release version SiSU Scribe, and version prior to that SiSU nicknamed Scribbler. Pre-release versions go back several years. Both Scribbler and Scribe (still maintained) made system calls to SiSU's various parts, instead of using libraries. (or just SiSU) is the provisional name given to the software described here that helps structure documents for web and other publication. The name SiSU is a loose anagram for something along the lines of "SiSU is structuring unit", or "SiSU, information structuring unit" or the more descriptive "Structured information, Serialized Units" or "simple - information structuring unit" or the more descriptive "Structured information, Serialized Units" or what it may be directed towards "semantic and information structuring universe" ,9A little universe it may be, but semantic you may have a hard time getting away with, given the meaning the word has taken on with markup. On a document wide basis semantic information may be provided, which can be really useful, (and meaningful, especially) if you have a large document set, and use this with rss feeds or in an sql database etc. On a markup level, I have little inclination to add semantic markup formally beyond references, title, author [Dublin Core entities? addresses?] etc. Actually this deserves a bit of thought possibly use letter tags (including letter alias/synonyms for font faces) to create a small set of default semantic tags, with the possibility for per document adjustments. Will seek to permit XML entity tagging, within SiSU markup and have that ignored/removed by the parts of the program that have no use for it. tongue in cheek, only just. Guess I'll get away with "Simple - information Structuring Universe". SiSU is also a Finnish word roughly meaning guts, inner strength and perseverance.10"Sisu refers not to the courage of optimism, but to a concept of life that says, 'I may not win, but I will gladly give my life for what I believe.'" Aini Rajanen, Of Finnish Ways, 1981, p. 10.
    <http://www.humanlanguages.com/finnishenglish/rlfs.htm>
    "Every Finn has his own pet definition. To me, sisu means patience without passion. But there are many varieties of sisu. Sisu can be a sudden outburst or it can be the kind that lasts. A man can have both kinds. It is outside reason. It is something in the soul. It comes from oneself. For instance, it makes a soldier do things because he himself must, not because he has been told." Paavo Nurmi
    <http://personalweb.smcvt.edu/tmatikainen/finnishtraditions.htm>
    - -
    - - - - - - 15 - - - SiSU was born of the need to find a way, with minimal effort, and for as wide a range of document types as possible, to produce high quality publishing output in a variety of document formats. As such it was necessary to find a simple document representation that would work across a large number of document types, and the most convenient way(s) to produce acceptable output formats. The project leading to this program was started in 1993 (together with the trade law project now known as Lex Mercatoria) as an investigation of how to effectively/efficiently place documents on the web. The unified document handling, together with features such as paragraph numbering, endnote handling and tables... appeared in 1996/97. SiSU was originally written in Perl,11<http://www.perl.org/> and converted to Ruby, 12<http://www.ruby-lang.org/en/> in 2000, one of the most impressive programming languages in existence! In its current form it has been written to run on the Gnu /Linux platform, and in particular on Debian, 13<http://www.debian.org/> taking advantage of many of the wonderful projects that are available there. - - - - - - - - 16 - - - SiSU markup is based on requiring the minimum markup needed to determine the structure of a document. (This can be as little as saying in a header to look for the word Book at a specified level and the word Chapter at another level). SiSU then breaks a document into its smallest parts (at a heading, and paragraph level) while retaining all structural information. This break up of the document and information on its structure is taken advantage of in the transformations made in generating the very different output types that can be created, and in providing as much as can be for what each output type is best at doing, e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), ODF (OpenDocument [experimental]), SQL (e.g. document search; representing constituent parts of documents based on their structure, headings, chapters, paragraphs as required; user control).14where explicit structure is provided through the use of tagging headings, it could be reduced (still) further, for example by reducing the number of characters used to identify heading levels; but in many cases even that information is not required as regular expressions can be used to extract the implicit structure. - - - - - - - - 17 - - - From markup that is simpler and more sparse than html you get: - - - - - - - - 18 - - - far greater output possibilities, including html, XML, ODF (OpenDocument), LaTeX (pdf), and SQL; - - - - - - - - 19 - - - the advantages implicit in the very different output possibilities; - - - - - - - - 20 - - - a common citation system (for all outputs - including the relational database, search results are relevant for all outputs); - - - - - - - - 21 - - - For more see the short summary of features provided below. - - - - - - - - 22 - - - SiSU processes files with minimal tagging to produce various document outputs including html, LaTeX or lout (which is converted to pdf) and if required loads the structured information into an SQL database (PostgreSQL and SQLite have been used for this). SiSU produces an intermediate processing format.15This proved to be the easiest way to develop syntax, changes could be made, or alternatives provided for the markup syntax whilst the intermediate markup syntax was largely held constant. There is actually an optional second intermediate markup format in YAML <http://www.yaml.org/> - - - - - - - - 23 - - - SiSU is used in constructing Lex Mercatoria <http://lexmercatoria.org/> or <http://www.jus.uio.no/lm/> (one of the oldest law web sites), and considerable thought went into producing output that would be suitable for legal and academic writings (that do not have formulae) given the limitations of html, and publication in a wide variety of "formats", in particular in relation to the convenient and accurate citation of text. However, the construction of Lex Mercatoria uses only a fraction of the features available from SiSU today, vis generation of flat file structures, rather than in addition the building of ("granular") SQL database content, (at an object level with relevant relational tables, and other outputs also available). - - - - -
    - -
    - - - - - 24 - 1.2 - - 1.2 Short summary of features - - - - - - - - 25 - - - (i) markup syntax: (a) simpler than html, (b) mnemonic, influenced by mail/messaging/wiki markup practices, (c) human readable, and easily writable, - - - - - - - - 26 - - - (ii) (a) minimal markup requirement, (b) single file marked up for multiple outputs, - - - - - - - - 27 - - - notes: - - - - - - - - 28 - - - * documents are prepared in a single UTF-8 file using a minimalistic mnemonic syntax. Typical literature, documents like "War and Peace" require almost no markup, and most of the headers are optional. - - - - - - - - 29 - - - * markup is easily readable/parsed by the human eye, (basic markup is simpler and more sparse than the most basic html), [this may also be converted to XML representations of the same input/source document]. - - - - - - - - 30 - - - * markup defines document structure (this may be done once in a header pattern-match description, or for heading levels individually); basic text attributes (bold, italics, underscore, strike-through etc.) as required; and semantic information related to the document (header information, extended beyond the Dublin core and easily further extended as required); the headers may also contain processing instructions. - - - - - - - - 31 - - - (iii) (a) multiple outputs primarily industry established and institutionally accepted open standard formats, include amongst others: plaintext (UTF-8); html; (structured) XML; ODF (Open Document text)l; LaTeX; PDF (via LaTeX); SQL type databases (currently PostgreSQL and SQLite). Also produces: concordance files; document content certificates (md5 or sha256 digests of headings, paragraphs, images etc.) and html manifests (and sitemaps of content). (b) takes advantage of the strengths implicit in these very different output types, (e.g. PDFs produced using typesetting of LaTeX, databases populated with documents at an individual object/paragraph level, making possible granular search (and related possibilities)) - - - - - - - - 32 - - - (iv) outputs share a common numbering system (dubbed "object citation numbering" (ocn)) that is meaningful (to man and machine) across various digital outputs whether paper, screen, or database oriented, (PDF, html, XML, sqlite, postgresql), this numbering system can be used to reference content. - - - - - - - - 33 - - - (v) SQL databases are populated at an object level (roughly headings, paragraphs, verse, tables) and become searchable with that degree of granularity, the output information provides the object/paragraph numbers which are relevant across all generated outputs; it is also possible to look at just the matching paragraphs of the documents in the database; [output indexing also work well with search indexing tools like hyperesteier]. - - - - - - - - 34 - - - (vi) use of semantic meta-tags in headers permit the addition of semantic information on documents, (the available fields are easily extended) - - - - - - - - 35 - - - (vii) creates organised directory/file structure for (file-system) output, easily mapped with its clearly defined structure, with all text objects numbered, you know in advance where in each document output type, a bit of text will be found (e.g. from an SQL search, you know where to go to find the prepared html output or PDF etc.)... there is more; easy directory management and document associations, the document preparation (sub-)directory may be used to determine output (sub-)directory, the skin used, and the SQL database used, - - - - - - - - 36 - - - (viii) "Concordance file" wordmap, consisting of all the words in a document and their (text/ object) locations within the text, (and the possibility of adding vocabularies), - - - - - - - - 37 - - - (ix) document content certification and comparison considerations: (a) the document and each object within it stamped with an md5 hash making it possible to easily check or guarantee that the substantive content of a document is unchanged, (b)version control, documents integrated with time based source control system, default RCS or CVS with use of $Id: sisu_description.sst,v 1.25 2007/08/23 12:22:36 ralph Exp $ tag, which SiSU checks - - - - - - - - 38 - - - (x) SiSU's minimalist markup makes for meaningful "diffing" of the substantive content of markup-files, - - - - - - - - 39 - - - (xi) easily skinnable, document appearance on a project/site wide, directory wide, or document instance level easily controlled/changed, - - - - - - - - 40 - - - (xii) in many cases a regular expression may be used (once in the document header) to define all or part of a documents structure obviating or reducing the need to provide structural markup within the document, - - - - - - - - 41 - - - (xiii) prepared files may be batch process, documents produced are static files so this needs to be done only once but may be repeated for various reasons as desired (updated content, addition of new output formats, updated technology document presentations/representations) - - - - - - - - 42 - - - (xiv) possible to pre-process, which permits: the easy creation of standard form documents, and templates/term-sheets, or; building of composite documents (master documents) from other sisu marked up documents, or marked up parts, i.e. import documents or parts of text into a main document should this be desired - - - - - - - - 43 - - - there is a considerable degree of future-proofing, output representations are "upgradeable", and new document formats may be added. - - - - - - - - 44 - - - (xv) there is a considerable degree of future-proofing, output representations are "upgradeable", and new document formats may be added: (a) modular, (thanks in no small part to Ruby) another output format required, write another module.... (b) easy to update output formats (eg html, XHTML, LaTeX/PDF produced can be updated in program and run against whole document set), (c) easy to add, modify, or have alternative syntax rules for input, should you need to, - - - - - - - - 45 - - - (xvi) scalability, dependent on your file-system (ext3, Reiserfs, XFS, whatever) and on the relational database used (currently Postgresql and SQLite), and your hardware, - - - - - - - - 46 - - - (xvii) only marked up files need be backed up, to secure the larger document set produced, - - - - - - - - 47 - - - (xviii) document management, - - - - - - - - 48 - - - (xix) Syntax highlighting for SiSU markup is available for a number of text editors. - - - - - - - - 49 - - - (xx) remote operations: (a) run SiSU on a remote server, (having prepared sisu markup documents locally or on that server, i.e. this solution where sisu is installed on the remote server, would work whatever type of machine you chose to prepare your markup documents on), (b) generated document outputs may be posted by sisu to remote sites (using rsync/scp) (c)document source (plaintext utf-8) if shared on the net may be identified by its url and processed locally to produce the different document outputs. - - - - - - - - 50 - - - (xxi) document source may be bundled together (automatically) with associated documents (multiple language versions or master document with inclusions) and images and sent as a zip file called a sisupod, if shared on the net these too may be processed locally to produce the desired document outputs, these may be downloaded, shared as email attachments, or processed by running sisu against them, either using a url or the filename. - - - - - - - - 51 - - - (xxii) for basic document generation, the only software dependency is Ruby, and a few standard Unix tools (this covers plaintext, html, XML, ODF, LaTeX). To use a database you of course need that, and to convert the LaTeX generated to PDF, a LaTeX processor like tetex or texlive. - - - - - - - - 52 - - - as a developers tool it is flexible and extensible - - - - - - - - 53 - - - SiSU was developed in relation to legal documents, and is strong across a wide variety of texts (law, literature...). SiSU handles images but is not suitable for formulae/ statistics, or for technical writing at this time. - - - - - - - - 54 - - - SiSU has been developed and has been in use for several years. Requirements to cover a wide range of documents within its use domain have been explored. - - - - - - - - 55 - - - Some modules are more mature than others, the most mature being Html and LaTeX / pdf. PostgreSQL and search functions are useable and together with ocn unique (to the best of my knowledge). The XML output document set is "well formed" but largely proof of concept. - - - - - - - - - - - - 56 - 1.3 - - 1.3 How it works - - - - - - - - 57 - - - SiSU markup is fairly minimalistic, it consists of: a (largely optional) document header, made up of information about the document (such as when it was published, who authored it, and granting what rights) and any processing instructions; and markup within text which is related to document structure and typeface. SiSU must be able to discern the structure of a document, (text headings and their levels in relation to each other), either from information provided in the instruction header or from markup within the text (or from a combination of both). Processing is done against an abstraction of the document comprising of information on the document's structure and its objects,16objects include: headings, paragraphs, verse, tables, images, but not footnotes/endnotes which are numbered separately and tied to the object from which they are referenced. which the program serializes (providing the object numbers) and which are assigned hash sum values based on their content. This abstraction of information about document structure, objects, (and hash sums), provides considerable flexibility in representing documents different ways and for different purposes (e.g. search, document layout, publishing, content certification, concordance etc.), and makes it possible to take advantage of some of the strengths of established ways of representing documents, (or indeed to create new ones). - - - - - - - - - - - - 58 - 1.4 - - 1.4 Simple markup - - - - - - - - 59 - - - SiSU markup is based on requiring the minimum markup needed to determine the structure of a document. (This can be as little as saying in a header to look for the word Book at a specified level and the word Chapter at another level). SiSU then breaks a document into its smallest parts (at a heading, and paragraph level) while retaining all structural information. This break up of the document and information on its structure is taken advantage of in the transformations made in generating the very different output types that can be created, and in providing as much as can be for what each output type is best at doing, e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), ODF (OpenDocument), SQL (e.g. document search; representing constituent parts of documents based on their structure, headings, chapters, paragraphs as required; user control).17where explicit structure is provided through the use of tagging headings, it could be reduced (still) further, for example by reducing the number of characters used to identify heading levels; but in many cases even that information is not required as regular expressions can be used to extract the implicit structure. - - - - - - - - - - 60 - 1.4.1 - - 1.4.1 Sparse markup requirement, try to get the most out of markup - - - - - - - - 61 - - - One of its strengths is that very small amounts of initial tagging is required for the program to generate its output. - - - - - - - - 62 - - - This is a basic markup example: - - - - - - - - 63 - - - basic markup example, text file - an international convention 18<http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst> output provided as example in the next section - - - - - - - - 64 - - - view basic markup, as it would be highlighted by vim editor 19<http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html> as it would appear with syntax highlighting (by vim) - - - - - - - - 65 - - - Emphasis has been on simplicity and minimalism in markup requirements. Design philosophy is to try keep the amount of markup required low, for whatever has been determined to be acceptable output.20seems there are several "smart ASCIIs" available, primarily for ascii to html conversion, that make this, and reasonable looking ascii their goal
    <http://webseitz.fluxent.com/wiki/SmartAscii>
    <http://daringfireball.net/projects/markdown/>
    <http://www.textism.com/tools/textile/>
    - -
    - - - - - - 66 - - - SiSU's markup is more minimalistic and simpler than (the equivalent) html and for it, you get considerably more than just html, as this preparation gives you all available output formats, upon request. - - - - -
    - -
    - - - - - 67 - 1.4.2 - - 1.4.2 Single markup file provides multiple output formats - - - - - - - - 68 - - - For each document, there is only one (input, minimalistically marked up) file from which all the available output types are generated.21These include richly laid out and linked html (table or css variants), PHP, LaTeX (from which pdf portrait and landscape documents are produced), texinfo (for info files etc.), and PostgreSQL and/or SQLite. And the opportunity to fairly easily build additional modules, such as XML. See the examples provided in this document. - - - - - - - - 69 - - - Eg. the markup example: - - - - - - - - 70 - - - original text file - an international convention 22<http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst> - - - - - - - - 71 - - - view as syntax would be highlighted by vim editor 23<http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html> - - - - - - - - 72 - - - Produces the following output: - - - - - - - - 73 - - - Segmented html version of document 24<http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/toc.html> - - - - - - - - 74 - - - Full length html document 25<http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/doc.html> - - - - - - - - 75 - - - pdf landscape version of document 26<http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/landscape.pdf> - - - - - - - - 76 - - - pdf portrait version of document 27<http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/portrait.pdf> - - - - - - - - 77 - - - clean tex ascii version of document 28<http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/plain.txt> - - - - - - - - 78 - - - xml sax version of document 29<http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/sax.xml> - - - - - - - - 79 - - - xml dom version of document 30<http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/dom.xml> - - - - - - - - 80 - - - Concordance 31<http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/concordance.html> - - - - - - - - 81 - - - (and in addition to these: PostgreSQL, SQLite, texinfo and YAML 32discontinued for the time being versions if desired) - - - - - - - - - - - - 82 - 1.4.3 - - 1.4.3 Syntax relatively easy to read and remember - - - - - - - - 83 - - - Syntax is kept simple and mnemonic.33SiSU markup syntax, an incomplete summary: <http://www.jus.uio.no/sisu/sisu_markup_table/doc.html#h200306>
    Visual check of elementary font face modifiers: bold bold emphasis italics underscore strikethrough superscript subscript
    - -
    - - -
    - -
    - - - - - 84 - 1.4.4 - - 1.4.4 Kept simple by having a limited publishing feature set, and features identified as most important, are available across several document types - - - - - - - - 85 - - - To keep SiSU markup sparse and simple SiSU deliberately provides a limited publishing feature set, including: indent levels; bold; italics; superscript; subscript; simple tables; images; tables of contents and; endnotes. Which in most cases are available across the different output formats. - - - - - - - - 86 - - - The publishing feature set may be expanded as required. - - - - - - - - -
    - - - - - 87 - 1.5 - - 1.5 Designed with usability in mind - - - - - - - - 88 - - - Output is designed to be uniform, easy to read, navigate and cite. - - - - - - - - - - - - 89 - 1.6 - - 1.6 Code separate from content - - - - - - - - 90 - - - Code34the program that generates the documents is separated from content. This means that when changes are desired in the output presentation, the code that produces them, and not the marked up text data set (which could be thousands of documents) is modified. Separating code from content makes large scale changes to output appearance trivial, and permits the easy addition of new output modules. - - - - - - - - - - - - 91 - 1.7 - - 1.7 Object citation numbering, a text or object positioning / citation system - "paragraph" (or text object) numbering, that remains same and usable across all output formats by people and machine - - - - - - - - 92 - - - Object citation numbering is a simple object (text) positioning and cition system that is human relevant and machine useable, used by SiSU for all manner of presentations, and that is available for use in all text mappings. It is based on the automated sequential numbering of objects (roughly paragraphs, (headings, tables, verse) or other blocks of text or images etc.). The text positioning system (in which I claim copyright) is invaluable for publishing requiring the citing text across multiple output formats, and for the general mapping of text within a document: - - - - - - - - 93 - - - in html, html not being easily citeable (change font size, or use a different browser and the page on which specific text appears has changed), and - - - - - - - - 94 - - - across multiple formats being common to all output formats html/xml/pdf/sql output, - - - - - - - - 95 - - - the results of an sql search can just be "live" citation references to the documents in which the text is found, much like an index (see image examples provided). 35<http://www.jus.uio.no/sisu/SiSU/1.html#search> - - - - - - - - 96 - - - I claim copyright on the system I use which is the most basic of all, numbering all text in headings and paragraphs sequentially (with tables and images being treated as a single paragraph) and only footnotes/endnotes not following this numbering, as their position in text is not strictly determined, (a change from footnotes to endnotes would change their numbering), footnotes instead "belong" to the paragraph from which they are referenced, and have sequential numbers of their own. - - - - - - - - 97 - - - SiSU has a paragraph numbering system, that remains the same regardless of the output format. This provides an effective means of citation, pinpointing text accurately in all output formats, using the same reference. This is particularly useful where text has to be located across different output formats - for example once html is printed the number of pages and pages on which given text is found will vary depending on the browser, its settings the font size setting etc. Similarly SiSU produces pdf in different forms, eg. on the example site Lex Mercatoria as portrait and landscape documents - here too page numbering varies, but paragraph numbering is the same, vis a vis all versions of the text (portrait and landscape pdf and the html versions of the text, and as stored (with "paragraphs" as records) to the PostgreSQL or SQLite database). - - - - - - - - 98 - - - These numbers are placed in the text margins and are intended to be independent of and not to interfere with authors tagging. [The citation system (object citation numbering system, automated "paragraph numbering") which is automatically generated and is common and identical across all document formats] The paragraph numbering system is more accurately described as an (text) object numbering system, as headings are also numbered... all headings and paragraphs are numbered sequentially. Endnotes are automatically numbered independently and rather "belong" to the paragraph from which they are referenced, as an endnote does not (necessarily) form a part of a documents sequence, (they may be produced as either endnotes or footnotes (or both depending on what output you choose to look at - if you take the segmented html version document provided as an example, you will find that the endnotes are placed both at the end of each section, and in a separate section of their own called endnotes, and these are hyper-linked)). An attractive feature of providing citation numbering in this way is that it is independent of the document structure... it remains the same regardless of what is done about the document structure. - - - - - - - - 99 - - - The rules have been kept very simple, unique incremental object citation numbers are assigned to headings, paragraphs, verse, tables and images. It is possible to manually override this feature on a per heading or comment basis though this should be used exceptionally, it may be of use where there a substantive text, and the addition of a minor comment by the publisher that should not be mapped as part of the text. - - - - - - - - 100 - - - The object citation number markers contain additional numbering information with regard to the document structure, that can be used for alternative presentations, including such detail as the type of object (heading, paragraph, table, image, etc.), numbered sequentially. - - - - - - - - 101 - - - An advantage is that the numbering remains the same regardless of document structure. - - - - - - - - 102 - - - Text object ("paragraph") numbering is the same for all output versions of the same document, vis html, pdf, pgsql, yaml etc. - - - - - - - - 103 - - - In the relational database, as individual text objects of a document stored (and indexed) together with object numbers, and all versions of the document have the same numbering, the results of searches may be tailored just to provide the location of the search result in all available document formats. - - - - - - - - 104 - - - Note: there is a bug in the released behaviour of object citation numbering, (not certain when it was introduced) tables should be numbered, ie each table gets an ocn, required amongst other things for relational database. This will be corrected in a future release. Citation numbering of existing documents that contain tables will changed. - - - - - - - - - - - - 105 - 1.8 - - 1.8 Handling of Dublin Core meta-tags making use of the Resource Description Framework - - - - - - - - 106 - - - SiSU is able to use meta tags based on the Dublin Core36<http://dublincore.org/> and Resource Description Framework37<http://www.w3.org/RDF/> - - - - - - - - 107 - - - This provides the means of providing semantic information about a document, both as computer processable meta-tags, and as human readable information that may be of value for classification purposes. - - - - - - - - 108 - - - This information is provided both in html metatags, and (where available) under the section titled "Document Information - MetaData", near the end of a document, for example in the segmented html version of this text at: <http://www.jus.uio.no/sisu/SiSU/metadata.html> - - - - - - - - - - - - 109 - 1.9 - - 1.9 Easy directory management - - - - - - - - 110 - - - 1. Directory file association, skins and special image management, made simpler.38The previous way was directory associations for file output were set up in the configuration file. The present system is a more natural way to work requireing less configuration. - - - - - - - - 111 - - - The last part of the name of the work directory in which markup is being done, or rather from where SiSU is run in order to generate document output, is used in determining the sub-directory name for output files, that is created in the document output directory. This provides a rather easy way to associate documents e.g. of a given subject, or by owner. - - - - - - - - 112 - - - - -      /www/docs
             /intellectual_property
             /arbitration
             /contract_law

         /www/docs
             /ralph
             /sisu     - -
    - -
    - - - - - - 113 - - - all are placed in their own directories within the directory structure created. Similar rules are used in the creation of sql type databases (though they can be overridden). - - - - - - - - 114 - - - There are a couple of further associations with these directories. - - - - - - - - 115 - - - Directory wide skins. - - - - - - - - 116 - - - Directory specific images. - - - - - - - - 117 - - - 2. If there is a "directory skin", that is a skin of the same name as the directory, it is used in the generation of the documents within it, rather than the default skin, unless the document has a specific skin associated with it. - - - - - - - - 118 - - - a. default skin (always available) - - - - - - - - 119 - - - b. directory skin (precedence over default if exists) - - - - - - - - 120 - - - c. document skin (takes precedence wherever document requests a specific skin) - - - - - - - - 121 - - - Skins are defined in the document skin directory and if a directory association is desired a softlink made to the relevant skin. Skins (directory association auto load) auto load skin if a directory skin exists of same name as directory stub, (and there is no specific doc skin) - - - - - - - - 122 - - - 3. If the working directory has within it a sub-directory called image_local, the images within that directory are used for references to images, that are not part of the default site build. - - - - -
    - -
    - - - - - 123 - 1.10 - - 1.10 Document Version Control Information - - - - - - - - 124 - - - The possibility of citing an exact document version. - - - - - - - - 125 - - - Permits the inclusion of document version control information to the document body and metatags.39from a version control system such as CVS This provides a much more certain method of referring to the exact version of a particular document, (assuming that the document is from a trusted source, that will retain earlier versions of a document).40The version control system must be run, so the version number is obtained, prior to the SiSU document generation, and subsequent posting of the document. - - - - - - - - 126 - - - This information (where available) is provided under the section of the document titled "Document Information - MetaData", near the end of a document, for example in the segmented html version of this text at: <http://www.jus.uio.no/sisu/SiSU/metadata.html> - - - - - - - - - - - - 127 - 1.11 - - 1.11 Table of contents - - - - - - - - 128 - - - SiSU produces a rudimentary a table of contents based on document headings. - - - - - - - - - - - - 129 - 1.12 - - 1.12 Auto-numbering of headings - - - - - - - - 130 - - - Headings can be automatically numbered, (and automatically named for hyper-linking) - - - - - - - - - - - - 131 - 1.13 - - 1.13 Numbering and cross-hyperlinking of endnotes - - - - - - - - 132 - - - SiSU can automatically number footnotes/endnotes. This is the default operation where no number is provided. - - - - - - - - 133 - - - Footnotes/endnotes may also be manually numbered. Where a number, or numbers are provided for a footnote/endnote, this does not increment the automatic footnote/endnote number counter. - - - - - - - - 134 - - - In the html output footnotes/endnotes are cross-hyper-linked (to their reference point and vice versa). In th pdf output footnotes are linked from their reference point only. - - - - - - - - - - - - 135 - 1.14 - - 1.14 "Skinnable" - - - - - - - - 136 - - - SiSU is skinnable, on a site-wide, directory-wide and per document basis, so different looking versions of things may be produced with little difficulty. There is a default skin which may be modified, as the background site skin, and each working directory may have a skin associated with it, as may each individual document. The hierarchy of application is document, directory, then site... ie if a document skin exists it gets precedence. - - - - - - - - 137 - - - Whilst it is skinnable, the default output styles are selected to work across the widest possible range of document types. - - - - - - - - - - - - 138 - 1.15 - - 1.15 Multiple Outputs - - - - - - - - 139 - - - From markup that is simpler and more sparse than html you get: - - - - - - - - 140 - - - far greater output possibilities, including multiple html types, XML (different structured types), LaTeX (pdf landscape, portrait), and SQL (Postgresql or SQLite or other); - - - - - - - - 141 - - - the advantages implicit in these very different output possibilities;41e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), SQL (e.g. document set searches; representation of the constituent parts of documents based on their structure, headings, chapters, paragraphs as desired; control of use) - - - - - - - - 142 - - - a common citation system - - - - - - - - 143 - - - As many output formats/presentations as one cares to write modules for - several types of html (e.g. structure based on css, or structure based on tables); LaTeX/pdf and Lout/pdf; pgsql other databases easily added; yaml... - - - - - - - - - - 144 - 1.15.1 - - 1.15.1 html - several presentations: full length & segmented; css & table based - - - - - - - - 145 - - - Most documents are produced in single and segmented html versions, described below: - - - - - - - - 146 - - - The Scroll (full length text presentations) - - - - - - - - 147 - - - The full length of the text in a single scrollable document.42CISG <http://www.jus.uio.no/lm/un_contracts_international_sale_of_goods_convention_1980/doc>
    The Unidroit Contract Principles <http://www.jus.uio.no/lm/unidroit.contract.principles.1994/doc> or
    The Autonomous Contract <http://www.jus.uio.no/lm/autonomous.contract.2000.amissah/doc>
    As a rule the files they are saved in are named: doc or more precisely doc.html
    - -
    - - - - - - 148 - - - For various reasons texts may only be provided in this form (such as this one which is short), though most are also provided as segmented texts. - - - - - - - - 149 - - - "Scroll" is a reference to the historical scroll, a single long document/ parchment, and also no doubt to what you will have to do to get to the bottom of the text.43Scrolling is not however necessarily confined to full length documents as you will have to scroll to get to the bottom of any long segment (eg. chapter) of a segmented text. - - - - - - - - 150 - - - The Segmented Text - - - - - - - - 151 - - - The text divided into segments (such as articles or chapters depending on the text)44CISG <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980>
    The Unidroit Principles <http://www.jus.uio.no/lm/unidroit.contract.principles.1994>
    The Autonomous Contract <http://www.jus.uio.no/sisu/the.autonomous.contract.2000.amissah> or
    WTA 1994 <http://www.jus.uio.no/lm/wta.1994>
    As a rule the files they are saved in are named: toc and index or more precisely toc.html and index.html
    - -
    - - - - - - 152 - - - If you know exactly what you are looking for, loading a segment of text is faster (the segments being smaller). Occasionally longer documents such as the WTA 1994 <http://www.jus.uio.no/lm/wta.1994/toc> are only provided in segmented form. - - - - - - - - 153 - - - Cascading Style Sheet, and Table based html - - - - - - - - 154 - - - SiSU outputs html, two current standard forms available are: - - - - - - - - 155 - - - css based - - - - - - - - 156 - - - and - - - - - - - - 157 - - - table based [largely discontinued ]45formatting possibility still exists in code tree but maintenance has been largely discontinuted. - - - - - - - - 158 - - - The html is tested across several browsers - - - - - - - - 159 - - - I like to remind you that there are other excellent browsers out there, many of which have long supported practical features like tabbing. - - - - - - - - 160 - - - The html is tested across several browsers, including: - - - - - - - - 161 - - - Firefox (Mozilla-Firefox) 46<http://www.mozilla.org/products/firefox/> - - - - - - - - 162 - - - Kazehakase 47<http://kazehakase.sourceforge.jp/> - - - - - - - - 163 - - - Konqueror 48<http://www.konqueror.org/> - - - - - - - - 164 - - - Mozilla 49<http://www.mozilla.org/> - - - - - - - - 165 - - - MS Internet Explorer 50<http://www.microsoft.com/windows/ie/default.asp> - - - - - - - - 166 - - - Netscape 51<http://home.netscape.com/comprod/mirror/client_download.html> - - - - - - - - 167 - - - Opera 52<http://www.opera.com/> - - - - - - - - 168 - - - Also lighter weight graphical browsers: - - - - - - - - 169 - - - Dillo 53<http://www.dillo.org/> - - - - - - - - 170 - - - Epiphany 54<http://www.gnome.org/projects/epiphany/> - - - - - - - - 171 - - - Galeon 55<http://galeon.sourceforge.net/> - - - - - - - - 172 - - - And for console/text browsing: - - - - - - - - 173 - - - elinks 56<http://elinks.or.cz/> - - - - - - - - 174 - - - links2 57<http://links.twibright.com/> - - - - - - - - 175 - - - w3m 58<http://w3m.sourceforge.net/> - - - - - - - - 176 - - - The html tables output is rendered more accurately across a wider variety set and older versions of browsers (than the html css output). - - - - -
    - -
    - - - - - 177 - 1.15.2 - - 1.15.2 XML - - - - - - - - 178 - - - SiSU generates well formed XML, and multiple versions. An XML SAX version with a flat/shallow structure, and XML DOM version with a deeper (embedded) structure. There is also a released working xhtml module. Examples of SAX and DOM versions are provided within this document. - - - - - - - - - - - - 179 - 1.15.3 - - 1.15.3 ODT:ODF, Open Document Format - ISO/IEC 26300:2006 - - - - - - - - 180 - - - SiSU generates Open Document Output format. - - - - - - - - - - - - 181 - 1.15.4 - - 1.15.4 PDF - portrait and landscape, (through the generation of LaTeX output which is then transformed to pdf) - - - - - - - - 182 - - - SiSU outputs LaTeX if required which is easily transformed to PDF.59LaTeX and pdf features introduced 18th June 2001, Landscape and portrait pdfs introduced 7th October 2001., Lout is a more recent addition 22th April 2003 PDF documents are generated on the site from the same source files and Ruby program that produce html. Landscape oriented pdf introduced, providing easier screen viewing, they are also (paper saving, being currently) formatted to have fewer pages than their portrait equivalents. - - - - - - - - 183 - - - Adobe Reader 60<http://www.adobe.com/products/acrobat/readstep2.html> - - - - - - - - 184 - - - Evince 61<http://www.gnome.org/projects/evince/> - - - - - - - - 185 - - - xpdf 62<http://www.foolabs.com/xpdf/> - - - - - - - - - - - - 186 - 1.15.5 - - 1.15.5 Search - loading/populating of relational database while retaining document structure information, object citation numbering and other features (currently PostgreSQL and/or SQLite) - - - - - - - - 187 - - - SiSU (from the same markup input file) automatically feeds into PostgreSQL63<http://www.postgresql.org/>
    <http://advocacy.postgresql.org/>
    <http://en.wikipedia.org/wiki/Postgresql>
    and/or SQLite64<http://www.hwaci.com/sw/sqlite/>
    <http://en.wikipedia.org/wiki/Sqlite>
    database (could be any other of the better relational databases)65Relational database features retaining document structure and citation introduced 15th July 2002 - together with all additional information related to document structure, and the alternative ways in which it is generated on the site retained. As regards scaling of the database, it is as scalable as the database (here Postgresql or SQLite) and hardware allow. I will prune the images later.
    - -
    - - - - - - 188 - - - This is one of the more interesting output forms, as all the structural data for the documents are retained (though can be ignored by the user of the database should they so choose). All site texts/documents are (currently) streamed to four pgsql database tables: - - - - - - - - 189 - - - one containing semantic (and other) headers, including, title, author, subject, (the Dublin Core...); - - - - - - - - 190 - - - another the substantive texts by individual "paragraph" (or object) - along with structural information, each paragraph being identifiable by its paragraph number (if it has one which almost all of them do), and the substantive text of each paragraph quite naturally being searchable (both in formatted and clean text versions for searching); and - - - - - - - - 191 - - - a third containing endnotes cross-referenced back to the paragraph from which they are referenced (both in formatted and clean text versions for searching). - - - - - - - - 192 - - - a fourth table with a one to one relation with the headers table contains full text versions of output, eg. pdf, html, xml, and ascii. - - - - - - - - 193 - - - There is of course the possibility to add further structures. - - - - - - - - 194 - - - At this level SiSU loads a relational database with documents broken in to their smallest logical structurally constituent parts, as text objects, with their object citation number and all other structural information needed to construct the structured document. Text is stored (at this text object level) with and without elementary markup tagging, the stripped version being so as to facilitate ease of searching. - - - - - - - - 195 - - - Because the document structure of sites created is clearly defined, and the text object citation system is available for all forms of output, it is possible to search the sql database, and either read results from that database, or just as simply map the results to the html output, which has richer text markup. - - - - - - - - 196 - - - The combination of the SiSU citation system with a relational database is pretty powerful, giving rise to several possibilities. As individual text objects of a document stored (and indexed) together with object numbers, and all versions of the document have the same numbering, complex searches can be tailored to return just the locations of the search results relevant for all available output formats, with live links to the precise locations in the database or in html/xml documents; or, the structural information provided makes it possible to search the full contents of the database and have headings in which search content appears, or to search only headings etc. (as the Dublin Core is incorporated it is easy to make use of that as well). - - - - - - - - 197 - - - This is a larger scale project, (with little development on the front end largely ignored), though the "infrastructure" has been in place since 2002. - - - - -
    - -
    - - - - - 198 - 1.15.6 - - 1.15.6 Search - database frontend sample, utilising database and SiSU features, including object citation numbering (backend currently PostgreSQL) - - - - - - - - 199 - - - Sample search frontend 66<http://search.sisudoc.org> A small database and sample query front-end (search from) that makes use of the citation system, object citation numbering to demonstrates functionality.67(which could be extended further with current back-end). As regards scaling of the database, it is as scalable as the database (here Postgresql) and hardware allow. - - - - - - - - 200 - - - SiSU can provide information on which documents are matched and at what locations within each document the matches are found. These results are relevant across all outputs using object citation numbering, which includes html, XML, LaTeX, PDF and indeed the SQL database. You can then refer to one of the other outputs or in the SQL database expand the text within the matched objects (paragraphs) in the documents matched. - - - - - - - - 201 - - - (further work needs to be done on the sample search form, which is rudimentary and only passes simple booleans correctly at present to the SQL engine) - - - - - - - - 202 - - - A few canned searches, showing object numbers. Search for: - - - - - - - - 203 - - - English documents matching Linux OR Debian - - - - - - - - 204 - - - GPL OR Richard Stallman - - - - - - - - 205 - - - invention OR innovation in English language - - - - - - - - 206 - - - copyright in English language documents - - - - - - - - 207 - - - Note that the searches done in this form are case sensitive. - - - - - - - - 208 - - - Expand those same searches, showing the matching text in each document: - - - - - - - - 209 - - - English documents matching Linux OR Debian - - - - - - - - 210 - - - GPL OR Richard Stallman - - - - - - - - 211 - - - invention OR innovation in English language - - - - - - - - 212 - - - copyright in English language documents - - - - - - - - 213 - - - Note you may set results either for documents matched and object number locations within each matched document meeting the search criteria; or display the names of the documents matched along with the objects (paragraphs) that meet the search criteria.68of this feature when demonstrated to an IBM software innovations evaluator in 2004 he said to paraphrase: this could be of interest to us. We have large document management systems, you can search hundreds of thousands of documents and we can tell you which documents meet your search criteria, but there is no way we can tell you without opening each document where within each your matches are found. - - - - - - - - 214 - - - OCN index mode, (object citation number) the numbers displayed are relevant (and may be used to reference the match) in any sisu generated rendition of the text69OCN are provided for HTML, XML, pdf ... though currently omitted in plain-text and opendocument format output the links provided are to the locations of matches within the html generated by SiSU. - - - - - - - - 215 - - - Paragraph mode, you may alternatively display the text of each paragraph in which the match was made, again the object/paragraph numbers are relevant to any SiSU generated/published text. - - - - - - - - 216 - - - Several options for output - select database to search, show results in index view (links to locations within text), show results with text, echo search in form, show what was searched, create and show a "canned url" for search, show available search fields. Also shows counters number of documents in which found and number of locations within documents where found. [could consider sorting by document with most occurrences of the search result]. - - - - - - - - 217 - - - Earlier version of the search frontend - Simple search, results with files in which search found, and locations where found within files. - - - - - - - - 218 - - - Simple search, results with files in which search found, and text object (paragraph or endnote) where found within files. - - - - - - - - - - - - 219 - 1.15.7 - - 1.15.7 Other forms - - - - - - - - 220 - - - There are other forms as well, YAML file, Ruby Marshal dumps, document pre-processing (processing of documents prior to the steps described here, to produce input suitable for the program) snap in a new module as required/desired, well formed XML, no problem. - - - - - - - - -
    - - - - - 221 - 1.16 - - 1.16 Concordance / Word Map or rudimentary index - - - - - - - - 222 - - - Concordance /WordMaps:70Concordance/ WordMaps introduced 15th August 2002 SiSU produces a rudimentary index based on the words within the text, making use of paragraph numbers to identify text locations. This is generated in html and hyper-linked but identifies these words locations in the other document formats. Though it is possible to search using a search engine, this is a means for browsing an alphabetical list of words which may suggest other useful content. - - - - - - - - - - - - 223 - 1.17 - - 1.17 Managed (document) directory, database, or site structure - - - - - - - - 224 - - - SiSU builds the web site (or more generically provides a suitable directory structure) - placing various output texts in the hierarchy of the web-site (or db), which (for directories) is a sub-directory with the name of the text file. - - - - - - - - - - - - 225 - 1.18 - - 1.18 Batch processing - - - - - - - - 226 - - - SiSU is a batch processing tool, handling and transforming multiple (or individual) documents (in many ways) with a single instruction. - - - - - - - - - - - - 227 - 1.19 - - 1.19 Integration to superior Gnu/Linux and Unix tools - - - - - - - - 228 - - - As should have been noted by the above description of SiSU, it makes use of existing programs found on Gnu /Linux and Unix, amongst those already mentioned include the LaTeX to pdf converters and the database PostgreSQL or SQLite. - - - - - - - - - - 229 - 1.19.1 - - 1.19.1 Backup and version control - - - - - - - - 230 - - - Unix provides many tools for version control. For documents Subversion, CVS and even the old RCS are useful for the per-document histories they provide. - - - - - - - - 231 - - - For writing code superior (more recent) version control system exist. These can also be used for documents though they tend to take stamps of changes across the repository as a whole, rather than for each individual file that is tracked, (as CVS and RCS do). My personal preference is for distributed systems such as Git, Mercurial or Darcs, of which I use Git for both code and documents. - - - - - - - - 232 - - - Several backup tools exist. At the base level I tend to use rdiff. - - - - - - - - - - - - 233 - 1.19.2 - - 1.19.2 Editor support - - - - - - - - 234 - - - SiSU documents are prepared / marked up in utf-8 text you are free to use the text editor of your choice. - - - - - - - - 235 - - - Syntax highlighting for a number of editors are provided. Amongst them Vim, Kwrite, Kate, Gedit and diakonos. These may be found with configuration instructions at <http://www.jus.uio.no/sisu/syntax_highlight>. Vim 71<http://www.vim.org/> as of version 7 has built in sytax highlighting for SiSU. - - - - - - - - - - - - - - 236 - 1.20 - - 1.20 Modular design, need something new add a module - - - - - - - - 237 - - - Need a new output format that does not already exist, write a new module. - - - - - - - - 238 - - - Prefer a new input syntax, you could write a new syntax matching the existing design, though my personal preference is some uniformity in entry appearance. If necessary has been fairly easy to extend the design parameters. It is intended to incorporate some additional basic semantic tagging, (book, article, author etc.) However, keeping the requirements for input minimal, and relatively simple has been a design goal. - - - - - - - - -
    - - - - - 0 - endnotes - - Endnotes - - - - - - - -
    - -
    - - - -
    - diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/endnotes.html b/data/doc/manuals_generated/sisu_manual/sisu_description/endnotes.html deleted file mode 100644 index e0455d4f..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/endnotes.html +++ /dev/null @@ -1,1012 +0,0 @@ - - - - - - endnotes - - SiSU - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    - - - -
    - -
    SiSU -->
    -
    - - - - - - - - - - - - - - - - - -
    - - - TOC linked  toc  - - - - - - Full Text  scroll  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
    -
    - - - - - - -
    - - << previous - - - - TOC - - - - next >> - - -
    -
    - - - - -
    - - < - - - - ^ - - - - > - -
    - -
    -

    - SiSU -

    -

    - Description -

    -

    - Ralph Amissah - -

    - -
    -

    Endnotes

    - - -
    -

    -  1. This information was first placed on the web 12 November 2002; with predating material taken from <http://www.jus.uio.no/lm/lm.information/toc.html> part of a site started and developed since 1993. See document metadata section <http://www.jus.uio.no/sisu/SiSU/metadata.html> for information on this version. Dates related to the development of SiSU are mostly contained within the Chronology section of this document, e.g. <http://www.jus.uio.no/sisu/sisu_chronology> -

    -
      -
    - - -
    -

    -  2. also chosen for the meaning of the Finnish term "sisu". -

    -
      -
    - - -
    -

    -  3. objects include: headings, paragraphs, verse, tables, images, but not footnotes/endnotes which are numbered separately and tied to the object from which they are referenced. -

    -
      -
    - - -
    -

    -  4. pdf via LaTeX or lout -

    -
      -
    - - -
    -

    -  5. currently html (two forms of html presentation one based on css the other on tables), and PHP; potentially structured XML -

    -
      -
    - - -
    -

    -  6. any SQL - currently PostgreSQL and sqlite (for portability, testing and development) -

    -
      -
    - - -
    -

    -  7. previously called "text object numbering" -

    -
      -
    - - -
    -

    -  8. SiSU Sabaki, release version. Pre-release version SiSU Scribe, and version prior to that SiSU nicknamed Scribbler. Pre-release versions go back several years. Both Scribbler and Scribe (still maintained) made system calls to SiSU's various parts, instead of using libraries. -

    -
      -
    - - -
    -

    -  9. A little universe it may be, but semantic you may have a hard time getting away with, given the meaning the word has taken on with markup. On a document wide basis semantic information may be provided, which can be really useful, (and meaningful, especially) if you have a large document set, and use this with rss feeds or in an sql database etc. On a markup level, I have little inclination to add semantic markup formally beyond references, title, author [Dublin Core entities? addresses?] etc. Actually this deserves a bit of thought possibly use letter tags (including letter alias/synonyms for font faces) to create a small set of default semantic tags, with the possibility for per document adjustments. Will seek to permit XML entity tagging, within SiSU markup and have that ignored/removed by the parts of the program that have no use for it. -

    -
      -
    - - -
    -

    -  10. "Sisu refers not to the courage of optimism, but to a concept of life that says, 'I may not win, but I will gladly give my life for what I believe.'" Aini Rajanen, Of Finnish Ways, 1981, p. 10.
    <http://www.humanlanguages.com/finnishenglish/rlfs.htm>
    "Every Finn has his own pet definition. To me, sisu means patience without passion. But there are many varieties of sisu. Sisu can be a sudden outburst or it can be the kind that lasts. A man can have both kinds. It is outside reason. It is something in the soul. It comes from oneself. For instance, it makes a soldier do things because he himself must, not because he has been told." Paavo Nurmi
    <http://personalweb.smcvt.edu/tmatikainen/finnishtraditions.htm> -

    -
      -
    - - -
    -

    -  11. <http://www.perl.org/> -

    -
      -
    - - -
    -

    -  12. <http://www.ruby-lang.org/en/> -

    -
      -
    - - -
    -

    -  13. <http://www.debian.org/> -

    -
      -
    - - -
    -

    -  14. where explicit structure is provided through the use of tagging headings, it could be reduced (still) further, for example by reducing the number of characters used to identify heading levels; but in many cases even that information is not required as regular expressions can be used to extract the implicit structure. -

    -
      -
    - - -
    -

    -  15. This proved to be the easiest way to develop syntax, changes could be made, or alternatives provided for the markup syntax whilst the intermediate markup syntax was largely held constant. There is actually an optional second intermediate markup format in YAML <http://www.yaml.org/> -

    -
      -
    - - -
    -

    -  16. objects include: headings, paragraphs, verse, tables, images, but not footnotes/endnotes which are numbered separately and tied to the object from which they are referenced. -

    -
      -
    - - -
    -

    -  17. where explicit structure is provided through the use of tagging headings, it could be reduced (still) further, for example by reducing the number of characters used to identify heading levels; but in many cases even that information is not required as regular expressions can be used to extract the implicit structure. -

    -
      -
    - - -
    -

    -  18. <http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst> output provided as example in the next section -

    -
      -
    - - -
    -

    -  19. <http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html> as it would appear with syntax highlighting (by vim) -

    -
      -
    - - -
    -

    -  20. seems there are several "smart ASCIIs" available, primarily for ascii to html conversion, that make this, and reasonable looking ascii their goal
    <http://webseitz.fluxent.com/wiki/SmartAscii>
    <http://daringfireball.net/projects/markdown/>
    <http://www.textism.com/tools/textile/> -

    -
      -
    - - -
    -

    -  21. These include richly laid out and linked html (table or css variants), PHP, LaTeX (from which pdf portrait and landscape documents are produced), texinfo (for info files etc.), and PostgreSQL and/or SQLite. And the opportunity to fairly easily build additional modules, such as XML. See the examples provided in this document. -

    -
      -
    - - -
    -

    -  22. <http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst> -

    -
      -
    - - -
    -

    -  23. <http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html> -

    -
      -
    - - -
    -

    -  24. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/toc.html> -

    -
      -
    - - -
    -

    -  25. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/doc.html> -

    -
      -
    - - -
    -

    -  26. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/landscape.pdf> -

    -
      -
    - - -
    -

    -  27. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/portrait.pdf> -

    -
      -
    - - -
    -

    -  28. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/plain.txt> -

    -
      -
    - - -
    -

    -  29. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/sax.xml> -

    -
      -
    - - -
    -

    -  30. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/dom.xml> -

    -
      -
    - - -
    -

    -  31. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/concordance.html> -

    -
      -
    - - -
    -

    -  32. discontinued for the time being -

    -
      -
    - - -
    -

    -  33. SiSU markup syntax, an incomplete summary: <http://www.jus.uio.no/sisu/sisu_markup_table/doc.html#h200306>
    Visual check of elementary font face modifiers: bold bold emphasis italics underscore strikethrough superscript subscript -

    -
      -
    - - -
    -

    -  34. the program that generates the documents -

    -
      -
    - - -
    -

    -  35. <http://www.jus.uio.no/sisu/SiSU/1.html#search> -

    -
      -
    - - -
    -

    -  36. <http://dublincore.org/> -

    -
      -
    - - -
    -

    -  37. <http://www.w3.org/RDF/> -

    -
      -
    - - -
    -

    -  38. The previous way was directory associations for file output were set up in the configuration file. The present system is a more natural way to work requireing less configuration. -

    -
      -
    - - -
    -

    -  39. from a version control system such as CVS -

    -
      -
    - - -
    -

    -  40. The version control system must be run, so the version number is obtained, prior to the SiSU document generation, and subsequent posting of the document. -

    -
      -
    - - -
    -

    -  41. e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), SQL (e.g. document set searches; representation of the constituent parts of documents based on their structure, headings, chapters, paragraphs as desired; control of use) -

    -
      -
    - - -
    -

    -  42. CISG <http://www.jus.uio.no/lm/un_contracts_international_sale_of_goods_convention_1980/doc>
    The Unidroit Contract Principles <http://www.jus.uio.no/lm/unidroit.contract.principles.1994/doc> or
    The Autonomous Contract <http://www.jus.uio.no/lm/autonomous.contract.2000.amissah/doc> -

    -
      -
    - - -
    -

    -  43. Scrolling is not however necessarily confined to full length documents as you will have to scroll to get to the bottom of any long segment (eg. chapter) of a segmented text. -

    -
      -
    - - -
    -

    -  44. CISG <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980>
    The Unidroit Principles <http://www.jus.uio.no/lm/unidroit.contract.principles.1994>
    The Autonomous Contract <http://www.jus.uio.no/sisu/the.autonomous.contract.2000.amissah> or
    WTA 1994 <http://www.jus.uio.no/lm/wta.1994> -

    -
      -
    - - -
    -

    -  45. formatting possibility still exists in code tree but maintenance has been largely discontinuted. -

    -
      -
    - - -
    -

    -  46. <http://www.mozilla.org/products/firefox/> -

    -
      -
    - - -
    -

    -  47. <http://kazehakase.sourceforge.jp/> -

    -
      -
    - - -
    -

    -  48. <http://www.konqueror.org/> -

    -
      -
    - - -
    -

    -  49. <http://www.mozilla.org/> -

    -
      -
    - - -
    -

    -  50. <http://www.microsoft.com/windows/ie/default.asp> -

    -
      -
    - - -
    -

    -  51. <http://home.netscape.com/comprod/mirror/client_download.html> -

    -
      -
    - - -
    -

    -  52. <http://www.opera.com/> -

    -
      -
    - - -
    -

    -  53. <http://www.dillo.org/> -

    -
      -
    - - -
    -

    -  54. <http://www.gnome.org/projects/epiphany/> -

    -
      -
    - - -
    -

    -  55. <http://galeon.sourceforge.net/> -

    -
      -
    - - -
    -

    -  56. <http://elinks.or.cz/> -

    -
      -
    - - -
    -

    -  57. <http://links.twibright.com/> -

    -
      -
    - - -
    -

    -  58. <http://w3m.sourceforge.net/> -

    -
      -
    - - -
    -

    -  59. LaTeX and pdf features introduced 18th June 2001, Landscape and portrait pdfs introduced 7th October 2001., Lout is a more recent addition 22th April 2003 -

    -
      -
    - - -
    -

    -  60. <http://www.adobe.com/products/acrobat/readstep2.html> -

    -
      -
    - - -
    -

    -  61. <http://www.gnome.org/projects/evince/> -

    -
      -
    - - -
    -

    -  62. <http://www.foolabs.com/xpdf/> -

    -
      -
    - - -
    -

    -  63. <http://www.postgresql.org/>
    <http://advocacy.postgresql.org/>
    <http://en.wikipedia.org/wiki/Postgresql> -

    -
      -
    - - -
    -

    -  64. <http://www.hwaci.com/sw/sqlite/>
    <http://en.wikipedia.org/wiki/Sqlite> -

    -
      -
    - - -
    -

    -  65. Relational database features retaining document structure and citation introduced 15th July 2002 -

    -
      -
    - - -
    -

    -  66. <http://search.sisudoc.org> -

    -
      -
    - - -
    -

    -  67. (which could be extended further with current back-end). As regards scaling of the database, it is as scalable as the database (here Postgresql) and hardware allow. -

    -
      -
    - - -
    -

    -  68. of this feature when demonstrated to an IBM software innovations evaluator in 2004 he said to paraphrase: this could be of interest to us. We have large document management systems, you can search hundreds of thousands of documents and we can tell you which documents meet your search criteria, but there is no way we can tell you without opening each document where within each your matches are found. -

    -
      -
    - - -
    -

    -  69. OCN are provided for HTML, XML, pdf ... though currently omitted in plain-text and opendocument format output -

    -
      -
    - - -
    -

    -  70. Concordance/ WordMaps introduced 15th August 2002 -

    -
      -
    - - -
    -

    -  71. <http://www.vim.org/> -

    -
      -

    -^

     
    - - - -
    - -
    SiSU -->
    -
    - - - - - - - - - - - - - - - - - -
    - - - TOC linked  toc  - - - - - - Full Text  scroll  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
    -
    - - - - - - -
    - - << previous - - - - TOC - - - - next >> - - -
    -
    - - - - -
    - - < - - - - ^ - - - - > - -

    - - - - - - - - -
    - -
    -

    - - SiSU - -

    -
    - -

    - Output generated by - - SiSU - - 0.59.1 2007-09-25 (2007w39/2) -
    - - SiSU - - Copyright © Ralph Amissah - 1997, current 2007. - All Rights Reserved. -
    - - SiSU - - is software for document structuring, publishing and search, -
    - - www.jus.uio.no/sisu - - and - - www.sisudoc.org - -
    - w3 since October 3 1993 - - ralph@amissah.com - -

    -
    -

    - SiSU using: -
    Standard SiSU markup syntax, -
    Standard SiSU meta-markup syntax, and the -
    Standard SiSU object citation numbering and system, (object/text positioning system) -
    - Copyright © Ralph Amissah 1997, current 2007. - All Rights Reserved. -

    -
    -

    - - GPLv3 - -

    -
    -

    - - SiSU - - is released under - GPLv3 - or later, - <http://www.gnu.org/licenses/gpl.html> -

    -
    -

    - SiSU, developed using - - Ruby - - on - - Debian/Gnu/Linux - - software infrastructure, - with the usual GPL (or OSS) suspects. -
    - Better - "performance, reliability, scalability, security & total cost of ownership" - [not to mention flexibility & choice] use of and adherence to open standards (where practical and fair) and it is software libre. -
    - Get With the Future - - Way Better! - -

    -
    - - - - - -
    -
    - - -
    - - -
    - - - - idx - txt - - -
    -
    - -
    -

    - - -SiSU manual - -

    - - - - - - - - - - - -
    -

    - - -SiSU - -

    - - - - - - - - - - - - - - - - -
    - \ No newline at end of file diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/index.html b/data/doc/manuals_generated/sisu_manual/sisu_description/index.html deleted file mode 100644 index dcdd4375..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/index.html +++ /dev/null @@ -1,704 +0,0 @@ - - - - - - SiSU - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    - - - - - -
    - -
    SiSU -->
    -
    - - - - - - - - - - - - - - - -
    - - TOC linked  toc  - - - - - Full Text  scroll  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
    -
    -   - next >> -   -
    -

    - -

    - -
    - - -
    - -

    SiSU -

    - -

    Description -

    - -

    Ralph Amissah -

    - -copy @ - - SiSU - - -
    - - - -

    - SiSU - Description,
    Ralph Amissah -

    - -

    - SiSU an attempt to describe -

    - -

    - - 1. Description - -

    - -
    - - 1.1 Outline - -
    - -
    - - 1.2 Short summary of features - -
    - -
    - - 1.3 How it works - -
    - -
    - - 1.4 Simple markup - -
    - -
    - - 1.4.1 Sparse markup requirement, try to get the most out of markup - -
    - -
    - - 1.4.2 Single markup file provides multiple output formats - -
    - -
    - - 1.4.3 Syntax relatively easy to read and remember - -
    - -
    - - 1.4.4 Kept simple by having a limited publishing feature set, and features identified as most important, are available across several document types - -
    - -
    - - 1.5 Designed with usability in mind - -
    - -
    - - 1.6 Code separate from content - -
    - -
    - - 1.7 Object citation numbering, a text or object positioning / citation system - "paragraph" (or text object) numbering, that remains same and usable across all output formats by people and machine - -
    - -
    - - 1.8 Handling of Dublin Core meta-tags making use of the Resource Description Framework - -
    - -
    - - 1.9 Easy directory management - -
    - -
    - - 1.10 Document Version Control Information - -
    - -
    - - 1.11 Table of contents - -
    - -
    - - 1.12 Auto-numbering of headings - -
    - -
    - - 1.13 Numbering and cross-hyperlinking of endnotes - -
    - -
    - - 1.14 "Skinnable" - -
    - -
    - - 1.15 Multiple Outputs - -
    - -
    - - 1.15.1 html - several presentations: full length & segmented; css & table based - -
    - -
    - - 1.15.2 XML - -
    - -
    - - 1.15.3 ODT:ODF, Open Document Format - ISO/IEC 26300:2006 - -
    - -
    - - 1.15.4 PDF - portrait and landscape, (through the generation of LaTeX output which is then transformed to pdf) - -
    - -
    - - 1.15.5 Search - loading/populating of relational database while retaining document structure information, object citation numbering and other features (currently PostgreSQL and/or SQLite) - -
    - -
    - - 1.15.6 Search - database frontend sample, utilising database and SiSU features, including object citation numbering (backend currently PostgreSQL) - -
    - -
    - - 1.15.7 Other forms - -
    - -
    - - 1.16 Concordance / Word Map or rudimentary index - -
    - -
    - - 1.17 Managed (document) directory, database, or site structure - -
    - -
    - - 1.18 Batch processing - -
    - -
    - - 1.19 Integration to superior Gnu/Linux and Unix tools - -
    - -
    - - 1.19.1 Backup and version control - -
    - -
    - - 1.19.2 Editor support - -
    - -
    - - 1.20 Modular design, need something new add a module - -
    - -

    - - Endnotes - -

    - - - Document Information (metadata) - - -

    Rights: Copyright ©  Ralph Amissah 2007, part of SiSU documentation, License GPL 3

    -

    - -

    - - - - - -
    - -
    SiSU -->
    -
    - - - - - - - - - - - - - - - -
    - - TOC linked  toc  - - - - - Full Text  scroll  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
    -
    -   - next >> -   -
    -

    - -

     

    - - - - - - - - -
    - -
    -

    - - SiSU - -

    -
    - -

    - Output generated by - - SiSU - - 0.59.1 2007-09-25 (2007w39/2) -
    - - SiSU - - Copyright © Ralph Amissah - 1997, current 2007. - All Rights Reserved. -
    - - SiSU - - is software for document structuring, publishing and search, -
    - - www.jus.uio.no/sisu - - and - - www.sisudoc.org - -
    - w3 since October 3 1993 - - ralph@amissah.com - -

    -
    -

    - SiSU using: -
    Standard SiSU markup syntax, -
    Standard SiSU meta-markup syntax, and the -
    Standard SiSU object citation numbering and system, (object/text positioning system) -
    - Copyright © Ralph Amissah 1997, current 2007. - All Rights Reserved. -

    -
    -

    - - GPLv3 - -

    -
    -

    - - SiSU - - is released under - GPLv3 - or later, - <http://www.gnu.org/licenses/gpl.html> -

    -
    -

    - SiSU, developed using - - Ruby - - on - - Debian/Gnu/Linux - - software infrastructure, - with the usual GPL (or OSS) suspects. -
    - Better - "performance, reliability, scalability, security & total cost of ownership" - [not to mention flexibility & choice] use of and adherence to open standards (where practical and fair) and it is software libre. -
    - Get With the Future - - Way Better! - -

    -
    - - - - - - - -
    - -
    -
    - - -
    - - -
    - - - - idx - txt - - -
    -
    - -
    -

    - - -SiSU manual - -

    - - - - - - - - - - - -
    -

    - - -SiSU - -

    - - - - - - - - - - - - - - - - -
    - - - - diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/landscape.pdf b/data/doc/manuals_generated/sisu_manual/sisu_description/landscape.pdf deleted file mode 100644 index e202e2d6..00000000 Binary files a/data/doc/manuals_generated/sisu_manual/sisu_description/landscape.pdf and /dev/null differ diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/metadata.html b/data/doc/manuals_generated/sisu_manual/sisu_description/metadata.html deleted file mode 100644 index d2b22d83..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/metadata.html +++ /dev/null @@ -1,591 +0,0 @@ - - - - - - - - SiSU - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    - - - -
    - -
    SiSU -->
    -
    - - - - - - - - - - - - - - - - - -
    - - - TOC linked  toc  - - - - - - Full Text  scroll  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
    -
    - - - - - - -
    - - << previous - - - - TOC - - -
    -
    - - - - -
    - - < - - - - ^ - - - - > - -

    - SiSU -

    -

    - Description -

    -

    - Ralph Amissah - -

    -

    - copy @ - - SiSU - -

    - - -
    -

    - Document Information (metadata) -

    -
      -

    -   0 -

    -
    - - -
    - -   -
    - - -
    -

    - - Metadata -

    -
    -

    -   0 -

    -
    - -
    -

    - <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/sisu_manifest.html> -

    -
    - -
    -

    - Dublin Core (DC) -

    -
    - -
    -

    - DC tags included with this document are provided here. -

    -
    - -
    -

    - DC Title: SiSU - Description -

    -
    - -
    -

    - DC Creator: Ralph Amissah -

    -
    - -
    -

    - DC Rights: Copyright (C) Ralph Amissah 2007, part of SiSU documentation, License GPL 3 -

    -
    - -
    -

    - DC Type: information -

    -
    - -
    -

    - DC Date created: 2002-11-12 -

    -
    - -
    -

    - DC Date issued: 2002-11-12 -

    -
    - -
    -

    - DC Date available: 2002-11-12 -

    -
    - -
    -

    - DC Date modified: 2007-08-30 -

    -
    - -
    -

    - DC Date: 2007-08-30 -

    -
    - -
    -

    - Version Information -

    -
    - -
    -

    - Sourcefile: sisu_description.sst -

    -
    - -
    -

    - Filetype: SiSU text 0.57 -

    -
    - -
    -

    - Sourcefile Digest, MD5(sisu_description.sst)= b89ccdad9f6d9c2260d8d383d6b35ccc -

    -
    - -
    -

    - Skin_Digest: MD5(/home/ralph/grotto/theatre/dbld/builds/sisu/sisu/data/doc/sisu/sisu_markup_samples/sisu_manual/_sisu/skin/doc/skin_sisu_manual.rb)= 20fc43cf3eb6590bc3399a1aef65c5a9 -

    -
    - -
    -

    - Generated -

    -
    - -
    -

    - Document (metaverse) last generated: Tue Sep 25 02:54:06 +0100 2007 -

    -
    - -
    -

    - Generated by: SiSU 0.59.1 of 2007w39/2 (2007-09-25) -

    -
    - -
    -

    - Ruby version: ruby 1.8.6 (2007-06-07 patchlevel 36) [i486-linux] -

    -

    -^

     
    - - - -
    - -
    SiSU -->
    -
    - - - - - - - - - - - - - - - - - -
    - - - TOC linked  toc  - - - - - - Full Text  scroll  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
    -
    - - - - - - -
    - - << previous - - - - TOC - - -
    -
    - - - - -
    - - < - - - - ^ - - - - > - -

    - - - - - - - - -
    - -
    -

    - - SiSU - -

    -
    - -

    - Output generated by - - SiSU - - 0.59.1 2007-09-25 (2007w39/2) -
    - - SiSU - - Copyright © Ralph Amissah - 1997, current 2007. - All Rights Reserved. -
    - - SiSU - - is software for document structuring, publishing and search, -
    - - www.jus.uio.no/sisu - - and - - www.sisudoc.org - -
    - w3 since October 3 1993 - - ralph@amissah.com - -

    -
    -

    - SiSU using: -
    Standard SiSU markup syntax, -
    Standard SiSU meta-markup syntax, and the -
    Standard SiSU object citation numbering and system, (object/text positioning system) -
    - Copyright © Ralph Amissah 1997, current 2007. - All Rights Reserved. -

    -
    -

    - - GPLv3 - -

    -
    -

    - - SiSU - - is released under - GPLv3 - or later, - <http://www.gnu.org/licenses/gpl.html> -

    -
    -

    - SiSU, developed using - - Ruby - - on - - Debian/Gnu/Linux - - software infrastructure, - with the usual GPL (or OSS) suspects. -
    - Better - "performance, reliability, scalability, security & total cost of ownership" - [not to mention flexibility & choice] use of and adherence to open standards (where practical and fair) and it is software libre. -
    - Get With the Future - - Way Better! - -

    -
    - - - - - -
    -
    - - -
    - - -
    - - - - idx - txt - - -
    -
    - -
    -

    - - -SiSU manual - -

    - - - - - - - - - - - -
    -

    - - -SiSU - -

    - - - - - - - - - - - - - - - - -
    - \ No newline at end of file diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/opendocument.odt b/data/doc/manuals_generated/sisu_manual/sisu_description/opendocument.odt deleted file mode 100644 index 1fa175f5..00000000 Binary files a/data/doc/manuals_generated/sisu_manual/sisu_description/opendocument.odt and /dev/null differ diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/plain.txt b/data/doc/manuals_generated/sisu_manual/sisu_description/plain.txt deleted file mode 100644 index a2a490e2..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/plain.txt +++ /dev/null @@ -1,1566 +0,0 @@ -SISU - DESCRIPTION, -RALPH AMISSAH -********************************** - -SISU AN ATTEMPT TO DESCRIBE -=========================== - -1. DESCRIPTION --------------- - -1.1 OUTLINE -........... - -*SiSU* is a flexible document preparation, generation publishing and search -system.[^1] - - -- [1]: This information was first placed on the web 12 November 2002; with - predating material taken from - part of a site started and - developed since 1993. See document metadata section - for information on this - version. Dates related to the development of *SiSU* are mostly contained - within the Chronology section of this document, e.g. - - -*SiSU* ("*SiSU* information Structuring Universe" or "Structured information, -Serialized Units"),[^2] is a Unix command line oriented framework for document -structuring, publishing and search. Featuring minimalistic markup, multiple -standard outputs, a common citation system, and granular search. - - -- [2]: also chosen for the meaning of the Finnish term "sisu". - -Using markup applied to a document, *SiSU* can produce plain text, HTML, XHTML, -XML, OpenDocument, LaTeX or PDF files, and populate an SQL database with -objects[^3] (equating generally to paragraph-sized chunks) so searches may be -performed and matches returned with that degree of granularity (e.g. your -search criteria is met by these documents and at these locations within each -document). Document output formats share a common object numbering system for -locating content. This is particularly suitable for "published" works -(finalized texts as opposed to works that are frequently changed or updated) -for which it provides a fixed means of reference of content. - - -- [3]: objects include: headings, paragraphs, verse, tables, images, but not - footnotes/endnotes which are numbered separately and tied to the object from - which they are referenced. - -*SiSU* is the data/information structuring and transforming tool, that has -resulted from work on one of the oldest law web projects. It makes possible the -one time, simple human readable markup of documents, that *SiSU* can then -publish in various forms, suitable for paper[^4], web[^5] and relational -database[^6] presentations, retaining common data-structure and -meta-information across the output/presentation formats. Several requirements -of legal and scholarly publication on the web have been addressed, including -the age old need to be able to reliably cite/pinpoint text within a document, -to easily make footnotes/endnotes, to allow for semantic document meta-tagging, -and to keep required markup to a minimum. These and other features of interest -are listed and described below. A few points are worth making early (and will -be repeated a number of times): - - -- [4]: pdf via LaTeX or lout - -- [5]: currently html (two forms of html presentation one based on css the other on - tables), and /PHP/; potentially structured XML - -- [6]: any SQL - currently PostgreSQL and /sqlite/ (for portability, testing and - development) - - (i) The *SiSU* document generator was the first to place material on the web - with a system that makes possible citation across different document types, - with paragraph, or rather object citation numbering[^7] a text positioning - system, available for the pinpointing of text, 1997, a simple idea from which - much benefit, and *SiSU* remains today, to the best of my knowledge, the only - multiple format e-book/ electronic-document system on the web that gives you - this possibility (including for relational databases). - - -- [7]: previously called "text object numbering" - - (ii) Markup is done once for the multiple formats produced. - - - (iii) Markup is simple, and human readable (with a little practice), in - almost all cases there is less and simpler markup required than basic html. - In any event the markup required is very much simpler than the html, LaTeX, - [lout], structured XML, ODF (OpenDocument), PostgreSQL or SQLite feed etc. - that you can have *SiSU* generate for you. - - - (iv) *SiSU* is a batch processor, dealing with as many files as you need to - generate at a time. - - - (v) Scalability is dependent on your file system (in my case Reiserfs), the - database (currently Postgresql and/or SQLite) and your hardware. - - -*SiSU* Sabaki[^8] (or just *SiSU*) is the provisional name given to the -software described here that helps structure documents for web and other -publication. The name *SiSU* is a loose anagram for something along the lines -of */"SiSU is structuring unit"/*, or /"*SiSU*, information structuring unit"/ -or the more descriptive /"Structured information, Serialized Units"/ or -*/"simple - information structuring unit"/* or the more descriptive -/"Structured information, Serialized Units"/ or what it may be directed towards -/"*semantic* and *information structuring universe*" /,[^9] tongue in cheek, -only just. Guess I'll get away with */"Simple - information Structuring -Universe"/*. *SiSU* is also a Finnish word roughly meaning guts, inner strength -and perseverance.[^10] - - -- [8]: *SiSU* Sabaki, release version. Pre-release version *SiSU* Scribe, and - version prior to that *SiSU* nicknamed Scribbler. Pre-release versions go back - several years. Both Scribbler and Scribe (still maintained) made system calls - to *SiSU*'s various parts, instead of using libraries. - -- [9]: A little universe it may be, but semantic you may have a hard time getting - away with, given the meaning the word has taken on with markup. On a document - wide basis semantic information may be provided, which can be really useful, - (and meaningful, especially) if you have a large document set, and use this - with rss feeds or in an sql database etc. On a markup level, I have little - inclination to add semantic markup formally beyond references, title, author - [Dublin Core entities? addresses?] etc. Actually this deserves a bit of - thought possibly use letter tags (including letter alias/synonyms for font - faces) to create a small set of default semantic tags, with the possibility - for per document adjustments. Will seek to permit XML entity tagging, within - *SiSU* markup and have that ignored/removed by the parts of the program that - have no use for it. - -- [10]: "Sisu refers not to the courage of optimism, but to a concept of life that - says, 'I may not win, but I will gladly give my life for what I believe.'" - Aini Rajanen, Of Finnish Ways, 1981, p. 10. - -- - -- "Every Finn has his own pet definition. To me, sisu means patience without - passion. But there are many varieties of sisu. Sisu can be a sudden outburst - or it can be the kind that lasts. A man can have both kinds. It is outside - reason. It is something in the soul. It comes from oneself. For instance, it - makes a soldier do things because he himself must, not because he has been - told." Paavo Nurmi - -- - -*SiSU* was born of the need to find a way, with minimal effort, and for as wide -a range of document types as possible, to produce high quality publishing -output in a variety of document formats. As such it was necessary to find a -simple document representation that would work across a large number of -document types, and the most convenient way(s) to produce acceptable output -formats. The project leading to this program was started in 1993 (together with -the trade law project now known as Lex Mercatoria) as an investigation of how -to effectively/efficiently place documents on the web. The unified document -handling, together with features such as paragraph numbering, endnote handling -and tables... appeared in 1996/97. *SiSU* was originally written in Perl,[^11] -and converted to *Ruby*, [^12] in 2000, one of the most impressive programming -languages in existence! In its current form it has been written to run on the -*Gnu* /Linux platform, and in particular on *Debian*, [^13] taking advantage of -many of the wonderful projects that are available there. - - -- [11]: - -- [12]: - -- [13]: - -*SiSU* markup is based on requiring the minimum markup needed to determine the -structure of a document. (This can be as little as saying in a header to look -for the word Book at a specified level and the word Chapter at another level). -*SiSU* then breaks a document into its smallest parts (at a heading, and -paragraph level) while retaining all structural information. This break up of -the document and information on its structure is taken advantage of in the -transformations made in generating the very different output types that can be -created, and in providing as much as can be for what each output type is best -at doing, e.g. LaTeX (professional document typesetting, easy conversion to pdf -or Postscript), XML (in this case, structural representation), ODF -(OpenDocument [experimental]), SQL (e.g. document search; representing -constituent parts of documents based on their structure, headings, chapters, -paragraphs as required; user control).[^14] - - -- [14]: where explicit structure is provided through the use of tagging headings, - it could be reduced (still) further, for example by reducing the number of - characters used to identify heading levels; but in many cases even that - information is not required as regular expressions can be used to extract the - implicit structure. - -From markup that is simpler and more sparse than html you get: - - -* far greater output possibilities, including html, XML, ODF (OpenDocument), -LaTeX (pdf), and SQL; - - -* the advantages implicit in the very different output possibilities; - - -* a common citation system (for all outputs - including the relational -database, search results are relevant for all outputs); - - -For more see the short summary of features provided below. - - -*SiSU* processes files with minimal tagging to produce various document outputs -including html, LaTeX or lout (which is converted to pdf) and if required loads -the structured information into an SQL database (PostgreSQL and SQLite have -been used for this). *SiSU* produces an intermediate processing format.[^15] - - -- [15]: This proved to be the easiest way to develop syntax, changes could be made, - or alternatives provided for the markup syntax whilst the intermediate markup - syntax was largely held constant. There is actually an optional second - intermediate markup format in YAML - -*SiSU* is used in constructing Lex Mercatoria or - (one of the oldest law web sites), and considerable -thought went into producing output that would be suitable for legal and -academic writings (that do not have formulae) given the limitations of html, -and publication in a wide variety of "formats", in particular in relation to -the convenient and accurate citation of text. However, the construction of Lex -Mercatoria uses only a fraction of the features available from *SiSU* today, -/vis/ generation of flat file structures, rather than in addition the building -of ("granular") SQL database content, (at an object level with relevant -relational tables, and other outputs also available). - - -1.2 SHORT SUMMARY OF FEATURES -............................. - -*(i)* markup syntax: (a) simpler than html, (b) mnemonic, influenced by -mail/messaging/wiki markup practices, (c) human readable, and easily writable, - - -*(ii)* (a) minimal markup requirement, (b) single file marked up for multiple -outputs, - - -notes: - - -* documents are prepared in a single UTF-8 file using a minimalistic mnemonic -syntax. Typical literature, documents like "War and Peace" require almost no -markup, and most of the headers are optional. - - -* markup is easily readable/parsed by the human eye, (basic markup is simpler -and more sparse than the most basic html), [this may also be converted to XML -representations of the same input/source document]. - - -* markup defines document structure (this may be done once in a header -pattern-match description, or for heading levels individually); basic text -attributes (bold, italics, underscore, strike-through etc.) as required; and -semantic information related to the document (header information, extended -beyond the Dublin core and easily further extended as required); the headers -may also contain processing instructions. - - -*(iii)* (a) multiple outputs primarily industry established and institutionally -accepted open standard formats, include amongst others: plaintext (UTF-8); -html; (structured) XML; ODF (Open Document text)l; LaTeX; PDF (via LaTeX); SQL -type databases (currently PostgreSQL and SQLite). Also produces: concordance -files; document content certificates (md5 or sha256 digests of headings, -paragraphs, images etc.) and html manifests (and sitemaps of content). (b) -takes advantage of the strengths implicit in these very different output types, -(e.g. PDFs produced using typesetting of LaTeX, databases populated with -documents at an individual object/paragraph level, making possible granular -search (and related possibilities)) - - -*(iv)* outputs share a common numbering system (dubbed "object citation -numbering" (ocn)) that is meaningful (to man and machine) across various -digital outputs whether paper, screen, or database oriented, (PDF, html, XML, -sqlite, postgresql), this numbering system can be used to reference content. - - -*(v)* SQL databases are populated at an object level (roughly headings, -paragraphs, verse, tables) and become searchable with that degree of -granularity, the output information provides the object/paragraph numbers which -are relevant across all generated outputs; it is also possible to look at just -the matching paragraphs of the documents in the database; [output indexing also -work well with search indexing tools like hyperesteier]. - - -*(vi)* use of semantic meta-tags in headers permit the addition of semantic -information on documents, (the available fields are easily extended) - - -*(vii)* creates organised directory/file structure for (file-system) output, -easily mapped with its clearly defined structure, with all text objects -numbered, you know in advance where in each document output type, a bit of text -will be found (e.g. from an SQL search, you know where to go to find the -prepared html output or PDF etc.)... there is more; easy directory management -and document associations, the document preparation (sub-)directory may be used -to determine output (sub-)directory, the skin used, and the SQL database used, - - -*(viii)* "Concordance file" wordmap, consisting of all the words in a document -and their (text/ object) locations within the text, (and the possibility of -adding vocabularies), - - -*(ix)* document content certification and comparison considerations: (a) the -document and each object within it stamped with an md5 hash making it possible -to easily check or guarantee that the substantive content of a document is -unchanged, (b)version control, documents integrated with time based source -control system, default RCS or CVS with use of $Id: sisu_description.sst,v 1.25 -2007/08/23 12:22:36 ralph Exp $ tag, which *SiSU* checks - - -*(x)* *SiSU*'s minimalist markup makes for meaningful "diffing" of the -substantive content of markup-files, - - -*(xi)* easily skinnable, document appearance on a project/site wide, directory -wide, or document instance level easily controlled/changed, - - -*(xii)* in many cases a regular expression may be used (once in the document -header) to define all or part of a documents structure obviating or reducing -the need to provide structural markup within the document, - - -*(xiii)* prepared files may be batch process, documents produced are static -files so this needs to be done only once but may be repeated for various -reasons as desired (updated content, addition of new output formats, updated -technology document presentations/representations) - - -*(xiv)* possible to pre-process, which permits: the easy creation of standard -form documents, and templates/term-sheets, or; building of composite documents -(master documents) from other sisu marked up documents, or marked up parts, -i.e. import documents or parts of text into a main document should this be -desired - - -there is a considerable degree of future-proofing, output representations are -"upgradeable", and new document formats may be added. - - -*(xv)* there is a considerable degree of future-proofing, output -representations are "upgradeable", and new document formats may be added: (a) -modular, (thanks in no small part to *Ruby*) another output format required, -write another module.... (b) easy to update output formats (eg html, XHTML, -LaTeX/PDF produced can be updated in program and run against whole document -set), (c) easy to add, modify, or have alternative syntax rules for input, -should you need to, - - -*(xvi)* scalability, dependent on your file-system (ext3, Reiserfs, XFS, -whatever) and on the relational database used (currently Postgresql and -SQLite), and your hardware, - - -*(xvii)* only marked up files need be backed up, to secure the larger document -set produced, - - -*(xviii)* document management, - - -*(xix)* Syntax highlighting for *SiSU* markup is available for a number of text -editors. - - -*(xx)* remote operations: (a) run *SiSU* on a remote server, (having prepared -sisu markup documents locally or on that server, i.e. this solution where sisu -is installed on the remote server, would work whatever type of machine you -chose to prepare your markup documents on), (b) generated document outputs may -be posted by sisu to remote sites (using rsync/scp) (c)document source -(plaintext utf-8) if shared on the net may be identified by its url and -processed locally to produce the different document outputs. - - -*(xxi)* document source may be bundled together (automatically) with associated -documents (multiple language versions or master document with inclusions) and -images and sent as a zip file called a sisupod, if shared on the net these too -may be processed locally to produce the desired document outputs, these may be -downloaded, shared as email attachments, or processed by running sisu against -them, either using a url or the filename. - - -*(xxii)* for basic document generation, the only software dependency is *Ruby*, -and a few standard Unix tools (this covers plaintext, html, XML, ODF, LaTeX). -To use a database you of course need that, and to convert the LaTeX generated -to PDF, a LaTeX processor like tetex or texlive. - - -as a developers tool it is flexible and extensible - - -*SiSU* was developed in relation to legal documents, and is strong across a -wide variety of texts (law, literature...). *SiSU* handles images but is not -suitable for formulae/ statistics, or for technical writing at this time. - - -*SiSU* has been developed and has been in use for several years. Requirements -to cover a wide range of documents within its use domain have been explored. - - -Some modules are more mature than others, the most mature being Html and LaTeX -/ pdf. PostgreSQL and search functions are useable and together with /ocn/ -unique (to the best of my knowledge). The XML output document set is "well -formed" but largely proof of concept. - - -1.3 HOW IT WORKS -................ - -*SiSU* markup is fairly minimalistic, it consists of: a (largely optional) -document header, made up of information about the document (such as when it was -published, who authored it, and granting what rights) and any processing -instructions; and markup within text which is related to document structure and -typeface. *SiSU* must be able to discern the structure of a document, (text -headings and their levels in relation to each other), either from information -provided in the instruction header or from markup within the text (or from a -combination of both). Processing is done against an abstraction of the document -comprising of information on the document's structure and its objects,[^16] -which the program serializes (providing the object numbers) and which are -assigned hash sum values based on their content. This abstraction of -information about document structure, objects, (and hash sums), provides -considerable flexibility in representing documents different ways and for -different purposes (e.g. search, document layout, publishing, content -certification, concordance etc.), and makes it possible to take advantage of -some of the strengths of established ways of representing documents, (or indeed -to create new ones). - - -- [16]: objects include: headings, paragraphs, verse, tables, images, but not - footnotes/endnotes which are numbered separately and tied to the object from - which they are referenced. - -1.4 SIMPLE MARKUP -................. - -*SiSU* markup is based on requiring the minimum markup needed to determine the -structure of a document. (This can be as little as saying in a header to look -for the word Book at a specified level and the word Chapter at another level). -*SiSU* then breaks a document into its smallest parts (at a heading, and -paragraph level) while retaining all structural information. This break up of -the document and information on its structure is taken advantage of in the -transformations made in generating the very different output types that can be -created, and in providing as much as can be for what each output type is best -at doing, e.g. LaTeX (professional document typesetting, easy conversion to pdf -or Postscript), XML (in this case, structural representation), ODF -(OpenDocument), SQL (e.g. document search; representing constituent parts of -documents based on their structure, headings, chapters, paragraphs as required; -user control).[^17] - - -- [17]: where explicit structure is provided through the use of tagging headings, - it could be reduced (still) further, for example by reducing the number of - characters used to identify heading levels; but in many cases even that - information is not required as regular expressions can be used to extract the - implicit structure. - -1.4.1 SPARSE MARKUP REQUIREMENT, TRY TO GET THE MOST OUT OF MARKUP -.................................................................. - -One of its strengths is that very small amounts of initial tagging is required -for the program to generate its output. - - -This is a basic markup example: - - -* basic markup example, text file - an international convention [link:] - -[^18] - - -- [18]: - output provided as example in the next section - -* view basic markup, as it would be highlighted by vim editor [link:] - -[^19] - - -- [19]: - as it would appear with syntax highlighting (by vim) - -Emphasis has been on simplicity and minimalism in markup requirements. Design -philosophy is to try keep the amount of markup required low, for whatever has -been determined to be acceptable output.[^20] - - -- [20]: seems there are several "smart ASCIIs" available, primarily for ascii to - html conversion, that make this, and reasonable looking ascii their goal - -- - -- - -- - -*SiSU*'s markup is more minimalistic and simpler than (the equivalent) html and -for it, you get considerably more than just html, as this preparation gives you -all available output formats, upon request. - - -1.4.2 SINGLE MARKUP FILE PROVIDES MULTIPLE OUTPUT FORMATS -......................................................... - -For each document, there is only one (input, minimalistically marked up) file -from which all the available output types are generated.[^21] - - -- [21]: These include richly laid out and linked html (table or css variants), - /PHP/, LaTeX (from which pdf portrait and landscape documents are produced), - texinfo (for info files etc.), and PostgreSQL and/or SQLite. And the - opportunity to fairly easily build additional modules, such as XML. See the - examples provided in this document. - -Eg. the markup example: - - -* original text file - an international convention [link:] - -[^22] - - -- [22]: - -* view as syntax would be highlighted by vim editor [link:] - -[^23] - - -- [23]: - -Produces the following output: - - -* Segmented html version of document [link:] - -[^24] - - -- [24]: - -* Full length html document [link:] - -[^25] - - -- [25]: - -* pdf landscape version of document [link:] - -[^26] - - -- [26]: - -* pdf portrait version of document [link:] - -[^27] - - -- [27]: - -* clean tex ascii version of document [link:] - -[^28] - - -- [28]: - -* /xml/ sax version of document [link:] - -[^29] - - -- [29]: - -* /xml/ dom version of document [link:] - -[^30] - - -- [30]: - -* Concordance [link:] - -[^31] - - -- [31]: - -(and in addition to these: PostgreSQL, SQLite, texinfo and YAML -[^32] versions if desired) - - -- [32]: discontinued for the time being - -1.4.3 SYNTAX RELATIVELY EASY TO READ AND REMEMBER -................................................. - -Syntax is kept simple and mnemonic.[^33] - - -- [33]: *SiSU* markup syntax, an incomplete summary: - - -- Visual check of elementary font face modifiers: *bold* *bold* - emphasis /italics/ _underscore_ strikethrough - ^superscript^ [subscript] - -1.4.4 KEPT SIMPLE BY HAVING A LIMITED PUBLISHING FEATURE SET, AND FEATURES -IDENTIFIED AS MOST IMPORTANT, ARE AVAILABLE ACROSS SEVERAL DOCUMENT TYPES -.............................................................................. - -To keep *SiSU* markup sparse and simple *SiSU* deliberately provides a limited -publishing feature set, including: indent levels; bold; italics; superscript; -subscript; simple tables; images; tables of contents and; endnotes. Which in -most cases are available across the different output formats. - - -The publishing feature set may be expanded as required. - - -1.5 DESIGNED WITH USABILITY IN MIND -................................... - -Output is designed to be uniform, easy to read, navigate and cite. - - -1.6 CODE SEPARATE FROM CONTENT -.............................. - -Code[^34] is separated from content. This means that when changes are desired -in the output presentation, the code that produces them, and not the marked up -text data set (which could be thousands of documents) is modified. Separating -code from content makes large scale changes to output appearance trivial, and -permits the easy addition of new output modules. - - -- [34]: the program that generates the documents - -1.7 OBJECT CITATION NUMBERING, A TEXT OR OBJECT POSITIONING / CITATION SYSTEM - -"PARAGRAPH" (OR TEXT OBJECT) NUMBERING, THAT REMAINS SAME AND USABLE ACROSS ALL -OUTPUT FORMATS BY PEOPLE AND MACHINE -.............................................................................. - -Object citation numbering is a simple object (text) positioning and cition -system that is human relevant and machine useable, used by *SiSU* for all -manner of presentations, and that is available for use in all text mappings. It -is based on the automated sequential numbering of objects (roughly paragraphs, -(headings, tables, verse) or other blocks of text or images etc.). The text -positioning system (in which I claim copyright) is invaluable for publishing -requiring the citing text across multiple output formats, and for the general -mapping of text within a document: - - -* in html, html not being easily citeable (change font size, or use a different -browser and the page on which specific text appears has changed), and - - -* across multiple formats being common to all output formats html/xml/pdf/sql -output, - - -* the results of an sql search can just be "live" citation references to the -documents in which the text is found, much like an index (see image examples -provided). [link:] [^35] - - -- [35]: - -I claim copyright on the system I use which is the most basic of all, numbering -all text in headings and paragraphs sequentially (with tables and images being -treated as a single paragraph) and only footnotes/endnotes not following this -numbering, as their position in text is not strictly determined, (a change from -footnotes to endnotes would change their numbering), footnotes instead "belong" -to the paragraph from which they are referenced, and have sequential numbers of -their own. - - -*SiSU* has a paragraph numbering system, that remains the same regardless of -the output format. This provides an effective means of citation, pinpointing -text accurately in all output formats, using the same reference. This is -particularly useful where text has to be located across different output -formats - for example once html is printed the number of pages and pages on -which given text is found will vary depending on the browser, its settings the -font size setting etc. Similarly *SiSU* produces pdf in different forms, eg. on -the example site Lex Mercatoria as portrait and landscape documents - here too -page numbering varies, but paragraph numbering is the same, /vis a vis/ all -versions of the text (portrait and landscape pdf and the html versions of the -text, and as stored (with "paragraphs" as records) to the PostgreSQL or SQLite -database). - - -These numbers are placed in the text margins and are intended to be independent -of and not to interfere with authors tagging. [The citation system (object -citation numbering system, automated "paragraph numbering") which is -automatically generated and is common and identical across all document -formats] The paragraph numbering system is more accurately described as an -(text) object numbering system, as headings are also numbered... all headings -and paragraphs are numbered sequentially. Endnotes are automatically numbered -independently and rather "belong" to the paragraph from which they are -referenced, as an endnote does not (necessarily) form a part of a documents -sequence, (they may be produced as either endnotes or footnotes (or both -depending on what output you choose to look at - if you take the segmented html -version document provided as an example, you will find that the endnotes are -placed both at the end of each section, and in a separate section of their own -called endnotes, and these are hyper-linked)). An attractive feature of -providing citation numbering in this way is that it is independent of the -document structure... it remains the same regardless of what is done about the -document structure. - - -The rules have been kept very simple, unique incremental object citation -numbers are assigned to headings, paragraphs, verse, tables and images. It is -possible to manually override this feature on a per heading or comment basis -though this should be used exceptionally, it may be of use where there a -substantive text, and the addition of a minor comment by the publisher that -should not be mapped as part of the text. - - -The object citation number markers contain additional numbering information -with regard to the document structure, that can be used for alternative -presentations, including such detail as the type of object (heading, paragraph, -table, image, etc.), numbered sequentially. - - -An advantage is that the numbering remains the same regardless of document -structure. - - -Text object ("paragraph") numbering is the same for all output versions of the -same document, vis html, pdf, pgsql, yaml etc. - - -In the relational database, as individual text objects of a document stored -(and indexed) together with object numbers, and all versions of the document -have the same numbering, the results of searches may be tailored just to -provide the location of the search result in all available document formats. - - -/ Note: there is a bug in the released behaviour of object citation numbering, -(not certain when it was introduced) tables should be numbered, ie each table -gets an ocn, required amongst other things for relational database. This will -be corrected in a future release. Citation numbering of existing documents that -contain tables will changed. / - - -1.8 HANDLING OF DUBLIN CORE META-TAGS MAKING USE OF THE RESOURCE DESCRIPTION -FRAMEWORK -.............................................................................. - -*SiSU* is able to use meta tags based on the Dublin Core[^36] and Resource -Description Framework[^37] - - -- [36]: - -- [37]: - -This provides the means of providing semantic information about a document, -both as computer processable meta-tags, and as human readable information that -may be of value for classification purposes. - - -This information is provided both in html metatags, and (where available) under -the section titled "Document Information - MetaData", near the end of a -document, for example in the segmented html version of this text at: - - - -1.9 EASY DIRECTORY MANAGEMENT -............................. - -1. Directory file association, skins and special image management, made -simpler.[^38] - - -- [38]: The previous way was directory associations for file output were set up in - the configuration file. The present system is a more natural way to work - requireing less configuration. - -The last part of the name of the work directory in which markup is being done, -or rather from where *SiSU* is run in order to generate document output, is -used in determining the sub-directory name for output files, that is created in -the document output directory. This provides a rather easy way to associate -documents e.g. of a given subject, or by owner. - - - - /www/docs - /intellectual_property - /arbitration - /contract_law - /www/docs - /ralph - /sisu - -all are placed in their own directories within the directory structure created. -Similar rules are used in the creation of sql type databases (though they can -be overridden). - - -There are a couple of further associations with these directories. - - -Directory wide skins. - - -Directory specific images. - - -2. If there is a "directory skin", that is a skin of the same name as the -directory, it is used in the generation of the documents within it, rather than -the default skin, unless the document has a specific skin associated with it. - - - a. default skin (always available) - - - b. directory skin (precedence over default if exists) - - - c. document skin (takes precedence wherever document requests a specific - skin) - - -Skins are defined in the document skin directory and if a directory association -is desired a softlink made to the relevant skin. Skins (directory association -auto load) auto load skin if a directory skin exists of same name as directory -stub, (and there is no specific doc skin) - - -3. If the working directory has within it a sub-directory called image_local, -the images within that directory are used for references to images, that are -not part of the default site build. - - -1.10 DOCUMENT VERSION CONTROL INFORMATION -......................................... - -The possibility of citing an exact document version. - - -Permits the inclusion of document version control information to the document -body and metatags.[^39] This provides a much more certain method of referring -to the exact version of a particular document, (assuming that the document is -from a trusted source, that will retain earlier versions of a document).[^40] - - -- [39]: from a version control system such as CVS - -- [40]: The version control system must be run, so the version number is obtained, - prior to the *SiSU* document generation, and subsequent posting of the - document. - -This information (where available) is provided under the section of the -document titled "Document Information - MetaData", near the end of a document, -for example in the segmented html version of this text at: - - - -1.11 TABLE OF CONTENTS -...................... - -*SiSU* produces a rudimentary a table of contents based on document headings. - - -1.12 AUTO-NUMBERING OF HEADINGS -............................... - -Headings can be automatically numbered, (and automatically named for -hyper-linking) - - -1.13 NUMBERING AND CROSS-HYPERLINKING OF ENDNOTES -................................................. - -*SiSU* can automatically number footnotes/endnotes. This is the default -operation where no number is provided. - - -Footnotes/endnotes may also be manually numbered. Where a number, or numbers -are provided for a footnote/endnote, this does not increment the automatic -footnote/endnote number counter. - - -In the html output footnotes/endnotes are cross-hyper-linked (to their -reference point and vice versa). In th pdf output footnotes are linked from -their reference point only. - - -1.14 "SKINNABLE" -................ - -*SiSU* is skinnable, on a site-wide, directory-wide and per document basis, so -different looking versions of things may be produced with little difficulty. -There is a default skin which may be modified, as the background site skin, and -each working directory may have a skin associated with it, as may each -individual document. The hierarchy of application is document, directory, then -site... ie if a document skin exists it gets precedence. - - -Whilst it is skinnable, the default output styles are selected to work across -the widest possible range of document types. - - -1.15 MULTIPLE OUTPUTS -..................... - -From markup that is simpler and more sparse than html you get: - - -* far greater output possibilities, including multiple html types, XML -(different structured types), LaTeX (pdf landscape, portrait), and SQL -(Postgresql or SQLite or other); - - -* the advantages implicit in these very different output possibilities;[^41] - - -- [41]: e.g. LaTeX (professional document typesetting, easy conversion to pdf or - Postscript), XML (in this case, structural representation), SQL (e.g. document - set searches; representation of the constituent parts of documents based on - their structure, headings, chapters, paragraphs as desired; control of use) - -* a common citation system - - -As many output formats/presentations as one cares to write modules for - -several types of html (e.g. structure based on css, or structure based on -tables); /LaTeX/pdf/ and /Lout/pdf/; pgsql other databases easily added; -yaml... - - -1.15.1 HTML - SEVERAL PRESENTATIONS: FULL LENGTH & SEGMENTED; CSS & TABLE BASED -.............................................................................. - -Most documents are produced in single and segmented html versions, described -below: - - -*The Scroll (full length text presentations)* - - -The full length of the text in a single scrollable document.[^42] As a rule the -files they are saved in are named: /doc/ or more precisely /doc.html/ - - -- [42]: CISG - - -- The Unidroit Contract Principles - or - -- The Autonomous Contract - - -For various reasons texts may only be provided in this form (such as this one -which is short), though most are also provided as segmented texts. - - -"Scroll" is a reference to the historical scroll, a single long document/ -parchment, and also no doubt to what you will have to do to get to the bottom -of the text.[^43] - - -- [43]: Scrolling is not however necessarily confined to full length documents as - you will have to scroll to get to the bottom of any long segment (eg. chapter) - of a segmented text. - -*The Segmented Text* - - -The text divided into segments (such as articles or chapters depending on the -text)[^44] As a rule the files they are saved in are named: /toc/ and /index/ -or more precisely /toc.html/ and /index.html/ - - -- [44]: CISG - - -- The Unidroit Principles - - -- The Autonomous Contract - or - -- WTA 1994 - -If you know exactly what you are looking for, loading a segment of text is -faster (the segments being smaller). Occasionally longer documents such as the -WTA 1994 are only provided in segmented -form. - - -*Cascading Style Sheet, and Table based html* - - -*SiSU* outputs html, two current standard forms available are: - - -css based [link:] - - -and - - -table based [largely discontinued ][^45] - - -- [45]: formatting possibility still exists in code tree but maintenance has been - largely discontinuted. - -*The html is tested across several browsers* - - -I like to remind you that there are other excellent browsers out there, many of -which have long supported practical features like tabbing. - - -The html is tested across several browsers, including: - - -* *Firefox* (Mozilla-Firefox) [link:] - [^46] - - -- [46]: - -* Kazehakase [link:] [^47] - - -- [47]: - -* Konqueror [link:] [^48] - - -- [48]: - -* Mozilla [link:] [^49] - - -- [49]: - -* MS Internet Explorer [link:] - [^50] - - -- [50]: - -* Netscape [link:] - [^51] - - -- [51]: - -* Opera [link:] [^52] - - -- [52]: - -Also lighter weight graphical browsers: - - -* Dillo [link:] [^53] - - -- [53]: - -* *Epiphany* [link:] [^54] - - -- [54]: - -* *Galeon* [link:] [^55] - - -- [55]: - -And for console/text browsing: - - -* *elinks* [link:] [^56] - - -- [56]: - -* *links2* [link:] [^57] - - -- [57]: - -* *w3m* [link:] [^58] - - -- [58]: - -The html tables output is rendered more accurately across a wider variety set -and older versions of browsers (than the html css output). - - -1.15.2 XML -.......... - -*SiSU* generates well formed XML, and multiple versions. An XML SAX version -with a flat/shallow structure, and XML DOM version with a deeper (embedded) -structure. There is also a released working xhtml module. Examples of SAX and -DOM versions are provided within this document. - - -1.15.3 ODT:ODF, OPEN DOCUMENT FORMAT - ISO/IEC 26300:2006 -......................................................... - -*SiSU* generates Open Document Output format. - - -1.15.4 PDF - PORTRAIT AND LANDSCAPE, (THROUGH THE GENERATION OF LATEX OUTPUT -WHICH IS THEN TRANSFORMED TO PDF) -.............................................................................. - -*SiSU* outputs LaTeX if required which is easily transformed to PDF.[^59] PDF -documents are generated on the site from the same source files and *Ruby* -program that produce html. Landscape oriented pdf introduced, providing easier -screen viewing, they are also (paper saving, being currently) formatted to have -fewer pages than their portrait equivalents. - - -- [59]: LaTeX and pdf features introduced 18^th^ June 2001, Landscape and portrait - pdfs introduced 7^th^ October 2001., Lout is a more recent addition 22^th^ - April 2003 - -* Adobe Reader [link:] -[^60] - - -- [60]: - -* *Evince* [link:] [^61] - - -- [61]: - -* xpdf [link:] [^62] - - -- [62]: - -1.15.5 SEARCH - LOADING/POPULATING OF RELATIONAL DATABASE WHILE RETAINING -DOCUMENT STRUCTURE INFORMATION, OBJECT CITATION NUMBERING AND OTHER FEATURES -(CURRENTLY POSTGRESQL AND/OR SQLITE) -.............................................................................. - -*SiSU* (from the same markup input file) automatically feeds into -PostgreSQL[^63] and/or SQLite[^64] database (could be any other of the better -relational databases)[^65] - together with all additional information related -to document structure, and the alternative ways in which it is generated on the -site retained. As regards scaling of the database, it is as scalable as the -database (here Postgresql or SQLite) and hardware allow. I will prune the -images later. - - -- [63]: - -- - -- - -- [64]: - -- - -- [65]: Relational database features retaining document structure and citation - introduced 15^th^ July 2002 - -This is one of the more interesting output forms, as all the structural data -for the documents are retained (though can be ignored by the user of the -database should they so choose). All site texts/documents are (currently) -streamed to four pgsql database tables: - - - * one containing semantic (and other) headers, including, title, author, - subject, (the Dublin Core...); - - - * another the substantive texts by individual "paragraph" (or object) - along - with structural information, each paragraph being identifiable by its - paragraph number (if it has one which almost all of them do), and the - substantive text of each paragraph quite naturally being searchable (both in - formatted and clean text versions for searching); and - - - * a third containing endnotes cross-referenced back to the paragraph from - which they are referenced (both in formatted and clean text versions for - searching). - - - * a fourth table with a one to one relation with the headers table contains - full text versions of output, eg. pdf, html, xml, and ascii. - - -There is of course the possibility to add further structures. - - -At this level *SiSU* loads a relational database with documents broken in to -their smallest logical structurally constituent parts, as text objects, with -their object citation number and all other structural information needed to -construct the structured document. Text is stored (at this text object level) -with and without elementary markup tagging, the stripped version being so as to -facilitate ease of searching. - - -Because the document structure of sites created is clearly defined, and the -text object citation system is available for all forms of output, it is -possible to search the sql database, and either read results from that -database, or just as simply map the results to the html output, which has -richer text markup. - - -The combination of the *SiSU* citation system with a relational database is -pretty powerful, giving rise to several possibilities. As individual text -objects of a document stored (and indexed) together with object numbers, and -all versions of the document have the same numbering, complex searches can be -tailored to return just the locations of the search results relevant for all -available output formats, with live links to the precise locations in the -database or in html/xml documents; or, the structural information provided -makes it possible to search the full contents of the database and have headings -in which search content appears, or to search only headings etc. (as the Dublin -Core is incorporated it is easy to make use of that as well). - - -This is a larger scale project, (with little development on the front end -largely ignored), though the "infrastructure" has been in place since 2002. - - -1.15.6 SEARCH - DATABASE FRONTEND SAMPLE, UTILISING DATABASE AND SISU FEATURES, -INCLUDING OBJECT CITATION NUMBERING (BACKEND CURRENTLY POSTGRESQL) -.............................................................................. - -Sample search frontend [link:] [^66] A small -database and sample query front-end (search from) that makes use of the -citation system, _object citation numbering_ to demonstrates -functionality.[^67] - - -- [66]: - -- [67]: (which could be extended further with current back-end). As regards scaling - of the database, it is as scalable as the database (here Postgresql) and - hardware allow. - -*SiSU* can provide information on which documents are matched and at what -locations within each document the matches are found. These results are -relevant across all outputs using object citation numbering, which includes -html, XML, LaTeX, PDF and indeed the SQL database. You can then refer to one of -the other outputs or in the SQL database expand the text within the matched -objects (paragraphs) in the documents matched. - - -(further work needs to be done on the sample search form, which is rudimentary -and only passes simple booleans correctly at present to the SQL engine) - - -A few canned searches, showing object numbers. Search for: - - -English documents matching Linux OR Debian [link:] - - - -GPL OR Richard Stallman [link:] - - - -invention OR innovation in English language [link:] - - - -copyright in English language documents [link:] - - - -Note that the searches done in this form are case sensitive. - - -Expand those same searches, showing the matching text in each document: - - -English documents matching Linux OR Debian [link:] - - - -GPL OR Richard Stallman [link:] - - - -invention OR innovation in English language [link:] - - - -copyright in English language documents [link:] - - - -Note you may set results either for documents matched and object number -locations within each matched document meeting the search criteria; or display -the names of the documents matched along with the objects (paragraphs) that -meet the search criteria.[^68] - - -- [68]: of this feature when demonstrated to an IBM software innovations evaluator - in 2004 he said to paraphrase: this could be of interest to us. We have large - document management systems, you can search hundreds of thousands of documents - and we can tell you which documents meet your search criteria, but there is no - way we can tell you without opening each document where within each your - matches are found. - -*OCN index mode,* (object citation number) the numbers displayed are relevant -(and may be used to reference the match) in any sisu generated rendition of the -text[^69] the links provided are to the locations of matches within the html -generated by *SiSU*. - - -- [69]: OCN are provided for HTML, XML, pdf ... though currently omitted in - plain-text and opendocument format output - -*Paragraph mode,* you may alternatively display the text of each paragraph in -which the match was made, again the object/paragraph numbers are relevant to -any *SiSU* generated/published text. - - -Several options for output - select database to search, show results in index -view (links to locations within text), show results with text, echo search in -form, show what was searched, create and show a "canned url" for search, show -available search fields. Also shows counters number of documents in which found -and number of locations within documents where found. [could consider sorting -by document with most occurrences of the search result]. - - -Earlier version of the search frontend - Simple search, results with files in -which search found, and locations where found within files. - - -Simple search, results with files in which search found, and text object -(paragraph or endnote) where found within files. - - -1.15.7 OTHER FORMS -.................. - -There are other forms as well, YAML file, *Ruby* Marshal dumps, document -pre-processing (processing of documents prior to the steps described here, to -produce input suitable for the program) snap in a new module as -required/desired, well formed XML, no problem. - - -1.16 CONCORDANCE / WORD MAP OR RUDIMENTARY INDEX -................................................ - -Concordance /WordMaps:[^70] *SiSU* produces a rudimentary index based on the -words within the text, making use of paragraph numbers to identify text -locations. This is generated in html and hyper-linked but identifies these -words locations in the other document formats. Though it is possible to search -using a search engine, this is a means for browsing an alphabetical list of -words which may suggest other useful content. - - -- [70]: Concordance/ WordMaps introduced 15^th^ August 2002 - -1.17 MANAGED (DOCUMENT) DIRECTORY, DATABASE, OR SITE STRUCTURE -.............................................................. - -*SiSU* builds the web site (or more generically provides a suitable directory -structure) - placing various output texts in the hierarchy of the web-site (or -db), which (for directories) is a sub-directory with the name of the text file. - - -1.18 BATCH PROCESSING -..................... - -*SiSU* is a batch processing tool, handling and transforming multiple (or -individual) documents (in many ways) with a single instruction. - - -1.19 INTEGRATION TO SUPERIOR GNU/LINUX AND UNIX TOOLS -..................................................... - -As should have been noted by the above description of *SiSU*, it makes use of -existing programs found on *Gnu* /Linux and Unix, amongst those already -mentioned include the LaTeX to pdf converters and the database PostgreSQL or -SQLite. - - -1.19.1 BACKUP AND VERSION CONTROL -................................. - -Unix provides many tools for version control. For documents Subversion, CVS and -even the old RCS are useful for the per-document histories they provide. - - -For writing code superior (more recent) version control system exist. These can -also be used for documents though they tend to take stamps of changes across -the repository as a whole, rather than for each individual file that is -tracked, (as CVS and RCS do). My personal preference is for distributed systems -such as Git, Mercurial or Darcs, of which I use Git for both code and -documents. - - -Several backup tools exist. At the base level I tend to use rdiff. - - -1.19.2 EDITOR SUPPORT -..................... - -*SiSU* documents are prepared / marked up in utf-8 text _you are free to use -the text editor of your choice._ - - -Syntax highlighting for a number of editors are provided. Amongst them Vim, -Kwrite, Kate, Gedit and diakonos. These may be found with configuration -instructions at . Vim [link:] - [^71] as of version 7 has built in sytax highlighting for -*SiSU*. - - -- [71]: - -1.20 MODULAR DESIGN, NEED SOMETHING NEW ADD A MODULE -.................................................... - -Need a new output format that does not already exist, write a new module. - - -Prefer a new input syntax, you could write a new syntax matching the existing -design, though my personal preference is some uniformity in entry appearance. -If necessary has been fairly easy to extend the design parameters. It is -intended to incorporate some additional basic semantic tagging, (book, article, -author etc.) However, keeping the requirements for input minimal, and -relatively simple has been a design goal. - - -DOCUMENT INFORMATION (METADATA) -******************************* - -METADATA --------- - -Document Manifest @ - - - -*Dublin Core* (DC) - - -/DC tags included with this document are provided here./ - - -DC Title: _SiSU - Description_ - - -DC Creator: _Ralph Amissah_ - - -DC Rights: _Copyright (C) Ralph Amissah 2007, part of SiSU documentation, -License GPL 3_ - - -DC Type: _information_ - - -DC Date created: _2002-11-12_ - - -DC Date issued: _2002-11-12_ - - -DC Date available: _2002-11-12_ - - -DC Date modified: _2007-08-30_ - - -DC Date: _2007-08-30_ - - -*Version Information* - - -Sourcefile: _sisu_description.sst_ - - -Filetype: _SiSU text 0.57_ - - -Sourcefile Digest, MD5(sisu_description.sst)= -_b89ccdad9f6d9c2260d8d383d6b35ccc_ - - -Skin_Digest: -MD5(/home/ralph/grotto/theatre/dbld/builds/sisu/sisu/data/doc/sisu/sisu_markup_samples/sisu_manual/_sisu/skin/doc/skin_sisu_manual.rb)= -_20fc43cf3eb6590bc3399a1aef65c5a9_ - - -*Generated* - - -Document (metaverse) last generated: _Tue Sep 25 02:54:06 +0100 2007_ - - -Generated by: _SiSU_ _0.59.1_ of 2007w39/2 (2007-09-25) - - -Ruby version: _ ruby 1.8.6 (2007-06-07 patchlevel 36) [i486-linux]_ - - - -============================================================================== - - title: SiSU - Description - - creator: Ralph Amissah - - rights: Copyright (C) Ralph Amissah 2007, part of SiSU documentation, - License GPL 3 - - type: information - - subject: ebook, epublishing, electronic book, electronic publishing, - electronic document, electronic citation, data structure, - citation systems, search - - date.created: 2002-11-12 - - date.issued: 2002-11-12 - - date.available: 2002-11-12 - - date.modified: 2007-08-30 - - date: 2007-08-30 - - - - - -============================================================================== -nil - -Other versions of this document: -manifest: - http://www.jus.uio.no/sisu/sisu_description/sisu_manifest.html -html: - http://www.jus.uio.no/sisu/sisu_description/toc.html -pdf: - http://www.jus.uio.no/sisu/sisu_description/portrait.pdf - http://www.jus.uio.no/sisu/sisu_description/landscape.pdf -plaintext (plain text): - http://www.jus.uio.no/sisu/sisu_description/plain.txt -at: - http://www.jus.uio.no/sisu -* Generated by: SiSU 0.59.1 of 2007w39/2 (2007-09-25) -* Ruby version: ruby 1.8.6 (2007-06-07 patchlevel 36) [i486-linux] -* Last Generated on: Tue Sep 25 02:54:08 +0100 2007 -* SiSU http://www.jus.uio.no/sisu diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/portrait.pdf b/data/doc/manuals_generated/sisu_manual/sisu_description/portrait.pdf deleted file mode 100644 index a8a9eda1..00000000 Binary files a/data/doc/manuals_generated/sisu_manual/sisu_description/portrait.pdf and /dev/null differ diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/sax.xml b/data/doc/manuals_generated/sisu_manual/sisu_description/sax.xml deleted file mode 100644 index 32a2265b..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/sax.xml +++ /dev/null @@ -1,2731 +0,0 @@ - - - - - - - Title: - - SiSU - Description - -
    - Creator: - - Ralph Amissah - -
    - Rights: - - Copyright (C) Ralph Amissah 2007, part of SiSU documentation, License GPL 3 - -
    - Type: - - information - -
    - Subject: - - ebook, epublishing, electronic book, electronic publishing, electronic document, electronic citation, data structure, citation systems, search - -
    - Date created: - - 2002-11-12 - -
    - Date issued: - - 2002-11-12 - -
    - Date available: - - 2002-11-12 - -
    - Date modified: - - 2007-08-30 - -
    - Date: - - 2007-08-30 - -
    - - - - 1 - - SiSU - Description,
    Ralph Amissah -
    -
    - - 2 - - SiSU an attempt to describe - - - - 3 - - 1. Description - - - - 4 - - 1.1 Outline - - - - 5 - - SiSU is a flexible document preparation, generation publishing -and search system.1 - - - 1 - - This information was first placed on the web 12 November 2002; with -predating material taken from <http://www.jus.uio.no/lm/lm.information/toc.html> -part of a site started and developed since 1993. See document metadata -section <http://www.jus.uio.no/sisu/SiSU/metadata.html> -for information on this version. Dates related to the development of -SiSU are mostly contained within the Chronology section of this -document, e.g. <http://www.jus.uio.no/sisu/sisu_chronology> - - - - - 6 - - SiSU ("SiSU information Structuring Universe" or -"Structured information, Serialized Units"),2 is a Unix -command line oriented framework for document structuring, publishing -and search. Featuring minimalistic markup, multiple standard outputs, a -common citation system, and granular search. - - - 2 - - also chosen for the meaning of the Finnish term "sisu". - - - - - 7 - - Using markup applied to a document, SiSU can produce plain text, -HTML, XHTML, XML, OpenDocument, LaTeX or PDF files, and populate an SQL -database with objects3 (equating generally to paragraph-sized -chunks) so searches may be performed and matches returned with that -degree of granularity (e.g. your search criteria is met by these -documents and at these locations within each document). Document output -formats share a common object numbering system for locating content. -This is particularly suitable for "published" works (finalized texts as -opposed to works that are frequently changed or updated) for which it -provides a fixed means of reference of content. - - - 3 - - objects include: headings, paragraphs, verse, tables, images, but not -footnotes/endnotes which are numbered separately and tied to the object -from which they are referenced. - - - - - 8 - - SiSU is the data/information structuring and transforming tool, -that has resulted from work on one of the oldest law web projects. It -makes possible the one time, simple human readable markup of documents, -that SiSU can then publish in various forms, suitable for -paper4, web5 and relational database6 -presentations, retaining common data-structure and meta-information -across the output/presentation formats. Several requirements of legal -and scholarly publication on the web have been addressed, including the -age old need to be able to reliably cite/pinpoint text within a -document, to easily make footnotes/endnotes, to allow for semantic -document meta-tagging, and to keep required markup to a minimum. These -and other features of interest are listed and described below. A few -points are worth making early (and will be repeated a number of times): - - - 4 - - pdf via LaTeX or lout - - - - 5 - - currently html (two forms of html presentation one based on css the -other on tables), and PHP; potentially structured XML - - - - 6 - - any SQL - currently PostgreSQL and sqlite (for portability, -testing and development) - - - - - 9 - - (i) The SiSU document generator was the first to place -material on the web with a system that makes possible citation across -different document types, with paragraph, or rather object citation -numbering7 a text positioning system, available for the -pinpointing of text, 1997, a simple idea from which much benefit, and -SiSU remains today, to the best of my knowledge, the only -multiple format e-book/ electronic-document system on the web that -gives you this possibility (including for relational databases). - - - 7 - - previously called "text object numbering" - - - - - 10 - - (ii) Markup is done once for the multiple formats produced. - - - - 11 - - (iii) Markup is simple, and human readable (with a little -practice), in almost all cases there is less and simpler markup -required than basic html. In any event the markup required is very much -simpler than the html, LaTeX, [lout], structured XML, ODF -(OpenDocument), PostgreSQL or SQLite feed etc. that you can have -SiSU generate for you. - - - - 12 - - (iv) SiSU is a batch processor, dealing with as many files -as you need to generate at a time. - - - - 13 - - (v) Scalability is dependent on your file system (in my case -Reiserfs), the database (currently Postgresql and/or SQLite) and your -hardware. - - - - 14 - - SiSU Sabaki8 (or just SiSU) is the provisional -name given to the software described here that helps structure -documents for web and other publication. The name SiSU is a -loose anagram for something along the lines of "SiSU is -structuring unit", or "SiSU, information structuring -unit" or the more descriptive "Structured information, -Serialized Units" or "simple - information structuring -unit" or the more descriptive "Structured information, -Serialized Units" or what it may be directed towards -"semantic and information structuring universe" -,9 tongue in cheek, only just. Guess I'll get away with -"Simple - information Structuring Universe". SiSU -is also a Finnish word roughly meaning guts, inner strength and -perseverance.10 - - - 8 - - SiSU Sabaki, release version. Pre-release version SiSU -Scribe, and version prior to that SiSU nicknamed Scribbler. -Pre-release versions go back several years. Both Scribbler and Scribe -(still maintained) made system calls to SiSU's various parts, -instead of using libraries. - - - - 9 - - A little universe it may be, but semantic you may have a hard time -getting away with, given the meaning the word has taken on with markup. -On a document wide basis semantic information may be provided, which -can be really useful, (and meaningful, especially) if you have a large -document set, and use this with rss feeds or in an sql database etc. On -a markup level, I have little inclination to add semantic markup -formally beyond references, title, author [Dublin Core entities? -addresses?] etc. Actually this deserves a bit of thought possibly use -letter tags (including letter alias/synonyms for font faces) to create -a small set of default semantic tags, with the possibility for per -document adjustments. Will seek to permit XML entity tagging, within -SiSU markup and have that ignored/removed by the parts of the -program that have no use for it. - - - - 10 - - "Sisu refers not to the courage of optimism, but to a concept of -life that says, 'I may not win, but I will gladly give my life for what -I believe.'" Aini Rajanen, Of Finnish Ways, 1981, p. 10.
    -<http://www.humanlanguages.com/finnishenglish/rlfs.htm> -
    "Every Finn has his own pet definition. To me, sisu means -patience without passion. But there are many varieties of sisu. Sisu -can be a sudden outburst or it can be the kind that lasts. A man can -have both kinds. It is outside reason. It is something in the soul. It -comes from oneself. For instance, it makes a soldier do things because -he himself must, not because he has been told." Paavo Nurmi
    -<http://personalweb.smcvt.edu/tmatikainen/finnishtraditions.htm> -
    -
    -
    - - 15 - - SiSU was born of the need to find a way, with minimal effort, -and for as wide a range of document types as possible, to produce high -quality publishing output in a variety of document formats. As such it -was necessary to find a simple document representation that would work -across a large number of document types, and the most convenient way(s) -to produce acceptable output formats. The project leading to this -program was started in 1993 (together with the trade law project now -known as Lex Mercatoria) as an investigation of how to -effectively/efficiently place documents on the web. The unified -document handling, together with features such as paragraph numbering, -endnote handling and tables... appeared in 1996/97. SiSU was -originally written in Perl,11 and converted to Ruby, -12 in 2000, one of the most impressive programming languages -in existence! In its current form it has been written to run on the -Gnu /Linux platform, and in particular on Debian, -13 taking advantage of many of the wonderful projects that are -available there. - - - 11 - - <http://www.perl.org/> - - - - 12 - - <http://www.ruby-lang.org/en/> - - - - 13 - - <http://www.debian.org/> - - - - - 16 - - SiSU markup is based on requiring the minimum markup needed to -determine the structure of a document. (This can be as little as saying -in a header to look for the word Book at a specified level and the word -Chapter at another level). SiSU then breaks a document into its -smallest parts (at a heading, and paragraph level) while retaining all -structural information. This break up of the document and information -on its structure is taken advantage of in the transformations made in -generating the very different output types that can be created, and in -providing as much as can be for what each output type is best at doing, -e.g. LaTeX (professional document typesetting, easy conversion to pdf -or Postscript), XML (in this case, structural representation), ODF -(OpenDocument [experimental]), SQL (e.g. document search; representing -constituent parts of documents based on their structure, headings, -chapters, paragraphs as required; user control).14 - - - 14 - - where explicit structure is provided through the use of tagging -headings, it could be reduced (still) further, for example by reducing -the number of characters used to identify heading levels; but in many -cases even that information is not required as regular expressions can -be used to extract the implicit structure. - - - - - 17 - - From markup that is simpler and more sparse than html you get: - - - - 18 - - far greater output possibilities, including html, XML, ODF -(OpenDocument), LaTeX (pdf), and SQL; - - - - 19 - - the advantages implicit in the very different output possibilities; - - - - 20 - - a common citation system (for all outputs - including the relational -database, search results are relevant for all outputs); - - - - 21 - - For more see the short summary of features provided below. - - - - 22 - - SiSU processes files with minimal tagging to produce various -document outputs including html, LaTeX or lout (which is converted to -pdf) and if required loads the structured information into an SQL -database (PostgreSQL and SQLite have been used for this). SiSU -produces an intermediate processing format.15 - - - 15 - - This proved to be the easiest way to develop syntax, changes could -be made, or alternatives provided for the markup syntax whilst the -intermediate markup syntax was largely held constant. There is actually -an optional second intermediate markup format in YAML <http://www.yaml.org/> - - - - - 23 - - SiSU is used in constructing Lex Mercatoria <http://lexmercatoria.org/> -or <http://www.jus.uio.no/lm/> -(one of the oldest law web sites), and considerable thought went into -producing output that would be suitable for legal and academic writings -(that do not have formulae) given the limitations of html, and -publication in a wide variety of "formats", in particular in relation -to the convenient and accurate citation of text. However, the -construction of Lex Mercatoria uses only a fraction of the features -available from SiSU today, vis generation of flat file -structures, rather than in addition the building of ("granular") SQL -database content, (at an object level with relevant relational tables, -and other outputs also available). - - - - 24 - - 1.2 Short summary of features - - - - 25 - - (i) markup syntax: (a) simpler than html, (b) mnemonic, -influenced by mail/messaging/wiki markup practices, (c) human readable, -and easily writable, - - - - 26 - - (ii) (a) minimal markup requirement, (b) single file marked up -for multiple outputs, - - - - 27 - - notes: - - - - 28 - - * documents are prepared in a single UTF-8 file using a minimalistic -mnemonic syntax. Typical literature, documents like "War and Peace" -require almost no markup, and most of the headers are optional. - - - - 29 - - * markup is easily readable/parsed by the human eye, (basic markup is -simpler and more sparse than the most basic html), [this may also be -converted to XML representations of the same input/source document]. - - - - 30 - - * markup defines document structure (this may be done once in a header -pattern-match description, or for heading levels individually); basic -text attributes (bold, italics, underscore, strike-through etc.) as -required; and semantic information related to the document (header -information, extended beyond the Dublin core and easily further -extended as required); the headers may also contain processing -instructions. - - - - 31 - - (iii) (a) multiple outputs primarily industry established and -institutionally accepted open standard formats, include amongst others: -plaintext (UTF-8); html; (structured) XML; ODF (Open Document text)l; -LaTeX; PDF (via LaTeX); SQL type databases (currently PostgreSQL and -SQLite). Also produces: concordance files; document content -certificates (md5 or sha256 digests of headings, paragraphs, images -etc.) and html manifests (and sitemaps of content). (b) takes advantage -of the strengths implicit in these very different output types, (e.g. -PDFs produced using typesetting of LaTeX, databases populated with -documents at an individual object/paragraph level, making possible -granular search (and related possibilities)) - - - - 32 - - (iv) outputs share a common numbering system (dubbed "object -citation numbering" (ocn)) that is meaningful (to man and machine) -across various digital outputs whether paper, screen, or database -oriented, (PDF, html, XML, sqlite, postgresql), this numbering system -can be used to reference content. - - - - 33 - - (v) SQL databases are populated at an object level (roughly -headings, paragraphs, verse, tables) and become searchable with that -degree of granularity, the output information provides the -object/paragraph numbers which are relevant across all generated -outputs; it is also possible to look at just the matching paragraphs of -the documents in the database; [output indexing also work well with -search indexing tools like hyperesteier]. - - - - 34 - - (vi) use of semantic meta-tags in headers permit the addition of -semantic information on documents, (the available fields are easily -extended) - - - - 35 - - (vii) creates organised directory/file structure for -(file-system) output, easily mapped with its clearly defined structure, -with all text objects numbered, you know in advance where in each -document output type, a bit of text will be found (e.g. from an SQL -search, you know where to go to find the prepared html output or PDF -etc.)... there is more; easy directory management and document -associations, the document preparation (sub-)directory may be used to -determine output (sub-)directory, the skin used, and the SQL database -used, - - - - 36 - - (viii) "Concordance file" wordmap, consisting of all the words -in a document and their (text/ object) locations within the text, (and -the possibility of adding vocabularies), - - - - 37 - - (ix) document content certification and comparison -considerations: (a) the document and each object within it stamped with -an md5 hash making it possible to easily check or guarantee that the -substantive content of a document is unchanged, (b)version control, -documents integrated with time based source control system, default RCS -or CVS with use of $Id: sisu_description.sst,v 1.25 2007/08/23 12:22:36 -ralph Exp $ tag, which SiSU checks - - - - 38 - - (x) SiSU's minimalist markup makes for meaningful -"diffing" of the substantive content of markup-files, - - - - 39 - - (xi) easily skinnable, document appearance on a project/site -wide, directory wide, or document instance level easily -controlled/changed, - - - - 40 - - (xii) in many cases a regular expression may be used (once in -the document header) to define all or part of a documents structure -obviating or reducing the need to provide structural markup within the -document, - - - - 41 - - (xiii) prepared files may be batch process, documents produced -are static files so this needs to be done only once but may be repeated -for various reasons as desired (updated content, addition of new output -formats, updated technology document presentations/representations) - - - - 42 - - (xiv) possible to pre-process, which permits: the easy creation -of standard form documents, and templates/term-sheets, or; building of -composite documents (master documents) from other sisu marked up -documents, or marked up parts, i.e. import documents or parts of text -into a main document should this be desired - - - - 43 - - there is a considerable degree of future-proofing, output -representations are "upgradeable", and new document formats may be -added. - - - - 44 - - (xv) there is a considerable degree of future-proofing, output -representations are "upgradeable", and new document formats may be -added: (a) modular, (thanks in no small part to Ruby) another -output format required, write another module.... (b) easy to update -output formats (eg html, XHTML, LaTeX/PDF produced can be updated in -program and run against whole document set), (c) easy to add, modify, -or have alternative syntax rules for input, should you need to, - - - - 45 - - (xvi) scalability, dependent on your file-system (ext3, -Reiserfs, XFS, whatever) and on the relational database used (currently -Postgresql and SQLite), and your hardware, - - - - 46 - - (xvii) only marked up files need be backed up, to secure the -larger document set produced, - - - - 47 - - (xviii) document management, - - - - 48 - - (xix) Syntax highlighting for SiSU markup is available -for a number of text editors. - - - - 49 - - (xx) remote operations: (a) run SiSU on a remote server, -(having prepared sisu markup documents locally or on that server, i.e. -this solution where sisu is installed on the remote server, would work -whatever type of machine you chose to prepare your markup documents -on), (b) generated document outputs may be posted by sisu to remote -sites (using rsync/scp) (c)document source (plaintext utf-8) if shared -on the net may be identified by its url and processed locally to -produce the different document outputs. - - - - 50 - - (xxi) document source may be bundled together (automatically) -with associated documents (multiple language versions or master -document with inclusions) and images and sent as a zip file called a -sisupod, if shared on the net these too may be processed locally to -produce the desired document outputs, these may be downloaded, shared -as email attachments, or processed by running sisu against them, either -using a url or the filename. - - - - 51 - - (xxii) for basic document generation, the only software -dependency is Ruby, and a few standard Unix tools (this covers -plaintext, html, XML, ODF, LaTeX). To use a database you of course need -that, and to convert the LaTeX generated to PDF, a LaTeX processor like -tetex or texlive. - - - - 52 - - as a developers tool it is flexible and extensible - - - - 53 - - SiSU was developed in relation to legal documents, and is strong -across a wide variety of texts (law, literature...). SiSU -handles images but is not suitable for formulae/ statistics, or for -technical writing at this time. - - - - 54 - - SiSU has been developed and has been in use for several years. -Requirements to cover a wide range of documents within its use domain -have been explored. - - - - 55 - - Some modules are more mature than others, the most mature being Html -and LaTeX / pdf. PostgreSQL and search functions are useable and -together with ocn unique (to the best of my knowledge). The XML -output document set is "well formed" but largely proof of concept. - - - - 56 - - 1.3 How it works - - - - 57 - - SiSU markup is fairly minimalistic, it consists of: a (largely -optional) document header, made up of information about the document -(such as when it was published, who authored it, and granting what -rights) and any processing instructions; and markup within text which -is related to document structure and typeface. SiSU must be able -to discern the structure of a document, (text headings and their levels -in relation to each other), either from information provided in the -instruction header or from markup within the text (or from a -combination of both). Processing is done against an abstraction of the -document comprising of information on the document's structure and its -objects,16 which the program serializes (providing the object -numbers) and which are assigned hash sum values based on their content. -This abstraction of information about document structure, objects, (and -hash sums), provides considerable flexibility in representing documents -different ways and for different purposes (e.g. search, document -layout, publishing, content certification, concordance etc.), and makes -it possible to take advantage of some of the strengths of established -ways of representing documents, (or indeed to create new ones). - - - 16 - - objects include: headings, paragraphs, verse, tables, images, but -not footnotes/endnotes which are numbered separately and tied to the -object from which they are referenced. - - - - - 58 - - 1.4 Simple markup - - - - 59 - - SiSU markup is based on requiring the minimum markup needed to -determine the structure of a document. (This can be as little as saying -in a header to look for the word Book at a specified level and the word -Chapter at another level). SiSU then breaks a document into its -smallest parts (at a heading, and paragraph level) while retaining all -structural information. This break up of the document and information -on its structure is taken advantage of in the transformations made in -generating the very different output types that can be created, and in -providing as much as can be for what each output type is best at doing, -e.g. LaTeX (professional document typesetting, easy conversion to pdf -or Postscript), XML (in this case, structural representation), ODF -(OpenDocument), SQL (e.g. document search; representing constituent -parts of documents based on their structure, headings, chapters, -paragraphs as required; user control).17 - - - 17 - - where explicit structure is provided through the use of tagging -headings, it could be reduced (still) further, for example by reducing -the number of characters used to identify heading levels; but in many -cases even that information is not required as regular expressions can -be used to extract the implicit structure. - - - - - 60 - - 1.4.1 Sparse markup requirement, try to get the most out of markup - - - - 61 - - One of its strengths is that very small amounts of initial tagging is -required for the program to generate its output. - - - - 62 - - This is a basic markup example: - - - - 63 - - -basic markup example, text file - an international convention -18 - - - 18 - - <http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst> -output provided as example in the next section - - - - - 64 - - -view basic markup, as it would be highlighted by vim editor -19 - - - 19 - - <http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html> -as it would appear with syntax highlighting (by vim) - - - - - 65 - - Emphasis has been on simplicity and minimalism in markup requirements. -Design philosophy is to try keep the amount of markup required low, for -whatever has been determined to be acceptable output.20 - - - 20 - - seems there are several "smart ASCIIs" available, primarily for -ascii to html conversion, that make this, and reasonable looking ascii -their goal
    <http://webseitz.fluxent.com/wiki/SmartAscii> -
    <http://daringfireball.net/projects/markdown/> -
    <http://www.textism.com/tools/textile/> -
    -
    -
    - - 66 - - SiSU's markup is more minimalistic and simpler than (the -equivalent) html and for it, you get considerably more than just html, -as this preparation gives you all available output formats, upon -request. - - - - 67 - - 1.4.2 Single markup file provides multiple output formats - - - - 68 - - For each document, there is only one (input, minimalistically marked -up) file from which all the available output types are -generated.21 - - - 21 - - These include richly laid out and linked html (table or css -variants), PHP, LaTeX (from which pdf portrait and landscape -documents are produced), texinfo (for info files etc.), and PostgreSQL -and/or SQLite. And the opportunity to fairly easily build additional -modules, such as XML. See the examples provided in this document. - - - - - 69 - - Eg. the markup example: - - - - 70 - - -original text file - an international convention 22 - - - 22 - - <http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst> - - - - - 71 - - -view as syntax would be highlighted by vim editor 23 - - - 23 - - <http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html> - - - - - 72 - - Produces the following output: - - - - 73 - - -Segmented html version of document 24 - - - 24 - - <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/toc.html> - - - - - 74 - - -Full length html document 25 - - - 25 - - <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/doc.html> - - - - - 75 - - -pdf landscape version of document 26 - - - 26 - - <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/landscape.pdf> - - - - - 76 - - -pdf portrait version of document 27 - - - 27 - - <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/portrait.pdf> - - - - - 77 - - -clean tex ascii version of document 28 - - - 28 - - <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/plain.txt> - - - - - 78 - - -xml sax version of document 29 - - - 29 - - <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/sax.xml> - - - - - 79 - - -xml dom version of document 30 - - - 30 - - <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/dom.xml> - - - - - 80 - - -Concordance 31 - - - 31 - - <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/concordance.html> - - - - - 81 - - (and in addition to these: PostgreSQL, SQLite, texinfo and -YAML 32 versions if desired) - - - 32 - - discontinued for the time being - - - - - 82 - - 1.4.3 Syntax relatively easy to read and remember - - - - 83 - - Syntax is kept simple and mnemonic.33 - - - 33 - - SiSU markup syntax, an incomplete summary: <http://www.jus.uio.no/sisu/sisu_markup_table/doc.html#h200306> -
    Visual check of elementary font face modifiers: bold -bold emphasis italics underscore -strikethrough superscript subscript -
    -
    -
    - - 84 - - 1.4.4 Kept simple by having a limited publishing feature set, and -features identified as most important, are available across several -document types - - - - 85 - - To keep SiSU markup sparse and simple SiSU deliberately -provides a limited publishing feature set, including: indent levels; -bold; italics; superscript; subscript; simple tables; images; tables of -contents and; endnotes. Which in most cases are available across the -different output formats. - - - - 86 - - The publishing feature set may be expanded as required. - - - - 87 - - 1.5 Designed with usability in mind - - - - 88 - - Output is designed to be uniform, easy to read, navigate and cite. - - - - 89 - - 1.6 Code separate from content - - - - 90 - - Code34 is separated from content. This means that when changes -are desired in the output presentation, the code that produces them, -and not the marked up text data set (which could be thousands of -documents) is modified. Separating code from content makes large scale -changes to output appearance trivial, and permits the easy addition of -new output modules. - - - 34 - - the program that generates the documents - - - - - 91 - - 1.7 Object citation numbering, a text or object positioning / citation -system - "paragraph" (or text object) numbering, that remains same and -usable across all output formats by people and machine - - - - - 92 - - Object citation numbering is a simple object (text) positioning and -cition system that is human relevant and machine useable, used by -SiSU for all manner of presentations, and that is available for -use in all text mappings. It is based on the automated sequential -numbering of objects (roughly paragraphs, (headings, tables, verse) or -other blocks of text or images etc.). The text positioning system (in -which I claim copyright) is invaluable for publishing requiring the -citing text across multiple output formats, and for the general mapping -of text within a document: - - - - 93 - - in html, html not being easily citeable (change font size, or use a -different browser and the page on which specific text appears has -changed), and - - - - 94 - - across multiple formats being common to all output formats -html/xml/pdf/sql output, - - - - 95 - - the results of an sql search can just be "live" citation references to -the documents in which the text is found, much like -an index (see image examples provided). 35 - - - 35 - - <http://www.jus.uio.no/sisu/SiSU/1.html#search> - - - - - 96 - - I claim copyright on the system I use which is the most basic of all, -numbering all text in headings and paragraphs sequentially (with tables -and images being treated as a single paragraph) and only -footnotes/endnotes not following this numbering, as their position in -text is not strictly determined, (a change from footnotes to endnotes -would change their numbering), footnotes instead "belong" to the -paragraph from which they are referenced, and have sequential numbers -of their own. - - - - 97 - - SiSU has a paragraph numbering system, that remains the same -regardless of the output format. This provides an effective means of -citation, pinpointing text accurately in all output formats, using the -same reference. This is particularly useful where text has to be -located across different output formats - for example once html is -printed the number of pages and pages on which given text is found will -vary depending on the browser, its settings the font size setting etc. -Similarly SiSU produces pdf in different forms, eg. on the -example site Lex Mercatoria as portrait and landscape documents - here -too page numbering varies, but paragraph numbering is the same, vis -a vis all versions of the text (portrait and landscape pdf and the -html versions of the text, and as stored (with "paragraphs" as records) -to the PostgreSQL or SQLite database). - - - - 98 - - These numbers are placed in the text margins and are intended to be -independent of and not to interfere with authors tagging. [The citation -system (object citation numbering system, automated "paragraph -numbering") which is automatically generated and is common and -identical across all document formats] The paragraph numbering system -is more accurately described as an (text) object numbering system, as -headings are also numbered... all headings and paragraphs are numbered -sequentially. Endnotes are automatically numbered independently and -rather "belong" to the paragraph from which they are referenced, as an -endnote does not (necessarily) form a part of a documents sequence, -(they may be produced as either endnotes or footnotes (or both -depending on what output you choose to look at - if you take the -segmented html version document provided as an example, you will find -that the endnotes are placed both at the end of each section, and in a -separate section of their own called endnotes, and these are -hyper-linked)). An attractive feature of providing citation numbering -in this way is that it is independent of the document structure... it -remains the same regardless of what is done about the document -structure. - - - - 99 - - The rules have been kept very simple, unique incremental object -citation numbers are assigned to headings, paragraphs, verse, tables -and images. It is possible to manually override this feature on a per -heading or comment basis though this should be used exceptionally, it -may be of use where there a substantive text, and the addition of a -minor comment by the publisher that should not be mapped as part of the -text. - - - - 100 - - The object citation number markers contain additional numbering -information with regard to the document structure, that can be used for -alternative presentations, including such detail as the type of object -(heading, paragraph, table, image, etc.), numbered sequentially. - - - - 101 - - An advantage is that the numbering remains the same regardless of -document structure. - - - - 102 - - Text object ("paragraph") numbering is the same for all output versions -of the same document, vis html, pdf, pgsql, yaml etc. - - - - 103 - - In the relational database, as individual text objects of a document -stored (and indexed) together with object numbers, and all versions of -the document have the same numbering, the results of searches may be -tailored just to provide the location of the search result in all -available document formats. - - - - 104 - - Note: there is a bug in the released behaviour of object citation -numbering, (not certain when it was introduced) tables should be -numbered, ie each table gets an ocn, required amongst other things for -relational database. This will be corrected in a future release. -Citation numbering of existing documents that contain tables will -changed. - - - - 105 - - 1.8 Handling of Dublin Core meta-tags making use of the Resource -Description Framework - - - - 106 - - SiSU is able to use meta tags based on the Dublin -Core36 and Resource Description Framework37 - - - 36 - - <http://dublincore.org/> - - - - 37 - - <http://www.w3.org/RDF/> - - - - - 107 - - This provides the means of providing semantic information about a -document, both as computer processable meta-tags, and as human readable -information that may be of value for classification purposes. - - - - 108 - - This information is provided both in html metatags, and (where -available) under the section titled "Document Information - MetaData", -near the end of a document, for example in the segmented html version -of this text at: <http://www.jus.uio.no/sisu/SiSU/metadata.html> - - - - 109 - - 1.9 Easy directory management - - - - 110 - - 1. Directory file association, skins and special image management, made -simpler.38 - - - 38 - - The previous way was directory associations for file output were set -up in the configuration file. The present system is a more natural way -to work requireing less configuration. - - - - - 111 - - The last part of the name of the work directory in which markup is -being done, or rather from where SiSU is run in order to -generate document output, is used in determining the sub-directory name -for output files, that is created in the document output directory. -This provides a rather easy way to associate documents e.g. of a given -subject, or by owner. - - - - 112 - -      /www/docs
             /intellectual_property
             /arbitration
             /contract_law

         /www/docs
             /ralph
             /sisu     -
    -
    - - 113 - - all are placed in their own directories within the directory structure -created. Similar rules are used in the creation of sql type databases -(though they can be overridden). - - - - 114 - - There are a couple of further associations with these directories. - - - - 115 - - Directory wide skins. - - - - 116 - - Directory specific images. - - - - 117 - - 2. If there is a "directory skin", that is a skin of the same name as -the directory, it is used in the generation of the documents within it, -rather than the default skin, unless the document has a specific skin -associated with it. - - - - 118 - - a. default skin (always available) - - - - 119 - - b. directory skin (precedence over default if exists) - - - - 120 - - c. document skin (takes precedence wherever document requests a -specific skin) - - - - 121 - - Skins are defined in the document skin directory and if a directory -association is desired a softlink made to the relevant skin. Skins -(directory association auto load) auto load skin if a directory skin -exists of same name as directory stub, (and there is no specific doc -skin) - - - - 122 - - 3. If the working directory has within it a sub-directory called -image_local, the images within that directory are used for references -to images, that are not part of the default site build. - - - - 123 - - 1.10 Document Version Control Information - - - - 124 - - The possibility of citing an exact document version. - - - - 125 - - Permits the inclusion of document version control information to the -document body and metatags.39 This provides a much more -certain method of referring to the exact version of a particular -document, (assuming that the document is from a trusted source, that -will retain earlier versions of a document).40 - - - 39 - - from a version control system such as CVS - - - - 40 - - The version control system must be run, so the version number is -obtained, prior to the SiSU document generation, and subsequent -posting of the document. - - - - - 126 - - This information (where available) is provided under the section of the -document titled "Document Information - MetaData", near the end of a -document, for example in the segmented html version of this text at: -<http://www.jus.uio.no/sisu/SiSU/metadata.html> - - - - 127 - - 1.11 Table of contents - - - - 128 - - SiSU produces a rudimentary a table of contents based on -document headings. - - - - 129 - - 1.12 Auto-numbering of headings - - - - 130 - - Headings can be automatically numbered, (and automatically named for -hyper-linking) - - - - 131 - - 1.13 Numbering and cross-hyperlinking of endnotes - - - - 132 - - SiSU can automatically number footnotes/endnotes. This is the -default operation where no number is provided. - - - - 133 - - Footnotes/endnotes may also be manually numbered. Where a number, or -numbers are provided for a footnote/endnote, this does not increment -the automatic footnote/endnote number counter. - - - - 134 - - In the html output footnotes/endnotes are cross-hyper-linked (to their -reference point and vice versa). In th pdf output footnotes are linked -from their reference point only. - - - - 135 - - 1.14 "Skinnable" - - - - 136 - - SiSU is skinnable, on a site-wide, directory-wide and per -document basis, so different looking versions of things may be produced -with little difficulty. There is a default skin which may be modified, -as the background site skin, and each working directory may have a skin -associated with it, as may each individual document. The hierarchy of -application is document, directory, then site... ie if a document skin -exists it gets precedence. - - - - 137 - - Whilst it is skinnable, the default output styles are selected to work -across the widest possible range of document types. - - - - 138 - - 1.15 Multiple Outputs - - - - 139 - - From markup that is simpler and more sparse than html you get: - - - - 140 - - far greater output possibilities, including multiple html types, XML -(different structured types), LaTeX (pdf landscape, portrait), and SQL -(Postgresql or SQLite or other); - - - - 141 - - the advantages implicit in these very different output -possibilities;41 - - - 41 - - e.g. LaTeX (professional document typesetting, easy conversion to -pdf or Postscript), XML (in this case, structural representation), SQL -(e.g. document set searches; representation of the constituent parts of -documents based on their structure, headings, chapters, paragraphs as -desired; control of use) - - - - - 142 - - a common citation system - - - - 143 - - As many output formats/presentations as one cares to write modules for -- several types of html (e.g. structure based on css, or structure -based on tables); LaTeX/pdf and Lout/pdf; pgsql other -databases easily added; yaml... - - - - 144 - - 1.15.1 html - several presentations: full length & segmented; css -& table based - - - - 145 - - Most documents are produced in single and segmented html versions, -described below: - - - - 146 - - The Scroll (full length text presentations) - - - - 147 - - The full length of the text in a single scrollable document.42 -As a rule the files they are saved in are named: doc or more -precisely doc.html - - - 42 - - CISG <http://www.jus.uio.no/lm/un_contracts_international_sale_of_goods_convention_1980/doc> -
    The Unidroit Contract Principles <http://www.jus.uio.no/lm/unidroit.contract.principles.1994/doc> -or
    The Autonomous Contract <http://www.jus.uio.no/lm/autonomous.contract.2000.amissah/doc> -
    -
    -
    - - 148 - - For various reasons texts may only be provided in this form (such as -this one which is short), though most are also provided as segmented -texts. - - - - 149 - - "Scroll" is a reference to the historical scroll, a single long -document/ parchment, and also no doubt to what you will have to do to -get to the bottom of the text.43 - - - 43 - - Scrolling is not however necessarily confined to full length -documents as you will have to scroll to get to the bottom of any long -segment (eg. chapter) of a segmented text. - - - - - 150 - - The Segmented Text - - - - 151 - - The text divided into segments (such as articles or chapters depending -on the text)44 As a rule the files they are saved in are -named: toc and index or more precisely toc.html -and index.html - - - 44 - - CISG <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980> -
    The Unidroit Principles <http://www.jus.uio.no/lm/unidroit.contract.principles.1994> -
    The Autonomous Contract <http://www.jus.uio.no/sisu/the.autonomous.contract.2000.amissah> -or
    WTA 1994 <http://www.jus.uio.no/lm/wta.1994> -
    -
    -
    - - 152 - - If you know exactly what you are looking for, loading a segment of text -is faster (the segments being smaller). Occasionally longer documents -such as the WTA 1994 <http://www.jus.uio.no/lm/wta.1994/toc> -are only provided in segmented form. - - - - 153 - - Cascading Style Sheet, and Table based html - - - - 154 - - SiSU outputs html, two current standard forms available are: - - - - 155 - - css based - - - - - 156 - - and - - - - 157 - - table based [largely discontinued ]45 - - - 45 - - formatting possibility still exists in code tree but maintenance has -been largely discontinuted. - - - - - 158 - - The html is tested across several browsers - - - - 159 - - I like to remind you that there are other excellent browsers out there, -many of which have long supported practical features like tabbing. - - - - 160 - - The html is tested across several browsers, including: - - - - 161 - - Firefox -(Mozilla-Firefox) 46 - - - 46 - - <http://www.mozilla.org/products/firefox/> - - - - - 162 - - Kazehakase -47 - - - 47 - - <http://kazehakase.sourceforge.jp/> - - - - - 163 - - Konqueror 48 - - - 48 - - <http://www.konqueror.org/> - - - - - 164 - - Mozilla 49 - - - 49 - - <http://www.mozilla.org/> - - - - - 165 - - MS -Internet Explorer 50 - - - 50 - - <http://www.microsoft.com/windows/ie/default.asp> - - - - - 166 - - -Netscape 51 - - - 51 - - <http://home.netscape.com/comprod/mirror/client_download.html> - - - - - 167 - - Opera 52 - - - 52 - - <http://www.opera.com/> - - - - - 168 - - Also lighter weight graphical browsers: - - - - 169 - - Dillo 53 - - - 53 - - <http://www.dillo.org/> - - - - - 170 - - Epiphany - 54 - - - 54 - - <http://www.gnome.org/projects/epiphany/> - - - - - 171 - - Galeon -55 - - - 55 - - <http://galeon.sourceforge.net/> - - - - - 172 - - And for console/text browsing: - - - - 173 - - elinks 56 - - - 56 - - <http://elinks.or.cz/> - - - - - 174 - - links2 -57 - - - 57 - - <http://links.twibright.com/> - - - - - 175 - - w3m -58 - - - 58 - - <http://w3m.sourceforge.net/> - - - - - 176 - - The html tables output is rendered more accurately across a wider -variety set and older versions of browsers (than the html css output). - - - - 177 - - 1.15.2 XML - - - - 178 - - SiSU generates well formed XML, and multiple versions. An XML -SAX version with a flat/shallow structure, and XML DOM version with a -deeper (embedded) structure. There is also a released working xhtml -module. Examples of SAX and DOM versions are provided within this -document. - - - - 179 - - 1.15.3 ODT:ODF, Open Document Format - ISO/IEC 26300:2006 - - - - 180 - - SiSU generates Open Document Output format. - - - - 181 - - 1.15.4 PDF - portrait and landscape, (through the generation of LaTeX -output which is then transformed to pdf) - - - - 182 - - SiSU outputs LaTeX if required which is easily transformed to -PDF.59 PDF documents are generated on the site from the same -source files and Ruby program that produce html. Landscape -oriented pdf introduced, providing easier screen viewing, they are also -(paper saving, being currently) formatted to have fewer pages than -their portrait equivalents. - - - 59 - - LaTeX and pdf features introduced 18th June 2001, -Landscape and portrait pdfs introduced 7th October 2001., -Lout is a more recent addition 22th April 2003 - - - - - 183 - - -Adobe Reader 60 - - - 60 - - <http://www.adobe.com/products/acrobat/readstep2.html> - - - - - 184 - - Evince - 61 - - - 61 - - <http://www.gnome.org/projects/evince/> - - - - - 185 - - xpdf 62 - - - 62 - - <http://www.foolabs.com/xpdf/> - - - - - 186 - - 1.15.5 Search - loading/populating of relational database while -retaining document structure information, object citation numbering and -other features (currently PostgreSQL and/or SQLite) - - - - 187 - - SiSU (from the same markup input file) automatically feeds into -PostgreSQL63 and/or SQLite64 database (could be any -other of the better relational databases)65 - together with -all additional information related to document structure, and the -alternative ways in which it is generated on the site retained. As -regards scaling of the database, it is as scalable as the database -(here Postgresql or SQLite) and hardware allow. I will prune the images -later. - - - 63 - - <http://www.postgresql.org/> -
    <http://advocacy.postgresql.org/> -
    <http://en.wikipedia.org/wiki/Postgresql> -
    -
    - - 64 - - <http://www.hwaci.com/sw/sqlite/> -
    <http://en.wikipedia.org/wiki/Sqlite> -
    -
    - - 65 - - Relational database features retaining document structure and -citation introduced 15th July 2002 - - -
    - - 188 - - This is one of the more interesting output forms, as all the structural -data for the documents are retained (though can be ignored by the user -of the database should they so choose). All site texts/documents are -(currently) streamed to four pgsql database tables: - - - - 189 - - one containing semantic (and other) headers, including, title, -author, subject, (the Dublin Core...); - - - - 190 - - another the substantive texts by individual "paragraph" (or -object) - along with structural information, each paragraph being -identifiable by its paragraph number (if it has one which almost all of -them do), and the substantive text of each paragraph quite naturally -being searchable (both in formatted and clean text versions for -searching); and - - - - 191 - - a third containing endnotes cross-referenced back to the -paragraph from which they are referenced (both in formatted and clean -text versions for searching). - - - - 192 - - a fourth table with a one to one relation with the headers table -contains full text versions of output, eg. pdf, html, xml, and ascii. - - - - 193 - - There is of course the possibility to add further structures. - - - - 194 - - At this level SiSU loads a relational database with documents -broken in to their smallest logical structurally constituent parts, as -text objects, with their object citation number and all other -structural information needed to construct the structured document. -Text is stored (at this text object level) with and without elementary -markup tagging, the stripped version being so as to facilitate ease of -searching. - - - - 195 - - Because the document structure of sites created is clearly defined, and -the text object citation system is available for all forms of output, -it is possible to search the sql database, and either read results from -that database, or just as simply map the results to the html output, -which has richer text markup. - - - - 196 - - The combination of the SiSU citation system with a relational -database is pretty powerful, giving rise to several possibilities. As -individual text objects of a document stored (and indexed) together -with object numbers, and all versions of the document have the same -numbering, complex searches can be tailored to return just the -locations of the search results relevant for all available output -formats, with live links to the precise locations in the database or in -html/xml documents; or, the structural information provided makes it -possible to search the full contents of the database and have headings -in which search content appears, or to search only headings etc. (as -the Dublin Core is incorporated it is easy to make use of that as -well). - - - - 197 - - This is a larger scale project, (with little development on the front -end largely ignored), though the "infrastructure" has been in place -since 2002. - - - - 198 - - 1.15.6 Search - database frontend sample, utilising database and SiSU -features, including object citation numbering (backend currently -PostgreSQL) - - - - 199 - - Sample search frontend -66 A small database and sample query front-end (search from) -that makes use of the citation system, object citation numbering -to demonstrates functionality.67 - - - 66 - - <http://search.sisudoc.org> - - - - 67 - - (which could be extended further with current back-end). As regards -scaling of the database, it is as scalable as the database (here -Postgresql) and hardware allow. - - - - - 200 - - SiSU can provide information on which documents are matched and -at what locations within each document the matches are found. These -results are relevant across all outputs using object citation -numbering, which includes html, XML, LaTeX, PDF and indeed the SQL -database. You can then refer to one of the other outputs or in the SQL -database expand the text within the matched objects (paragraphs) in the -documents matched. - - - - 201 - - (further work needs to be done on the sample search form, which is -rudimentary and only passes simple booleans correctly at present to the -SQL engine) - - - - 202 - - A few canned searches, showing object numbers. Search for: - - - - 203 - - -English documents matching Linux OR Debian - - - - 204 - - -GPL OR Richard Stallman - - - - 205 - - -invention OR innovation in English language - - - - 206 - - -copyright in English language documents - - - - 207 - - Note that the searches done in this form are case sensitive. - - - - 208 - - Expand those same searches, showing the matching text in each document: - - - - 209 - - -English documents matching Linux OR Debian - - - - 210 - - -GPL OR Richard Stallman - - - - 211 - - -invention OR innovation in English language - - - - 212 - - -copyright in English language documents - - - - 213 - - Note you may set results either for documents matched and object number -locations within each matched document meeting the search criteria; or -display the names of the documents matched along with the objects -(paragraphs) that meet the search criteria.68 - - - 68 - - of this feature when demonstrated to an IBM software innovations -evaluator in 2004 he said to paraphrase: this could be of interest to -us. We have large document management systems, you can search hundreds -of thousands of documents and we can tell you which documents meet your -search criteria, but there is no way we can tell you without opening -each document where within each your matches are found. - - - - - 214 - - OCN index mode, (object citation number) the numbers displayed -are relevant (and may be used to reference the match) in any sisu -generated rendition of the text69 the links provided are to -the locations of matches within the html generated by SiSU. - - - 69 - - OCN are provided for HTML, XML, pdf ... though currently omitted in -plain-text and opendocument format output - - - - - 215 - - Paragraph mode, you may alternatively display the text of each -paragraph in which the match was made, again the object/paragraph -numbers are relevant to any SiSU generated/published text. - - - - 216 - - Several options for output - select database to search, show results in -index view (links to locations within text), show results with text, -echo search in form, show what was searched, create and show a "canned -url" for search, show available search fields. Also shows counters -number of documents in which found and number of locations within -documents where found. [could consider sorting by document with most -occurrences of the search result]. - - - - 217 - - Earlier version of the search frontend - Simple search, results with -files in which search found, and locations where found within files. - - - - 218 - - Simple search, results with files in which search found, and text -object (paragraph or endnote) where found within files. - - - - 219 - - 1.15.7 Other forms - - - - 220 - - There are other forms as well, YAML file, Ruby Marshal dumps, -document pre-processing (processing of documents prior to the steps -described here, to produce input suitable for the program) snap in a -new module as required/desired, well formed XML, no problem. - - - - 221 - - 1.16 Concordance / Word Map or rudimentary index - - - - 222 - - Concordance /WordMaps:70 SiSU produces a rudimentary -index based on the words within the text, making use of paragraph -numbers to identify text locations. This is generated in html and -hyper-linked but identifies these words locations in the other document -formats. Though it is possible to search using a search engine, this is -a means for browsing an alphabetical list of words which may suggest -other useful content. - - - 70 - - Concordance/ WordMaps introduced 15th August 2002 - - - - - 223 - - 1.17 Managed (document) directory, database, or site structure - - - - 224 - - SiSU builds the web site (or more generically provides a -suitable directory structure) - placing various output texts in the -hierarchy of the web-site (or db), which (for directories) is a -sub-directory with the name of the text file. - - - - 225 - - 1.18 Batch processing - - - - 226 - - SiSU is a batch processing tool, handling and transforming -multiple (or individual) documents (in many ways) with a single -instruction. - - - - 227 - - 1.19 Integration to superior Gnu/Linux and Unix tools - - - - 228 - - As should have been noted by the above description of SiSU, it -makes use of existing programs found on Gnu /Linux and Unix, -amongst those already mentioned include the LaTeX to pdf converters and -the database PostgreSQL or SQLite. - - - - 229 - - 1.19.1 Backup and version control - - - - 230 - - Unix provides many tools for version control. For documents Subversion, -CVS and even the old RCS are useful for the per-document histories they -provide. - - - - 231 - - For writing code superior (more recent) version control system exist. -These can also be used for documents though they tend to take stamps of -changes across the repository as a whole, rather than for each -individual file that is tracked, (as CVS and RCS do). My personal -preference is for distributed systems such as Git, Mercurial or Darcs, -of which I use Git for both code and documents. - - - - 232 - - Several backup tools exist. At the base level I tend to use rdiff. - - - - 233 - - 1.19.2 Editor support - - - - 234 - - SiSU documents are prepared / marked up in utf-8 text you are -free to use the text editor of your choice. - - - - 235 - - Syntax highlighting for a number of editors are provided. Amongst them -Vim, Kwrite, Kate, Gedit and diakonos. These may be found with -configuration instructions at <http://www.jus.uio.no/sisu/syntax_highlight>. - Vim 71 as of version -7 has built in sytax highlighting for SiSU. - - - 71 - - <http://www.vim.org/> - - - - - 236 - - 1.20 Modular design, need something new add a module - - - - 237 - - Need a new output format that does not already exist, write a new -module. - - - - 238 - - Prefer a new input syntax, you could write a new syntax matching the -existing design, though my personal preference is some uniformity in -entry appearance. If necessary has been fairly easy to extend the -design parameters. It is intended to incorporate some additional basic -semantic tagging, (book, article, author etc.) However, keeping the -requirements for input minimal, and relatively simple has been a design -goal. - - - - 0 - - Endnotes - - - -
    diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/scroll.xhtml b/data/doc/manuals_generated/sisu_manual/sisu_description/scroll.xhtml deleted file mode 100644 index beb9e0af..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/scroll.xhtml +++ /dev/null @@ -1,2519 +0,0 @@ - - - - - - - - Title: - - SiSU - Description - -
    - Creator: - - Ralph Amissah - -
    - Rights: - - Copyright (C) Ralph Amissah 2007, part of SiSU documentation, License GPL 3 - -
    - Type: - - information - -
    - Subject: - - ebook, epublishing, electronic book, electronic publishing, electronic document, electronic citation, data structure, citation systems, search - -
    - Date created: - - 2002-11-12 - -
    - Date issued: - - 2002-11-12 - -
    - Date available: - - 2002-11-12 - -
    - Date modified: - - 2007-08-30 - -
    - Date: - - 2007-08-30 - -
    - - - - - SiSU - Description,
    Ralph Amissah -
    - 1 -
    - - - SiSU an attempt to describe - - 2 - - - - 1. Description - - 3 - - - - 1.1 Outline - - 4 - - - - SiSU is a flexible document preparation, generation publishing -and search system.1 - - - 1. This information was first placed on the web 12 November 2002; with -predating material taken from <http://www.jus.uio.no/lm/lm.information/toc.html> -part of a site started and developed since 1993. See document metadata -section <http://www.jus.uio.no/sisu/SiSU/metadata.html> -for information on this version. Dates related to the development of -SiSU are mostly contained within the Chronology section of this -document, e.g. <http://www.jus.uio.no/sisu/sisu_chronology> - - 5 - - - - SiSU ("SiSU information Structuring Universe" or -"Structured information, Serialized Units"),2 is a Unix -command line oriented framework for document structuring, publishing -and search. Featuring minimalistic markup, multiple standard outputs, a -common citation system, and granular search. - - - 2. also chosen for the meaning of the Finnish term "sisu". - - 6 - - - - Using markup applied to a document, SiSU can produce plain text, -HTML, XHTML, XML, OpenDocument, LaTeX or PDF files, and populate an SQL -database with objects3 (equating generally to paragraph-sized -chunks) so searches may be performed and matches returned with that -degree of granularity (e.g. your search criteria is met by these -documents and at these locations within each document). Document output -formats share a common object numbering system for locating content. -This is particularly suitable for "published" works (finalized texts as -opposed to works that are frequently changed or updated) for which it -provides a fixed means of reference of content. - - - 3. objects include: headings, paragraphs, verse, tables, images, but not -footnotes/endnotes which are numbered separately and tied to the object -from which they are referenced. - - 7 - - - - SiSU is the data/information structuring and transforming tool, -that has resulted from work on one of the oldest law web projects. It -makes possible the one time, simple human readable markup of documents, -that SiSU can then publish in various forms, suitable for -paper4, web5 and relational database6 -presentations, retaining common data-structure and meta-information -across the output/presentation formats. Several requirements of legal -and scholarly publication on the web have been addressed, including the -age old need to be able to reliably cite/pinpoint text within a -document, to easily make footnotes/endnotes, to allow for semantic -document meta-tagging, and to keep required markup to a minimum. These -and other features of interest are listed and described below. A few -points are worth making early (and will be repeated a number of times): - - - 4. pdf via LaTeX or lout - - - 5. currently html (two forms of html presentation one based on css the -other on tables), and PHP; potentially structured XML - - - 6. any SQL - currently PostgreSQL and sqlite (for portability, -testing and development) - - 8 - - - - (i) The SiSU document generator was the first to place -material on the web with a system that makes possible citation across -different document types, with paragraph, or rather object citation -numbering7 a text positioning system, available for the -pinpointing of text, 1997, a simple idea from which much benefit, and -SiSU remains today, to the best of my knowledge, the only -multiple format e-book/ electronic-document system on the web that -gives you this possibility (including for relational databases). - - - 7. previously called "text object numbering" - - 9 - - - - (ii) Markup is done once for the multiple formats produced. - - 10 - - - - (iii) Markup is simple, and human readable (with a little -practice), in almost all cases there is less and simpler markup -required than basic html. In any event the markup required is very much -simpler than the html, LaTeX, [lout], structured XML, ODF -(OpenDocument), PostgreSQL or SQLite feed etc. that you can have -SiSU generate for you. - - 11 - - - - (iv) SiSU is a batch processor, dealing with as many files -as you need to generate at a time. - - 12 - - - - (v) Scalability is dependent on your file system (in my case -Reiserfs), the database (currently Postgresql and/or SQLite) and your -hardware. - - 13 - - - - SiSU Sabaki8 (or just SiSU) is the provisional -name given to the software described here that helps structure -documents for web and other publication. The name SiSU is a -loose anagram for something along the lines of "SiSU is -structuring unit", or "SiSU, information structuring -unit" or the more descriptive "Structured information, -Serialized Units" or "simple - information structuring -unit" or the more descriptive "Structured information, -Serialized Units" or what it may be directed towards -"semantic and information structuring universe" -,9 tongue in cheek, only just. Guess I'll get away with -"Simple - information Structuring Universe". SiSU -is also a Finnish word roughly meaning guts, inner strength and -perseverance.10 - - - 8. SiSU Sabaki, release version. Pre-release version SiSU -Scribe, and version prior to that SiSU nicknamed Scribbler. -Pre-release versions go back several years. Both Scribbler and Scribe -(still maintained) made system calls to SiSU's various parts, -instead of using libraries. - - - 9. A little universe it may be, but semantic you may have a hard time -getting away with, given the meaning the word has taken on with markup. -On a document wide basis semantic information may be provided, which -can be really useful, (and meaningful, especially) if you have a large -document set, and use this with rss feeds or in an sql database etc. On -a markup level, I have little inclination to add semantic markup -formally beyond references, title, author [Dublin Core entities? -addresses?] etc. Actually this deserves a bit of thought possibly use -letter tags (including letter alias/synonyms for font faces) to create -a small set of default semantic tags, with the possibility for per -document adjustments. Will seek to permit XML entity tagging, within -SiSU markup and have that ignored/removed by the parts of the -program that have no use for it. - - - 10. "Sisu refers not to the courage of optimism, but to a concept of -life that says, 'I may not win, but I will gladly give my life for what -I believe.'" Aini Rajanen, Of Finnish Ways, 1981, p. 10.
    -<http://www.humanlanguages.com/finnishenglish/rlfs.htm> -
    "Every Finn has his own pet definition. To me, sisu means -patience without passion. But there are many varieties of sisu. Sisu -can be a sudden outburst or it can be the kind that lasts. A man can -have both kinds. It is outside reason. It is something in the soul. It -comes from oneself. For instance, it makes a soldier do things because -he himself must, not because he has been told." Paavo Nurmi
    -<http://personalweb.smcvt.edu/tmatikainen/finnishtraditions.htm> -
    - 14 -
    - - - SiSU was born of the need to find a way, with minimal effort, -and for as wide a range of document types as possible, to produce high -quality publishing output in a variety of document formats. As such it -was necessary to find a simple document representation that would work -across a large number of document types, and the most convenient way(s) -to produce acceptable output formats. The project leading to this -program was started in 1993 (together with the trade law project now -known as Lex Mercatoria) as an investigation of how to -effectively/efficiently place documents on the web. The unified -document handling, together with features such as paragraph numbering, -endnote handling and tables... appeared in 1996/97. SiSU was -originally written in Perl,11 and converted to Ruby, -12 in 2000, one of the most impressive programming languages -in existence! In its current form it has been written to run on the -Gnu /Linux platform, and in particular on Debian, -13 taking advantage of many of the wonderful projects that are -available there. - - - 11. <http://www.perl.org/> - - - 12. <http://www.ruby-lang.org/en/> - - - 13. <http://www.debian.org/> - - 15 - - - - SiSU markup is based on requiring the minimum markup needed to -determine the structure of a document. (This can be as little as saying -in a header to look for the word Book at a specified level and the word -Chapter at another level). SiSU then breaks a document into its -smallest parts (at a heading, and paragraph level) while retaining all -structural information. This break up of the document and information -on its structure is taken advantage of in the transformations made in -generating the very different output types that can be created, and in -providing as much as can be for what each output type is best at doing, -e.g. LaTeX (professional document typesetting, easy conversion to pdf -or Postscript), XML (in this case, structural representation), ODF -(OpenDocument [experimental]), SQL (e.g. document search; representing -constituent parts of documents based on their structure, headings, -chapters, paragraphs as required; user control).14 - - - 14. where explicit structure is provided through the use of tagging -headings, it could be reduced (still) further, for example by reducing -the number of characters used to identify heading levels; but in many -cases even that information is not required as regular expressions can -be used to extract the implicit structure. - - 16 - - - - From markup that is simpler and more sparse than html you get: - - 17 - - - - far greater output possibilities, including html, XML, ODF -(OpenDocument), LaTeX (pdf), and SQL; - - 18 - - - - the advantages implicit in the very different output possibilities; - - 19 - - - - a common citation system (for all outputs - including the relational -database, search results are relevant for all outputs); - - 20 - - - - For more see the short summary of features provided below. - - 21 - - - - SiSU processes files with minimal tagging to produce various -document outputs including html, LaTeX or lout (which is converted to -pdf) and if required loads the structured information into an SQL -database (PostgreSQL and SQLite have been used for this). SiSU -produces an intermediate processing format.15 - - - 15. This proved to be the easiest way to develop syntax, changes could -be made, or alternatives provided for the markup syntax whilst the -intermediate markup syntax was largely held constant. There is actually -an optional second intermediate markup format in YAML <http://www.yaml.org/> - - 22 - - - - SiSU is used in constructing Lex Mercatoria <http://lexmercatoria.org/> -or <http://www.jus.uio.no/lm/> -(one of the oldest law web sites), and considerable thought went into -producing output that would be suitable for legal and academic writings -(that do not have formulae) given the limitations of html, and -publication in a wide variety of "formats", in particular in relation -to the convenient and accurate citation of text. However, the -construction of Lex Mercatoria uses only a fraction of the features -available from SiSU today, vis generation of flat file -structures, rather than in addition the building of ("granular") SQL -database content, (at an object level with relevant relational tables, -and other outputs also available). - - 23 - - - - 1.2 Short summary of features - - 24 - - - - (i) markup syntax: (a) simpler than html, (b) mnemonic, -influenced by mail/messaging/wiki markup practices, (c) human readable, -and easily writable, - - 25 - - - - (ii) (a) minimal markup requirement, (b) single file marked up -for multiple outputs, - - 26 - - - - notes: - - 27 - - - - * documents are prepared in a single UTF-8 file using a minimalistic -mnemonic syntax. Typical literature, documents like "War and Peace" -require almost no markup, and most of the headers are optional. - - 28 - - - - * markup is easily readable/parsed by the human eye, (basic markup is -simpler and more sparse than the most basic html), [this may also be -converted to XML representations of the same input/source document]. - - 29 - - - - * markup defines document structure (this may be done once in a header -pattern-match description, or for heading levels individually); basic -text attributes (bold, italics, underscore, strike-through etc.) as -required; and semantic information related to the document (header -information, extended beyond the Dublin core and easily further -extended as required); the headers may also contain processing -instructions. - - 30 - - - - (iii) (a) multiple outputs primarily industry established and -institutionally accepted open standard formats, include amongst others: -plaintext (UTF-8); html; (structured) XML; ODF (Open Document text)l; -LaTeX; PDF (via LaTeX); SQL type databases (currently PostgreSQL and -SQLite). Also produces: concordance files; document content -certificates (md5 or sha256 digests of headings, paragraphs, images -etc.) and html manifests (and sitemaps of content). (b) takes advantage -of the strengths implicit in these very different output types, (e.g. -PDFs produced using typesetting of LaTeX, databases populated with -documents at an individual object/paragraph level, making possible -granular search (and related possibilities)) - - 31 - - - - (iv) outputs share a common numbering system (dubbed "object -citation numbering" (ocn)) that is meaningful (to man and machine) -across various digital outputs whether paper, screen, or database -oriented, (PDF, html, XML, sqlite, postgresql), this numbering system -can be used to reference content. - - 32 - - - - (v) SQL databases are populated at an object level (roughly -headings, paragraphs, verse, tables) and become searchable with that -degree of granularity, the output information provides the -object/paragraph numbers which are relevant across all generated -outputs; it is also possible to look at just the matching paragraphs of -the documents in the database; [output indexing also work well with -search indexing tools like hyperesteier]. - - 33 - - - - (vi) use of semantic meta-tags in headers permit the addition of -semantic information on documents, (the available fields are easily -extended) - - 34 - - - - (vii) creates organised directory/file structure for -(file-system) output, easily mapped with its clearly defined structure, -with all text objects numbered, you know in advance where in each -document output type, a bit of text will be found (e.g. from an SQL -search, you know where to go to find the prepared html output or PDF -etc.)... there is more; easy directory management and document -associations, the document preparation (sub-)directory may be used to -determine output (sub-)directory, the skin used, and the SQL database -used, - - 35 - - - - (viii) "Concordance file" wordmap, consisting of all the words -in a document and their (text/ object) locations within the text, (and -the possibility of adding vocabularies), - - 36 - - - - (ix) document content certification and comparison -considerations: (a) the document and each object within it stamped with -an md5 hash making it possible to easily check or guarantee that the -substantive content of a document is unchanged, (b)version control, -documents integrated with time based source control system, default RCS -or CVS with use of $Id: sisu_description.sst,v 1.25 2007/08/23 12:22:36 -ralph Exp $ tag, which SiSU checks - - 37 - - - - (x) SiSU's minimalist markup makes for meaningful -"diffing" of the substantive content of markup-files, - - 38 - - - - (xi) easily skinnable, document appearance on a project/site -wide, directory wide, or document instance level easily -controlled/changed, - - 39 - - - - (xii) in many cases a regular expression may be used (once in -the document header) to define all or part of a documents structure -obviating or reducing the need to provide structural markup within the -document, - - 40 - - - - (xiii) prepared files may be batch process, documents produced -are static files so this needs to be done only once but may be repeated -for various reasons as desired (updated content, addition of new output -formats, updated technology document presentations/representations) - - 41 - - - - (xiv) possible to pre-process, which permits: the easy creation -of standard form documents, and templates/term-sheets, or; building of -composite documents (master documents) from other sisu marked up -documents, or marked up parts, i.e. import documents or parts of text -into a main document should this be desired - - 42 - - - - there is a considerable degree of future-proofing, output -representations are "upgradeable", and new document formats may be -added. - - 43 - - - - (xv) there is a considerable degree of future-proofing, output -representations are "upgradeable", and new document formats may be -added: (a) modular, (thanks in no small part to Ruby) another -output format required, write another module.... (b) easy to update -output formats (eg html, XHTML, LaTeX/PDF produced can be updated in -program and run against whole document set), (c) easy to add, modify, -or have alternative syntax rules for input, should you need to, - - 44 - - - - (xvi) scalability, dependent on your file-system (ext3, -Reiserfs, XFS, whatever) and on the relational database used (currently -Postgresql and SQLite), and your hardware, - - 45 - - - - (xvii) only marked up files need be backed up, to secure the -larger document set produced, - - 46 - - - - (xviii) document management, - - 47 - - - - (xix) Syntax highlighting for SiSU markup is available -for a number of text editors. - - 48 - - - - (xx) remote operations: (a) run SiSU on a remote server, -(having prepared sisu markup documents locally or on that server, i.e. -this solution where sisu is installed on the remote server, would work -whatever type of machine you chose to prepare your markup documents -on), (b) generated document outputs may be posted by sisu to remote -sites (using rsync/scp) (c)document source (plaintext utf-8) if shared -on the net may be identified by its url and processed locally to -produce the different document outputs. - - 49 - - - - (xxi) document source may be bundled together (automatically) -with associated documents (multiple language versions or master -document with inclusions) and images and sent as a zip file called a -sisupod, if shared on the net these too may be processed locally to -produce the desired document outputs, these may be downloaded, shared -as email attachments, or processed by running sisu against them, either -using a url or the filename. - - 50 - - - - (xxii) for basic document generation, the only software -dependency is Ruby, and a few standard Unix tools (this covers -plaintext, html, XML, ODF, LaTeX). To use a database you of course need -that, and to convert the LaTeX generated to PDF, a LaTeX processor like -tetex or texlive. - - 51 - - - - as a developers tool it is flexible and extensible - - 52 - - - - SiSU was developed in relation to legal documents, and is strong -across a wide variety of texts (law, literature...). SiSU -handles images but is not suitable for formulae/ statistics, or for -technical writing at this time. - - 53 - - - - SiSU has been developed and has been in use for several years. -Requirements to cover a wide range of documents within its use domain -have been explored. - - 54 - - - - Some modules are more mature than others, the most mature being Html -and LaTeX / pdf. PostgreSQL and search functions are useable and -together with ocn unique (to the best of my knowledge). The XML -output document set is "well formed" but largely proof of concept. - - 55 - - - - 1.3 How it works - - 56 - - - - SiSU markup is fairly minimalistic, it consists of: a (largely -optional) document header, made up of information about the document -(such as when it was published, who authored it, and granting what -rights) and any processing instructions; and markup within text which -is related to document structure and typeface. SiSU must be able -to discern the structure of a document, (text headings and their levels -in relation to each other), either from information provided in the -instruction header or from markup within the text (or from a -combination of both). Processing is done against an abstraction of the -document comprising of information on the document's structure and its -objects,16 which the program serializes (providing the object -numbers) and which are assigned hash sum values based on their content. -This abstraction of information about document structure, objects, (and -hash sums), provides considerable flexibility in representing documents -different ways and for different purposes (e.g. search, document -layout, publishing, content certification, concordance etc.), and makes -it possible to take advantage of some of the strengths of established -ways of representing documents, (or indeed to create new ones). - - - 16. objects include: headings, paragraphs, verse, tables, images, but -not footnotes/endnotes which are numbered separately and tied to the -object from which they are referenced. - - 57 - - - - 1.4 Simple markup - - 58 - - - - SiSU markup is based on requiring the minimum markup needed to -determine the structure of a document. (This can be as little as saying -in a header to look for the word Book at a specified level and the word -Chapter at another level). SiSU then breaks a document into its -smallest parts (at a heading, and paragraph level) while retaining all -structural information. This break up of the document and information -on its structure is taken advantage of in the transformations made in -generating the very different output types that can be created, and in -providing as much as can be for what each output type is best at doing, -e.g. LaTeX (professional document typesetting, easy conversion to pdf -or Postscript), XML (in this case, structural representation), ODF -(OpenDocument), SQL (e.g. document search; representing constituent -parts of documents based on their structure, headings, chapters, -paragraphs as required; user control).17 - - - 17. where explicit structure is provided through the use of tagging -headings, it could be reduced (still) further, for example by reducing -the number of characters used to identify heading levels; but in many -cases even that information is not required as regular expressions can -be used to extract the implicit structure. - - 59 - - - - 1.4.1 Sparse markup requirement, try to get the most out of markup - - 60 - - - - One of its strengths is that very small amounts of initial tagging is -required for the program to generate its output. - - 61 - - - - This is a basic markup example: - - 62 - - - - -basic markup example, text file - an international convention -18 - - - 18. <http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst> -output provided as example in the next section - - 63 - - - - -view basic markup, as it would be highlighted by vim editor -19 - - - 19. <http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html> -as it would appear with syntax highlighting (by vim) - - 64 - - - - Emphasis has been on simplicity and minimalism in markup requirements. -Design philosophy is to try keep the amount of markup required low, for -whatever has been determined to be acceptable output.20 - - - 20. seems there are several "smart ASCIIs" available, primarily for -ascii to html conversion, that make this, and reasonable looking ascii -their goal
    <http://webseitz.fluxent.com/wiki/SmartAscii> -
    <http://daringfireball.net/projects/markdown/> -
    <http://www.textism.com/tools/textile/> -
    - 65 -
    - - - SiSU's markup is more minimalistic and simpler than (the -equivalent) html and for it, you get considerably more than just html, -as this preparation gives you all available output formats, upon -request. - - 66 - - - - 1.4.2 Single markup file provides multiple output formats - - 67 - - - - For each document, there is only one (input, minimalistically marked -up) file from which all the available output types are -generated.21 - - - 21. These include richly laid out and linked html (table or css -variants), PHP, LaTeX (from which pdf portrait and landscape -documents are produced), texinfo (for info files etc.), and PostgreSQL -and/or SQLite. And the opportunity to fairly easily build additional -modules, such as XML. See the examples provided in this document. - - 68 - - - - Eg. the markup example: - - 69 - - - - -original text file - an international convention 22 - - - 22. <http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst> - - 70 - - - - -view as syntax would be highlighted by vim editor 23 - - - 23. <http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html> - - 71 - - - - Produces the following output: - - 72 - - - - -Segmented html version of document 24 - - - 24. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/toc.html> - - 73 - - - - -Full length html document 25 - - - 25. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/doc.html> - - 74 - - - - -pdf landscape version of document 26 - - - 26. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/landscape.pdf> - - 75 - - - - -pdf portrait version of document 27 - - - 27. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/portrait.pdf> - - 76 - - - - -clean tex ascii version of document 28 - - - 28. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/plain.txt> - - 77 - - - - -xml sax version of document 29 - - - 29. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/sax.xml> - - 78 - - - - -xml dom version of document 30 - - - 30. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/dom.xml> - - 79 - - - - -Concordance 31 - - - 31. <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/concordance.html> - - 80 - - - - (and in addition to these: PostgreSQL, SQLite, texinfo and -YAML 32 versions if desired) - - - 32. discontinued for the time being - - 81 - - - - 1.4.3 Syntax relatively easy to read and remember - - 82 - - - - Syntax is kept simple and mnemonic.33 - - - 33. SiSU markup syntax, an incomplete summary: <http://www.jus.uio.no/sisu/sisu_markup_table/doc.html#h200306> -
    Visual check of elementary font face modifiers: bold -bold emphasis italics underscore -strikethrough superscript subscript -
    - 83 -
    - - - 1.4.4 Kept simple by having a limited publishing feature set, and -features identified as most important, are available across several -document types - - 84 - - - - To keep SiSU markup sparse and simple SiSU deliberately -provides a limited publishing feature set, including: indent levels; -bold; italics; superscript; subscript; simple tables; images; tables of -contents and; endnotes. Which in most cases are available across the -different output formats. - - 85 - - - - The publishing feature set may be expanded as required. - - 86 - - - - 1.5 Designed with usability in mind - - 87 - - - - Output is designed to be uniform, easy to read, navigate and cite. - - 88 - - - - 1.6 Code separate from content - - 89 - - - - Code34 is separated from content. This means that when changes -are desired in the output presentation, the code that produces them, -and not the marked up text data set (which could be thousands of -documents) is modified. Separating code from content makes large scale -changes to output appearance trivial, and permits the easy addition of -new output modules. - - - 34. the program that generates the documents - - 90 - - - - 1.7 Object citation numbering, a text or object positioning / citation -system - "paragraph" (or text object) numbering, that remains same and -usable across all output formats by people and machine - - - 91 - - - - Object citation numbering is a simple object (text) positioning and -cition system that is human relevant and machine useable, used by -SiSU for all manner of presentations, and that is available for -use in all text mappings. It is based on the automated sequential -numbering of objects (roughly paragraphs, (headings, tables, verse) or -other blocks of text or images etc.). The text positioning system (in -which I claim copyright) is invaluable for publishing requiring the -citing text across multiple output formats, and for the general mapping -of text within a document: - - 92 - - - - in html, html not being easily citeable (change font size, or use a -different browser and the page on which specific text appears has -changed), and - - 93 - - - - across multiple formats being common to all output formats -html/xml/pdf/sql output, - - 94 - - - - the results of an sql search can just be "live" citation references to -the documents in which the text is found, much like -an index (see image examples provided). 35 - - - 35. <http://www.jus.uio.no/sisu/SiSU/1.html#search> - - 95 - - - - I claim copyright on the system I use which is the most basic of all, -numbering all text in headings and paragraphs sequentially (with tables -and images being treated as a single paragraph) and only -footnotes/endnotes not following this numbering, as their position in -text is not strictly determined, (a change from footnotes to endnotes -would change their numbering), footnotes instead "belong" to the -paragraph from which they are referenced, and have sequential numbers -of their own. - - 96 - - - - SiSU has a paragraph numbering system, that remains the same -regardless of the output format. This provides an effective means of -citation, pinpointing text accurately in all output formats, using the -same reference. This is particularly useful where text has to be -located across different output formats - for example once html is -printed the number of pages and pages on which given text is found will -vary depending on the browser, its settings the font size setting etc. -Similarly SiSU produces pdf in different forms, eg. on the -example site Lex Mercatoria as portrait and landscape documents - here -too page numbering varies, but paragraph numbering is the same, vis -a vis all versions of the text (portrait and landscape pdf and the -html versions of the text, and as stored (with "paragraphs" as records) -to the PostgreSQL or SQLite database). - - 97 - - - - These numbers are placed in the text margins and are intended to be -independent of and not to interfere with authors tagging. [The citation -system (object citation numbering system, automated "paragraph -numbering") which is automatically generated and is common and -identical across all document formats] The paragraph numbering system -is more accurately described as an (text) object numbering system, as -headings are also numbered... all headings and paragraphs are numbered -sequentially. Endnotes are automatically numbered independently and -rather "belong" to the paragraph from which they are referenced, as an -endnote does not (necessarily) form a part of a documents sequence, -(they may be produced as either endnotes or footnotes (or both -depending on what output you choose to look at - if you take the -segmented html version document provided as an example, you will find -that the endnotes are placed both at the end of each section, and in a -separate section of their own called endnotes, and these are -hyper-linked)). An attractive feature of providing citation numbering -in this way is that it is independent of the document structure... it -remains the same regardless of what is done about the document -structure. - - 98 - - - - The rules have been kept very simple, unique incremental object -citation numbers are assigned to headings, paragraphs, verse, tables -and images. It is possible to manually override this feature on a per -heading or comment basis though this should be used exceptionally, it -may be of use where there a substantive text, and the addition of a -minor comment by the publisher that should not be mapped as part of the -text. - - 99 - - - - The object citation number markers contain additional numbering -information with regard to the document structure, that can be used for -alternative presentations, including such detail as the type of object -(heading, paragraph, table, image, etc.), numbered sequentially. - - 100 - - - - An advantage is that the numbering remains the same regardless of -document structure. - - 101 - - - - Text object ("paragraph") numbering is the same for all output versions -of the same document, vis html, pdf, pgsql, yaml etc. - - 102 - - - - In the relational database, as individual text objects of a document -stored (and indexed) together with object numbers, and all versions of -the document have the same numbering, the results of searches may be -tailored just to provide the location of the search result in all -available document formats. - - 103 - - - - Note: there is a bug in the released behaviour of object citation -numbering, (not certain when it was introduced) tables should be -numbered, ie each table gets an ocn, required amongst other things for -relational database. This will be corrected in a future release. -Citation numbering of existing documents that contain tables will -changed. - - 104 - - - - 1.8 Handling of Dublin Core meta-tags making use of the Resource -Description Framework - - 105 - - - - SiSU is able to use meta tags based on the Dublin -Core36 and Resource Description Framework37 - - - 36. <http://dublincore.org/> - - - 37. <http://www.w3.org/RDF/> - - 106 - - - - This provides the means of providing semantic information about a -document, both as computer processable meta-tags, and as human readable -information that may be of value for classification purposes. - - 107 - - - - This information is provided both in html metatags, and (where -available) under the section titled "Document Information - MetaData", -near the end of a document, for example in the segmented html version -of this text at: <http://www.jus.uio.no/sisu/SiSU/metadata.html> - - 108 - - - - 1.9 Easy directory management - - 109 - - - - 1. Directory file association, skins and special image management, made -simpler.38 - - - 38. The previous way was directory associations for file output were set -up in the configuration file. The present system is a more natural way -to work requireing less configuration. - - 110 - - - - The last part of the name of the work directory in which markup is -being done, or rather from where SiSU is run in order to -generate document output, is used in determining the sub-directory name -for output files, that is created in the document output directory. -This provides a rather easy way to associate documents e.g. of a given -subject, or by owner. - - 111 - - - 112 - -      /www/docs
             /intellectual_property
             /arbitration
             /contract_law

         /www/docs
             /ralph
             /sisu     -
    -
    - - - all are placed in their own directories within the directory structure -created. Similar rules are used in the creation of sql type databases -(though they can be overridden). - - 113 - - - - There are a couple of further associations with these directories. - - 114 - - - - Directory wide skins. - - 115 - - - - Directory specific images. - - 116 - - - - 2. If there is a "directory skin", that is a skin of the same name as -the directory, it is used in the generation of the documents within it, -rather than the default skin, unless the document has a specific skin -associated with it. - - 117 - - - - a. default skin (always available) - - 118 - - - - b. directory skin (precedence over default if exists) - - 119 - - - - c. document skin (takes precedence wherever document requests a -specific skin) - - 120 - - - - Skins are defined in the document skin directory and if a directory -association is desired a softlink made to the relevant skin. Skins -(directory association auto load) auto load skin if a directory skin -exists of same name as directory stub, (and there is no specific doc -skin) - - 121 - - - - 3. If the working directory has within it a sub-directory called -image_local, the images within that directory are used for references -to images, that are not part of the default site build. - - 122 - - - - 1.10 Document Version Control Information - - 123 - - - - The possibility of citing an exact document version. - - 124 - - - - Permits the inclusion of document version control information to the -document body and metatags.39 This provides a much more -certain method of referring to the exact version of a particular -document, (assuming that the document is from a trusted source, that -will retain earlier versions of a document).40 - - - 39. from a version control system such as CVS - - - 40. The version control system must be run, so the version number is -obtained, prior to the SiSU document generation, and subsequent -posting of the document. - - 125 - - - - This information (where available) is provided under the section of the -document titled "Document Information - MetaData", near the end of a -document, for example in the segmented html version of this text at: -<http://www.jus.uio.no/sisu/SiSU/metadata.html> - - 126 - - - - 1.11 Table of contents - - 127 - - - - SiSU produces a rudimentary a table of contents based on -document headings. - - 128 - - - - 1.12 Auto-numbering of headings - - 129 - - - - Headings can be automatically numbered, (and automatically named for -hyper-linking) - - 130 - - - - 1.13 Numbering and cross-hyperlinking of endnotes - - 131 - - - - SiSU can automatically number footnotes/endnotes. This is the -default operation where no number is provided. - - 132 - - - - Footnotes/endnotes may also be manually numbered. Where a number, or -numbers are provided for a footnote/endnote, this does not increment -the automatic footnote/endnote number counter. - - 133 - - - - In the html output footnotes/endnotes are cross-hyper-linked (to their -reference point and vice versa). In th pdf output footnotes are linked -from their reference point only. - - 134 - - - - 1.14 "Skinnable" - - 135 - - - - SiSU is skinnable, on a site-wide, directory-wide and per -document basis, so different looking versions of things may be produced -with little difficulty. There is a default skin which may be modified, -as the background site skin, and each working directory may have a skin -associated with it, as may each individual document. The hierarchy of -application is document, directory, then site... ie if a document skin -exists it gets precedence. - - 136 - - - - Whilst it is skinnable, the default output styles are selected to work -across the widest possible range of document types. - - 137 - - - - 1.15 Multiple Outputs - - 138 - - - - From markup that is simpler and more sparse than html you get: - - 139 - - - - far greater output possibilities, including multiple html types, XML -(different structured types), LaTeX (pdf landscape, portrait), and SQL -(Postgresql or SQLite or other); - - 140 - - - - the advantages implicit in these very different output -possibilities;41 - - - 41. e.g. LaTeX (professional document typesetting, easy conversion to -pdf or Postscript), XML (in this case, structural representation), SQL -(e.g. document set searches; representation of the constituent parts of -documents based on their structure, headings, chapters, paragraphs as -desired; control of use) - - 141 - - - - a common citation system - - 142 - - - - As many output formats/presentations as one cares to write modules for -- several types of html (e.g. structure based on css, or structure -based on tables); LaTeX/pdf and Lout/pdf; pgsql other -databases easily added; yaml... - - 143 - - - - 1.15.1 html - several presentations: full length & segmented; css -& table based - - 144 - - - - Most documents are produced in single and segmented html versions, -described below: - - 145 - - - - The Scroll (full length text presentations) - - 146 - - - - The full length of the text in a single scrollable document.42 -As a rule the files they are saved in are named: doc or more -precisely doc.html - - - 42. CISG <http://www.jus.uio.no/lm/un_contracts_international_sale_of_goods_convention_1980/doc> -
    The Unidroit Contract Principles <http://www.jus.uio.no/lm/unidroit.contract.principles.1994/doc> -or
    The Autonomous Contract <http://www.jus.uio.no/lm/autonomous.contract.2000.amissah/doc> -
    - 147 -
    - - - For various reasons texts may only be provided in this form (such as -this one which is short), though most are also provided as segmented -texts. - - 148 - - - - "Scroll" is a reference to the historical scroll, a single long -document/ parchment, and also no doubt to what you will have to do to -get to the bottom of the text.43 - - - 43. Scrolling is not however necessarily confined to full length -documents as you will have to scroll to get to the bottom of any long -segment (eg. chapter) of a segmented text. - - 149 - - - - The Segmented Text - - 150 - - - - The text divided into segments (such as articles or chapters depending -on the text)44 As a rule the files they are saved in are -named: toc and index or more precisely toc.html -and index.html - - - 44. CISG <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980> -
    The Unidroit Principles <http://www.jus.uio.no/lm/unidroit.contract.principles.1994> -
    The Autonomous Contract <http://www.jus.uio.no/sisu/the.autonomous.contract.2000.amissah> -or
    WTA 1994 <http://www.jus.uio.no/lm/wta.1994> -
    - 151 -
    - - - If you know exactly what you are looking for, loading a segment of text -is faster (the segments being smaller). Occasionally longer documents -such as the WTA 1994 <http://www.jus.uio.no/lm/wta.1994/toc> -are only provided in segmented form. - - 152 - - - - Cascading Style Sheet, and Table based html - - 153 - - - - SiSU outputs html, two current standard forms available are: - - 154 - - - - css based - - - 155 - - - - and - - 156 - - - - table based [largely discontinued ]45 - - - 45. formatting possibility still exists in code tree but maintenance has -been largely discontinuted. - - 157 - - - - The html is tested across several browsers - - 158 - - - - I like to remind you that there are other excellent browsers out there, -many of which have long supported practical features like tabbing. - - 159 - - - - The html is tested across several browsers, including: - - 160 - - - - Firefox -(Mozilla-Firefox) 46 - - - 46. <http://www.mozilla.org/products/firefox/> - - 161 - - - - Kazehakase -47 - - - 47. <http://kazehakase.sourceforge.jp/> - - 162 - - - - Konqueror 48 - - - 48. <http://www.konqueror.org/> - - 163 - - - - Mozilla 49 - - - 49. <http://www.mozilla.org/> - - 164 - - - - MS -Internet Explorer 50 - - - 50. <http://www.microsoft.com/windows/ie/default.asp> - - 165 - - - - -Netscape 51 - - - 51. <http://home.netscape.com/comprod/mirror/client_download.html> - - 166 - - - - Opera 52 - - - 52. <http://www.opera.com/> - - 167 - - - - Also lighter weight graphical browsers: - - 168 - - - - Dillo 53 - - - 53. <http://www.dillo.org/> - - 169 - - - - Epiphany - 54 - - - 54. <http://www.gnome.org/projects/epiphany/> - - 170 - - - - Galeon -55 - - - 55. <http://galeon.sourceforge.net/> - - 171 - - - - And for console/text browsing: - - 172 - - - - elinks 56 - - - 56. <http://elinks.or.cz/> - - 173 - - - - links2 -57 - - - 57. <http://links.twibright.com/> - - 174 - - - - w3m -58 - - - 58. <http://w3m.sourceforge.net/> - - 175 - - - - The html tables output is rendered more accurately across a wider -variety set and older versions of browsers (than the html css output). - - 176 - - - - 1.15.2 XML - - 177 - - - - SiSU generates well formed XML, and multiple versions. An XML -SAX version with a flat/shallow structure, and XML DOM version with a -deeper (embedded) structure. There is also a released working xhtml -module. Examples of SAX and DOM versions are provided within this -document. - - 178 - - - - 1.15.3 ODT:ODF, Open Document Format - ISO/IEC 26300:2006 - - 179 - - - - SiSU generates Open Document Output format. - - 180 - - - - 1.15.4 PDF - portrait and landscape, (through the generation of LaTeX -output which is then transformed to pdf) - - 181 - - - - SiSU outputs LaTeX if required which is easily transformed to -PDF.59 PDF documents are generated on the site from the same -source files and Ruby program that produce html. Landscape -oriented pdf introduced, providing easier screen viewing, they are also -(paper saving, being currently) formatted to have fewer pages than -their portrait equivalents. - - - 59. LaTeX and pdf features introduced 18th June 2001, -Landscape and portrait pdfs introduced 7th October 2001., -Lout is a more recent addition 22th April 2003 - - 182 - - - - -Adobe Reader 60 - - - 60. <http://www.adobe.com/products/acrobat/readstep2.html> - - 183 - - - - Evince - 61 - - - 61. <http://www.gnome.org/projects/evince/> - - 184 - - - - xpdf 62 - - - 62. <http://www.foolabs.com/xpdf/> - - 185 - - - - 1.15.5 Search - loading/populating of relational database while -retaining document structure information, object citation numbering and -other features (currently PostgreSQL and/or SQLite) - - 186 - - - - SiSU (from the same markup input file) automatically feeds into -PostgreSQL63 and/or SQLite64 database (could be any -other of the better relational databases)65 - together with -all additional information related to document structure, and the -alternative ways in which it is generated on the site retained. As -regards scaling of the database, it is as scalable as the database -(here Postgresql or SQLite) and hardware allow. I will prune the images -later. - - - 63. <http://www.postgresql.org/> -
    <http://advocacy.postgresql.org/> -
    <http://en.wikipedia.org/wiki/Postgresql> -
    - - 64. <http://www.hwaci.com/sw/sqlite/> -
    <http://en.wikipedia.org/wiki/Sqlite> -
    - - 65. Relational database features retaining document structure and -citation introduced 15th July 2002 - - 187 -
    - - - This is one of the more interesting output forms, as all the structural -data for the documents are retained (though can be ignored by the user -of the database should they so choose). All site texts/documents are -(currently) streamed to four pgsql database tables: - - 188 - - - - one containing semantic (and other) headers, including, title, -author, subject, (the Dublin Core...); - - 189 - - - - another the substantive texts by individual "paragraph" (or -object) - along with structural information, each paragraph being -identifiable by its paragraph number (if it has one which almost all of -them do), and the substantive text of each paragraph quite naturally -being searchable (both in formatted and clean text versions for -searching); and - - 190 - - - - a third containing endnotes cross-referenced back to the -paragraph from which they are referenced (both in formatted and clean -text versions for searching). - - 191 - - - - a fourth table with a one to one relation with the headers table -contains full text versions of output, eg. pdf, html, xml, and ascii. - - 192 - - - - There is of course the possibility to add further structures. - - 193 - - - - At this level SiSU loads a relational database with documents -broken in to their smallest logical structurally constituent parts, as -text objects, with their object citation number and all other -structural information needed to construct the structured document. -Text is stored (at this text object level) with and without elementary -markup tagging, the stripped version being so as to facilitate ease of -searching. - - 194 - - - - Because the document structure of sites created is clearly defined, and -the text object citation system is available for all forms of output, -it is possible to search the sql database, and either read results from -that database, or just as simply map the results to the html output, -which has richer text markup. - - 195 - - - - The combination of the SiSU citation system with a relational -database is pretty powerful, giving rise to several possibilities. As -individual text objects of a document stored (and indexed) together -with object numbers, and all versions of the document have the same -numbering, complex searches can be tailored to return just the -locations of the search results relevant for all available output -formats, with live links to the precise locations in the database or in -html/xml documents; or, the structural information provided makes it -possible to search the full contents of the database and have headings -in which search content appears, or to search only headings etc. (as -the Dublin Core is incorporated it is easy to make use of that as -well). - - 196 - - - - This is a larger scale project, (with little development on the front -end largely ignored), though the "infrastructure" has been in place -since 2002. - - 197 - - - - 1.15.6 Search - database frontend sample, utilising database and SiSU -features, including object citation numbering (backend currently -PostgreSQL) - - 198 - - - - Sample search frontend -66 A small database and sample query front-end (search from) -that makes use of the citation system, object citation numbering -to demonstrates functionality.67 - - - 66. <http://search.sisudoc.org> - - - 67. (which could be extended further with current back-end). As regards -scaling of the database, it is as scalable as the database (here -Postgresql) and hardware allow. - - 199 - - - - SiSU can provide information on which documents are matched and -at what locations within each document the matches are found. These -results are relevant across all outputs using object citation -numbering, which includes html, XML, LaTeX, PDF and indeed the SQL -database. You can then refer to one of the other outputs or in the SQL -database expand the text within the matched objects (paragraphs) in the -documents matched. - - 200 - - - - (further work needs to be done on the sample search form, which is -rudimentary and only passes simple booleans correctly at present to the -SQL engine) - - 201 - - - - A few canned searches, showing object numbers. Search for: - - 202 - - - - -English documents matching Linux OR Debian - - 203 - - - - -GPL OR Richard Stallman - - 204 - - - - -invention OR innovation in English language - - 205 - - - - -copyright in English language documents - - 206 - - - - Note that the searches done in this form are case sensitive. - - 207 - - - - Expand those same searches, showing the matching text in each document: - - 208 - - - - -English documents matching Linux OR Debian - - 209 - - - - -GPL OR Richard Stallman - - 210 - - - - -invention OR innovation in English language - - 211 - - - - -copyright in English language documents - - 212 - - - - Note you may set results either for documents matched and object number -locations within each matched document meeting the search criteria; or -display the names of the documents matched along with the objects -(paragraphs) that meet the search criteria.68 - - - 68. of this feature when demonstrated to an IBM software innovations -evaluator in 2004 he said to paraphrase: this could be of interest to -us. We have large document management systems, you can search hundreds -of thousands of documents and we can tell you which documents meet your -search criteria, but there is no way we can tell you without opening -each document where within each your matches are found. - - 213 - - - - OCN index mode, (object citation number) the numbers displayed -are relevant (and may be used to reference the match) in any sisu -generated rendition of the text69 the links provided are to -the locations of matches within the html generated by SiSU. - - - 69. OCN are provided for HTML, XML, pdf ... though currently omitted in -plain-text and opendocument format output - - 214 - - - - Paragraph mode, you may alternatively display the text of each -paragraph in which the match was made, again the object/paragraph -numbers are relevant to any SiSU generated/published text. - - 215 - - - - Several options for output - select database to search, show results in -index view (links to locations within text), show results with text, -echo search in form, show what was searched, create and show a "canned -url" for search, show available search fields. Also shows counters -number of documents in which found and number of locations within -documents where found. [could consider sorting by document with most -occurrences of the search result]. - - 216 - - - - Earlier version of the search frontend - Simple search, results with -files in which search found, and locations where found within files. - - 217 - - - - Simple search, results with files in which search found, and text -object (paragraph or endnote) where found within files. - - 218 - - - - 1.15.7 Other forms - - 219 - - - - There are other forms as well, YAML file, Ruby Marshal dumps, -document pre-processing (processing of documents prior to the steps -described here, to produce input suitable for the program) snap in a -new module as required/desired, well formed XML, no problem. - - 220 - - - - 1.16 Concordance / Word Map or rudimentary index - - 221 - - - - Concordance /WordMaps:70 SiSU produces a rudimentary -index based on the words within the text, making use of paragraph -numbers to identify text locations. This is generated in html and -hyper-linked but identifies these words locations in the other document -formats. Though it is possible to search using a search engine, this is -a means for browsing an alphabetical list of words which may suggest -other useful content. - - - 70. Concordance/ WordMaps introduced 15th August 2002 - - 222 - - - - 1.17 Managed (document) directory, database, or site structure - - 223 - - - - SiSU builds the web site (or more generically provides a -suitable directory structure) - placing various output texts in the -hierarchy of the web-site (or db), which (for directories) is a -sub-directory with the name of the text file. - - 224 - - - - 1.18 Batch processing - - 225 - - - - SiSU is a batch processing tool, handling and transforming -multiple (or individual) documents (in many ways) with a single -instruction. - - 226 - - - - 1.19 Integration to superior Gnu/Linux and Unix tools - - 227 - - - - As should have been noted by the above description of SiSU, it -makes use of existing programs found on Gnu /Linux and Unix, -amongst those already mentioned include the LaTeX to pdf converters and -the database PostgreSQL or SQLite. - - 228 - - - - 1.19.1 Backup and version control - - 229 - - - - Unix provides many tools for version control. For documents Subversion, -CVS and even the old RCS are useful for the per-document histories they -provide. - - 230 - - - - For writing code superior (more recent) version control system exist. -These can also be used for documents though they tend to take stamps of -changes across the repository as a whole, rather than for each -individual file that is tracked, (as CVS and RCS do). My personal -preference is for distributed systems such as Git, Mercurial or Darcs, -of which I use Git for both code and documents. - - 231 - - - - Several backup tools exist. At the base level I tend to use rdiff. - - 232 - - - - 1.19.2 Editor support - - 233 - - - - SiSU documents are prepared / marked up in utf-8 text you are -free to use the text editor of your choice. - - 234 - - - - Syntax highlighting for a number of editors are provided. Amongst them -Vim, Kwrite, Kate, Gedit and diakonos. These may be found with -configuration instructions at <http://www.jus.uio.no/sisu/syntax_highlight>. - Vim 71 as of version -7 has built in sytax highlighting for SiSU. - - - 71. <http://www.vim.org/> - - 235 - - - - 1.20 Modular design, need something new add a module - - 236 - - - - Need a new output format that does not already exist, write a new -module. - - 237 - - - - Prefer a new input syntax, you could write a new syntax matching the -existing design, though my personal preference is some uniformity in -entry appearance. If necessary has been fairly easy to extend the -design parameters. It is intended to incorporate some additional basic -semantic tagging, (book, article, author etc.) However, keeping the -requirements for input minimal, and relatively simple has been a design -goal. - - 238 - - - - Endnotes - - 0 - - -
    diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/sisu_description.sst b/data/doc/manuals_generated/sisu_manual/sisu_description/sisu_description.sst deleted file mode 100644 index 05930df7..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/sisu_description.sst +++ /dev/null @@ -1,557 +0,0 @@ -% SiSU 0.57 - -@title: SiSU - -@subtitle: Description - -@creator: Ralph Amissah - -@rights: Copyright (C) Ralph Amissah 2007, part of SiSU documentation, License GPL 3 - -@type: information - -@subject: ebook, epublishing, electronic book, electronic publishing, electronic document, electronic citation, data structure, citation systems, search - -@date.created: 2002-11-12 - -@date.issued: 2002-11-12 - -@date.available: 2002-11-12 - -@date.modified: 2007-08-30 - -@date: 2007-08-30 - -@level: new=C; break=1; num_top=1 - -@skin: skin_sisu_manual - -@bold: /Gnu|Debian|Ruby|SiSU/ - -@links: { SiSU Manual }http://www.jus.uio.no/sisu/sisu_manual/ -{ Book Samples and Markup Examples }http://www.jus.uio.no/sisu/SiSU/2.html -{ SiSU @ Wikipedia }http://en.wikipedia.org/wiki/SiSU -{ SiSU @ Freshmeat }http://freshmeat.net/projects/sisu/ -{ SiSU @ Ruby Application Archive }http://raa.ruby-lang.org/project/sisu/ -{ SiSU @ Debian }http://packages.qa.debian.org/s/sisu.html -{ SiSU Download }http://www.jus.uio.no/sisu/SiSU/download.html -{ SiSU Changelog }http://www.jus.uio.no/sisu/SiSU/changelog.html -{ SiSU help }http://www.jus.uio.no/sisu/sisu_manual/sisu_help/ -{ SiSU help sources }http://www.jus.uio.no/sisu/sisu_manual/sisu_help_sources/ - -% @rcs:+ $Id$ - -:A~? @title @creator - -:B~? SiSU an attempt to describe - -1~ Description - -2~ Outline - -SiSU is a flexible document preparation, generation publishing and search system.~{ This information was first placed on the web 12 November 2002; with predating material taken from http://www.jus.uio.no/lm/lm.information/toc.html part of a site started and developed since 1993. See document metadata section http://www.jus.uio.no/sisu/SiSU/metadata.html for information on this version. Dates related to the development of SiSU are mostly contained within the Chronology section of this document, e.g. http://www.jus.uio.no/sisu/sisu_chronology }~ - -SiSU ("SiSU information Structuring Universe" or "Structured information, Serialized Units"),~{ also chosen for the meaning of the Finnish term "sisu". }~ is a Unix command line oriented framework for document structuring, publishing and search. Featuring minimalistic markup, multiple standard outputs, a common citation system, and granular search. - -Using markup applied to a document, SiSU can produce plain text, HTML, XHTML, XML, OpenDocument, LaTeX or PDF files, and populate an SQL database with objects~{ objects include: headings, paragraphs, verse, tables, images, but not footnotes/endnotes which are numbered separately and tied to the object from which they are referenced. }~ (equating generally to paragraph-sized chunks) so searches may be performed and matches returned with that degree of granularity (e.g. your search criteria is met by these documents and at these locations within each document). Document output formats share a common object numbering system for locating content. This is particularly suitable for "published" works (finalized texts as opposed to works that are frequently changed or updated) for which it provides a fixed means of reference of content. - -SiSU is the data/information structuring and transforming tool, that has resulted from work on one of the oldest law web projects. It makes possible the one time, simple human readable markup of documents, that SiSU can then publish in various forms, suitable for paper~{ pdf via LaTeX or lout }~, web~{ currently html (two forms of html presentation one based on css the other on tables), and /PHP/; potentially structured XML }~ and relational database~{ any SQL - currently PostgreSQL and /sqlite/ (for portability, testing and development) }~ presentations, retaining common data-structure and meta-information across the output/presentation formats. Several requirements of legal and scholarly publication on the web have been addressed, including the age old need to be able to reliably cite/pinpoint text within a document, to easily make footnotes/endnotes, to allow for semantic document meta-tagging, and to keep required markup to a minimum. These and other features of interest are listed and described below. A few points are worth making early (and will be repeated a number of times): - -_1 (i) The SiSU document generator was the first to place material on the web with a system that makes possible citation across different document types, with paragraph, or rather object citation numbering~{ previously called "text object numbering" }~ a text positioning system, available for the pinpointing of text, 1997, a simple idea from which much benefit, and SiSU remains today, to the best of my knowledge, the only multiple format e-book/ electronic-document system on the web that gives you this possibility (including for relational databases). - -_1 (ii) Markup is done once for the multiple formats produced. - -_1 (iii) Markup is simple, and human readable (with a little practice), in almost all cases there is less and simpler markup required than basic html. In any event the markup required is very much simpler than the html, LaTeX, [lout], structured XML, ODF (OpenDocument), PostgreSQL or SQLite feed etc. that you can have SiSU generate for you. - -_1 (iv) SiSU is a batch processor, dealing with as many files as you need to generate at a time. - -_1 (v) Scalability is dependent on your file system (in my case Reiserfs), the database (currently Postgresql and/or SQLite) and your hardware. - -SiSU Sabaki~{ SiSU Sabaki, release version. Pre-release version SiSU Scribe, and version prior to that SiSU nicknamed Scribbler. Pre-release versions go back several years. Both Scribbler and Scribe (still maintained) made system calls to SiSU's various parts, instead of using libraries. }~ (or just SiSU) is the provisional name given to the software described here that helps structure documents for web and other publication. The name SiSU is a loose anagram for something along the lines of *{/{"SiSU is structuring unit"}/}*, or /{"SiSU, information structuring unit"}/ or the more descriptive /{"Structured information, Serialized Units"}/ or *{/{"simple - information structuring unit"}/}* or the more descriptive /{"Structured information, Serialized Units"}/ or what it may be directed towards /{"*semantic* and *{information structuring universe}*"}/,~{ A little universe it may be, but semantic you may have a hard time getting away with, given the meaning the word has taken on with markup. On a document wide basis semantic information may be provided, which can be really useful, (and meaningful, especially) if you have a large document set, and use this with rss feeds or in an sql database etc. On a markup level, I have little inclination to add semantic markup formally beyond references, title, author [Dublin Core entities? addresses?] etc. Actually this deserves a bit of thought possibly use letter tags (including letter alias/synonyms for font faces) to create a small set of default semantic tags, with the possibility for per document adjustments. Will seek to permit XML entity tagging, within SiSU markup and have that ignored/removed by the parts of the program that have no use for it. }~ tongue in cheek, only just. Guess I'll get away with *{/{"Simple - information Structuring Universe"}/}*. SiSU is also a Finnish word roughly meaning guts, inner strength and perseverance.~{ "Sisu refers not to the courage of optimism, but to a concept of life that says, 'I may not win, but I will gladly give my life for what I believe.'" Aini Rajanen, Of Finnish Ways, 1981, p. 10.
    http://www.humanlanguages.com/finnishenglish/rlfs.htm
    "Every Finn has his own pet definition. To me, sisu means patience without passion. But there are many varieties of sisu. Sisu can be a sudden outburst or it can be the kind that lasts. A man can have both kinds. It is outside reason. It is something in the soul. It comes from oneself. For instance, it makes a soldier do things because he himself must, not because he has been told." Paavo Nurmi
    http://personalweb.smcvt.edu/tmatikainen/finnishtraditions.htm }~ - -SiSU was born of the need to find a way, with minimal effort, and for as wide a range of document types as possible, to produce high quality publishing output in a variety of document formats. As such it was necessary to find a simple document representation that would work across a large number of document types, and the most convenient way(s) to produce acceptable output formats. The project leading to this program was started in 1993 (together with the trade law project now known as Lex Mercatoria) as an investigation of how to effectively/efficiently place documents on the web. The unified document handling, together with features such as paragraph numbering, endnote handling and tables... appeared in 1996/97. SiSU was originally written in Perl,~{ http://www.perl.org/ }~ and converted to Ruby,~{ http://www.ruby-lang.org/en/ }~ in 2000, one of the most impressive programming languages in existence! In its current form it has been written to run on the Gnu/Linux platform, and in particular on Debian,~{ http://www.debian.org/ }~ taking advantage of many of the wonderful projects that are available there. - -SiSU markup is based on requiring the minimum markup needed to determine the structure of a document. (This can be as little as saying in a header to look for the word Book at a specified level and the word Chapter at another level). SiSU then breaks a document into its smallest parts (at a heading, and paragraph level) while retaining all structural information. This break up of the document and information on its structure is taken advantage of in the transformations made in generating the very different output types that can be created, and in providing as much as can be for what each output type is best at doing, e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), ODF (OpenDocument [experimental]), SQL (e.g. document search; representing constituent parts of documents based on their structure, headings, chapters, paragraphs as required; user control).~{ where explicit structure is provided through the use of tagging headings, it could be reduced (still) further, for example by reducing the number of characters used to identify heading levels; but in many cases even that information is not required as regular expressions can be used to extract the implicit structure. }~ - -From markup that is simpler and more sparse than html you get: - -_* far greater output possibilities, including html, XML, ODF (OpenDocument), LaTeX (pdf), and SQL; - -_* the advantages implicit in the very different output possibilities; - -_* a common citation system (for all outputs - including the relational database, search results are relevant for all outputs); - -For more see the short summary of features provided below. - -SiSU processes files with minimal tagging to produce various document outputs including html, LaTeX or lout (which is converted to pdf) and if required loads the structured information into an SQL database (PostgreSQL and SQLite have been used for this). SiSU produces an intermediate processing format.~{ This proved to be the easiest way to develop syntax, changes could be made, or alternatives provided for the markup syntax whilst the intermediate markup syntax was largely held constant. There is actually an optional second intermediate markup format in YAML http://www.yaml.org/ }~ - -SiSU is used in constructing Lex Mercatoria http://lexmercatoria.org/ or http://www.jus.uio.no/lm/ (one of the oldest law web sites), and considerable thought went into producing output that would be suitable for legal and academic writings (that do not have formulae) given the limitations of html, and publication in a wide variety of "formats", in particular in relation to the convenient and accurate citation of text. However, the construction of Lex Mercatoria uses only a fraction of the features available from SiSU today, /vis/ generation of flat file structures, rather than in addition the building of ("granular") SQL database content, (at an object level with relevant relational tables, and other outputs also available). - -2~ Short summary of features *~summary - -!_ (i) -markup syntax: (a) simpler than html, (b) mnemonic, influenced by mail/messaging/wiki markup practices, (c) human readable, and easily writable, - -!_ (ii) -(a) minimal markup requirement, (b) single file marked up for multiple outputs, - -notes: - -* documents are prepared in a single UTF-8 file using a minimalistic mnemonic syntax. Typical literature, documents like "War and Peace" require almost no markup, and most of the headers are optional. - -* markup is easily readable/parsed by the human eye, (basic markup is simpler and more sparse than the most basic html), [this may also be converted to XML representations of the same input/source document]. - -* markup defines document structure (this may be done once in a header pattern-match description, or for heading levels individually); basic text attributes (bold, italics, underscore, strike-through etc.) as required; and semantic information related to the document (header information, extended beyond the Dublin core and easily further extended as required); the headers may also contain processing instructions. - -!_ (iii) -(a) multiple outputs primarily industry established and institutionally accepted open standard formats, include amongst others: plaintext (UTF-8); html; (structured) XML; ODF (Open Document text)l; LaTeX; PDF (via LaTeX); SQL type databases (currently PostgreSQL and SQLite). Also produces: concordance files; document content certificates (md5 or sha256 digests of headings, paragraphs, images etc.) and html manifests (and sitemaps of content). (b) takes advantage of the strengths implicit in these very different output types, (e.g. PDFs produced using typesetting of LaTeX, databases populated with documents at an individual object/paragraph level, making possible granular search (and related possibilities)) - -!_ (iv) -outputs share a common numbering system (dubbed "object citation numbering" (ocn)) that is meaningful (to man and machine) across various digital outputs whether paper, screen, or database oriented, (PDF, html, XML, sqlite, postgresql), this numbering system can be used to reference content. - -!_ (v) -SQL databases are populated at an object level (roughly headings, paragraphs, verse, tables) and become searchable with that degree of granularity, the output information provides the object/paragraph numbers which are relevant across all generated outputs; it is also possible to look at just the matching paragraphs of the documents in the database; [output indexing also work well with search indexing tools like hyperesteier]. - -!_ (vi) -use of semantic meta-tags in headers permit the addition of semantic information on documents, (the available fields are easily extended) - -!_ (vii) -creates organised directory/file structure for (file-system) output, easily mapped with its clearly defined structure, with all text objects numbered, you know in advance where in each document output type, a bit of text will be found (e.g. from an SQL search, you know where to go to find the prepared html output or PDF etc.)... there is more; easy directory management and document associations, the document preparation (sub-)directory may be used to determine output (sub-)directory, the skin used, and the SQL database used, - -!_ (viii) -"Concordance file" wordmap, consisting of all the words in a document and their (text/ object) locations within the text, (and the possibility of adding vocabularies), - -!_ (ix) -document content certification and comparison considerations: (a) the document and each object within it stamped with an md5 hash making it possible to easily check or guarantee that the substantive content of a document is unchanged, (b)version control, documents integrated with time based source control system, default RCS or CVS with use of $Id: sisu_description.sst,v 1.25 2007/08/23 12:22:36 ralph Exp $ tag, which SiSU checks - -!_ (x) -SiSU's minimalist markup makes for meaningful "diffing" of the substantive content of markup-files, - -!_ (xi) -easily skinnable, document appearance on a project/site wide, directory wide, or document instance level easily controlled/changed, - -!_ (xii) -in many cases a regular expression may be used (once in the document header) to define all or part of a documents structure obviating or reducing the need to provide structural markup within the document, - -!_ (xiii) -prepared files may be batch process, documents produced are static files so this needs to be done only once but may be repeated for various reasons as desired (updated content, addition of new output formats, updated technology document presentations/representations) - -!_ (xiv) -possible to pre-process, which permits: the easy creation of standard form documents, and templates/term-sheets, or; building of composite documents (master documents) from other sisu marked up documents, or marked up parts, i.e. import documents or parts of text into a main document should this be desired - -there is a considerable degree of future-proofing, output representations are "upgradeable", and new document formats may be added. - -!_ (xv) -there is a considerable degree of future-proofing, output representations are "upgradeable", and new document formats may be added: (a) modular, (thanks in no small part to Ruby) another output format required, write another module.... (b) easy to update output formats (eg html, XHTML, LaTeX/PDF produced can be updated in program and run against whole document set), (c) easy to add, modify, or have alternative syntax rules for input, should you need to, - -!_ (xvi) -scalability, dependent on your file-system (ext3, Reiserfs, XFS, whatever) and on the relational database used (currently Postgresql and SQLite), and your hardware, - -!_ (xvii) -only marked up files need be backed up, to secure the larger document set produced, - -!_ (xviii) -document management, - -!_ (xix) -Syntax highlighting for SiSU markup is available for a number of text editors. - -!_ (xx) -remote operations: (a) run SiSU on a remote server, (having prepared sisu markup documents locally or on that server, i.e. this solution where sisu is installed on the remote server, would work whatever type of machine you chose to prepare your markup documents on), (b) generated document outputs may be posted by sisu to remote sites (using rsync/scp) (c)document source (plaintext utf-8) if shared on the net may be identified by its url and processed locally to produce the different document outputs. - -!_ (xxi) -document source may be bundled together (automatically) with associated documents (multiple language versions or master document with inclusions) and images and sent as a zip file called a sisupod, if shared on the net these too may be processed locally to produce the desired document outputs, these may be downloaded, shared as email attachments, or processed by running sisu against them, either using a url or the filename. - -!_ (xxii) -for basic document generation, the only software dependency is Ruby, and a few standard Unix tools (this covers plaintext, html, XML, ODF, LaTeX). To use a database you of course need that, and to convert the LaTeX generated to PDF, a LaTeX processor like tetex or texlive. - -as a developers tool it is flexible and extensible - -SiSU was developed in relation to legal documents, and is strong across a wide variety of texts (law, literature...). SiSU handles images but is not suitable for formulae/ statistics, or for technical writing at this time. - -SiSU has been developed and has been in use for several years. Requirements to cover a wide range of documents within its use domain have been explored. - -Some modules are more mature than others, the most mature being Html and LaTeX / pdf. PostgreSQL and search functions are useable and together with /ocn/ unique (to the best of my knowledge). The XML output document set is "well formed" but largely proof of concept. - -2~ How it works - -SiSU markup is fairly minimalistic, it consists of: a (largely optional) document header, made up of information about the document (such as when it was published, who authored it, and granting what rights) and any processing instructions; and markup within text which is related to document structure and typeface. SiSU must be able to discern the structure of a document, (text headings and their levels in relation to each other), either from information provided in the instruction header or from markup within the text (or from a combination of both). Processing is done against an abstraction of the document comprising of information on the document's structure and its objects,~{ objects include: headings, paragraphs, verse, tables, images, but not footnotes/endnotes which are numbered separately and tied to the object from which they are referenced. }~ which the program serializes (providing the object numbers) and which are assigned hash sum values based on their content. This abstraction of information about document structure, objects, (and hash sums), provides considerable flexibility in representing documents different ways and for different purposes (e.g. search, document layout, publishing, content certification, concordance etc.), and makes it possible to take advantage of some of the strengths of established ways of representing documents, (or indeed to create new ones). - -2~ Simple markup - -SiSU markup is based on requiring the minimum markup needed to determine the structure of a document. (This can be as little as saying in a header to look for the word Book at a specified level and the word Chapter at another level). SiSU then breaks a document into its smallest parts (at a heading, and paragraph level) while retaining all structural information. This break up of the document and information on its structure is taken advantage of in the transformations made in generating the very different output types that can be created, and in providing as much as can be for what each output type is best at doing, e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), ODF (OpenDocument), SQL (e.g. document search; representing constituent parts of documents based on their structure, headings, chapters, paragraphs as required; user control).~{ where explicit structure is provided through the use of tagging headings, it could be reduced (still) further, for example by reducing the number of characters used to identify heading levels; but in many cases even that information is not required as regular expressions can be used to extract the implicit structure. }~ - -3~ Sparse markup requirement, try to get the most out of markup - -One of its strengths is that very small amounts of initial tagging is required for the program to generate its output. - -This is a basic markup example: - -_* { basic markup example, text file - an international convention }http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst ~{ http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst output provided as example in the next section }~ - -_* { view basic markup, as it would be highlighted by vim editor }http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html ~{ http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html as it would appear with syntax highlighting (by vim) }~ - -Emphasis has been on simplicity and minimalism in markup requirements. Design philosophy is to try keep the amount of markup required low, for whatever has been determined to be acceptable output.~{ seems there are several "smart ASCIIs" available, primarily for ascii to html conversion, that make this, and reasonable looking ascii their goal
    http://webseitz.fluxent.com/wiki/SmartAscii
    http://daringfireball.net/projects/markdown/
    http://www.textism.com/tools/textile/ }~ - -SiSU's markup is more minimalistic and simpler than (the equivalent) html and for it, you get considerably more than just html, as this preparation gives you all available output formats, upon request. - -3~ Single markup file provides multiple output formats - -For each document, there is only one (input, minimalistically marked up) file from which all the available output types are generated.~{ These include richly laid out and linked html (table or css variants), /PHP/, LaTeX (from which pdf portrait and landscape documents are produced), texinfo (for info files etc.), and PostgreSQL and/or SQLite. And the opportunity to fairly easily build additional modules, such as XML. See the examples provided in this document. }~ - -Eg. the markup example: - -_* {~^ original text file - an international convention }http://www.jus.uio.no/sisu/sample/markup/un_contracts_international_sale_of_goods_convention_1980.sst - -_* {~^ view as syntax would be highlighted by vim editor }http://www.jus.uio.no/sisu/sample/syntax/un_contracts_international_sale_of_goods_convention_1980.sst.html - -Produces the following output: - -_* {~^ Segmented html version of document }http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/toc.html - -_* {~^ Full length html document }http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/doc.html - -_* {~^ pdf landscape version of document }http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/landscape.pdf - -_* {~^ pdf portrait version of document }http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/portrait.pdf - -_* {~^ clean tex ascii version of document }http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/plain.txt - -_* {~^ /xml/ sax version of document }http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/sax.xml - -_* {~^ /xml/ dom version of document }http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/dom.xml - -_* {~^ Concordance }http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/concordance.html - -(and in addition to these: PostgreSQL, SQLite, texinfo and -{YAML}- ~{ discontinued for the time being }~ versions if desired) - -3~ Syntax relatively easy to read and remember - -Syntax is kept simple and mnemonic.~{ SiSU markup syntax, an incomplete summary: http://www.jus.uio.no/sisu/sisu_markup_table/doc.html#h200306
    Visual check of elementary font face modifiers: *bold* *{bold}* !{emphasis}! /{italics}/ _{underscore}_ -{strikethrough}- ^{superscript}^ ,{subscript}, }~ - -3~ Kept simple by having a limited publishing feature set, and features identified as most important, are available across several document types - -To keep SiSU markup sparse and simple SiSU deliberately provides a limited publishing feature set, including: indent levels; bold; italics; superscript; subscript; simple tables; images; tables of contents and; endnotes. Which in most cases are available across the different output formats. - -The publishing feature set may be expanded as required. - -2~ Designed with usability in mind - -Output is designed to be uniform, easy to read, navigate and cite. - -2~ Code separate from content - -Code~{ the program that generates the documents }~ is separated from content. This means that when changes are desired in the output presentation, the code that produces them, and not the marked up text data set (which could be thousands of documents) is modified. Separating code from content makes large scale changes to output appearance trivial, and permits the easy addition of new output modules. - -2~ Object citation numbering, a text or object positioning / citation system - "paragraph" (or text object) numbering, that remains same and usable across all output formats by people and machine *~citation *~ocn - -Object citation numbering is a simple object (text) positioning and cition system that is human relevant and machine useable, used by SiSU for all manner of presentations, and that is available for use in all text mappings. It is based on the automated sequential numbering of objects (roughly paragraphs, (headings, tables, verse) or other blocks of text or images etc.). The text positioning system (in which I claim copyright) is invaluable for publishing requiring the citing text across multiple output formats, and for the general mapping of text within a document: - -_* in html, html not being easily citeable (change font size, or use a different browser and the page on which specific text appears has changed), and - -_* across multiple formats being common to all output formats html/xml/pdf/sql output, - -_* the results of an sql search can just be "live" citation references to the documents in which the text is found, {~^ much like an index (see image examples provided). }http://www.jus.uio.no/sisu/SiSU/1.html#search - -I claim copyright on the system I use which is the most basic of all, numbering all text in headings and paragraphs sequentially (with tables and images being treated as a single paragraph) and only footnotes/endnotes not following this numbering, as their position in text is not strictly determined, (a change from footnotes to endnotes would change their numbering), footnotes instead "belong" to the paragraph from which they are referenced, and have sequential numbers of their own. - -SiSU has a paragraph numbering system, that remains the same regardless of the output format. This provides an effective means of citation, pinpointing text accurately in all output formats, using the same reference. This is particularly useful where text has to be located across different output formats - for example once html is printed the number of pages and pages on which given text is found will vary depending on the browser, its settings the font size setting etc. Similarly SiSU produces pdf in different forms, eg. on the example site Lex Mercatoria as portrait and landscape documents - here too page numbering varies, but paragraph numbering is the same, /{vis a vis}/ all versions of the text (portrait and landscape pdf and the html versions of the text, and as stored (with "paragraphs" as records) to the PostgreSQL or SQLite database). - -These numbers are placed in the text margins and are intended to be independent of and not to interfere with authors tagging. [The citation system (object citation numbering system, automated "paragraph numbering") which is automatically generated and is common and identical across all document formats] The paragraph numbering system is more accurately described as an (text) object numbering system, as headings are also numbered... all headings and paragraphs are numbered sequentially. Endnotes are automatically numbered independently and rather "belong" to the paragraph from which they are referenced, as an endnote does not (necessarily) form a part of a documents sequence, (they may be produced as either endnotes or footnotes (or both depending on what output you choose to look at - if you take the segmented html version document provided as an example, you will find that the endnotes are placed both at the end of each section, and in a separate section of their own called endnotes, and these are hyper-linked)). An attractive feature of providing citation numbering in this way is that it is independent of the document structure... it remains the same regardless of what is done about the document structure. - -The rules have been kept very simple, unique incremental object citation numbers are assigned to headings, paragraphs, verse, tables and images. It is possible to manually override this feature on a per heading or comment basis though this should be used exceptionally, it may be of use where there a substantive text, and the addition of a minor comment by the publisher that should not be mapped as part of the text. - -The object citation number markers contain additional numbering information with regard to the document structure, that can be used for alternative presentations, including such detail as the type of object (heading, paragraph, table, image, etc.), numbered sequentially. - -An advantage is that the numbering remains the same regardless of document structure. - -Text object ("paragraph") numbering is the same for all output versions of the same document, vis html, pdf, pgsql, yaml etc. - -In the relational database, as individual text objects of a document stored (and indexed) together with object numbers, and all versions of the document have the same numbering, the results of searches may be tailored just to provide the location of the search result in all available document formats. - -/{ Note: there is a bug in the released behaviour of object citation numbering, (not certain when it was introduced) tables should be numbered, ie each table gets an ocn, required amongst other things for relational database. This will be corrected in a future release. Citation numbering of existing documents that contain tables will changed. }/ - -2~ Handling of Dublin Core meta-tags making use of the Resource Description Framework - -SiSU is able to use meta tags based on the Dublin Core~{ http://dublincore.org/ }~ and Resource Description Framework~{ http://www.w3.org/RDF/ }~ - -This provides the means of providing semantic information about a document, both as computer processable meta-tags, and as human readable information that may be of value for classification purposes. - -This information is provided both in html metatags, and (where available) under the section titled "Document Information - MetaData", near the end of a document, for example in the segmented html version of this text at: http://www.jus.uio.no/sisu/SiSU/metadata.html - -2~ Easy directory management - -#1 Directory file association, skins and special image management, made simpler.~{ The previous way was directory associations for file output were set up in the configuration file. The present system is a more natural way to work requireing less configuration. }~ - -The last part of the name of the work directory in which markup is being done, or rather from where SiSU is run in order to generate document output, is used in determining the sub-directory name for output files, that is created in the document output directory. This provides a rather easy way to associate documents e.g. of a given subject, or by owner. - -code{ - -/www/docs - /intellectual_property - /arbitration - /contract_law - -/www/docs - /ralph - /sisu - -}code - -all are placed in their own directories within the directory structure created. Similar rules are used in the creation of sql type databases (though they can be overridden). - -There are a couple of further associations with these directories. - -Directory wide skins. - -Directory specific images. - -# If there is a "directory skin", that is a skin of the same name as the directory, it is used in the generation of the documents within it, rather than the default skin, unless the document has a specific skin associated with it. - -_# default skin (always available) - -_# directory skin (precedence over default if exists) - -_# document skin (takes precedence wherever document requests a specific skin) - -Skins are defined in the document skin directory and if a directory association is desired a softlink made to the relevant skin. Skins (directory association auto load) auto load skin if a directory skin exists of same name as directory stub, (and there is no specific doc skin) - -# If the working directory has within it a sub-directory called image_local, the images within that directory are used for references to images, that are not part of the default site build. - -2~ Document Version Control Information - -The possibility of citing an exact document version. - -Permits the inclusion of document version control information to the document body and metatags.~{ from a version control system such as CVS }~ This provides a much more certain method of referring to the exact version of a particular document, (assuming that the document is from a trusted source, that will retain earlier versions of a document).~{ The version control system must be run, so the version number is obtained, prior to the SiSU document generation, and subsequent posting of the document. }~ - -This information (where available) is provided under the section of the document titled "Document Information - MetaData", near the end of a document, for example in the segmented html version of this text at: http://www.jus.uio.no/sisu/SiSU/metadata.html - -2~ Table of contents - -SiSU produces a rudimentary a table of contents based on document headings. - -2~ Auto-numbering of headings - -Headings can be automatically numbered, (and automatically named for hyper-linking) - -2~ Numbering and cross-hyperlinking of endnotes - -SiSU can automatically number footnotes/endnotes. This is the default operation where no number is provided. - -Footnotes/endnotes may also be manually numbered. Where a number, or numbers are provided for a footnote/endnote, this does not increment the automatic footnote/endnote number counter. - -In the html output footnotes/endnotes are cross-hyper-linked (to their reference point and vice versa). In th pdf output footnotes are linked from their reference point only. - -2~ "Skinnable" - -SiSU is skinnable, on a site-wide, directory-wide and per document basis, so different looking versions of things may be produced with little difficulty. There is a default skin which may be modified, as the background site skin, and each working directory may have a skin associated with it, as may each individual document. The hierarchy of application is document, directory, then site... ie if a document skin exists it gets precedence. - -Whilst it is skinnable, the default output styles are selected to work across the widest possible range of document types. - -2~ Multiple Outputs - -From markup that is simpler and more sparse than html you get: - -_* far greater output possibilities, including multiple html types, XML (different structured types), LaTeX (pdf landscape, portrait), and SQL (Postgresql or SQLite or other); - -_* the advantages implicit in these very different output possibilities;~{ e.g. LaTeX (professional document typesetting, easy conversion to pdf or Postscript), XML (in this case, structural representation), SQL (e.g. document set searches; representation of the constituent parts of documents based on their structure, headings, chapters, paragraphs as desired; control of use) }~ - -_* a common citation system - -As many output formats/presentations as one cares to write modules for - several types of html (e.g. structure based on css, or structure based on tables); /{LaTeX/pdf}/ and /{Lout/pdf}/; pgsql other databases easily added; yaml... - -3~ html - several presentations: full length & segmented; css & table based - -Most documents are produced in single and segmented html versions, described below: - -!_ The Scroll (full length text presentations) - -The full length of the text in a single scrollable document.~{ CISG http://www.jus.uio.no/lm/un_contracts_international_sale_of_goods_convention_1980/doc
    The Unidroit Contract Principles http://www.jus.uio.no/lm/unidroit.contract.principles.1994/doc or
    The Autonomous Contract http://www.jus.uio.no/lm/autonomous.contract.2000.amissah/doc }~ As a rule the files they are saved in are named: /doc/ or more precisely /{doc.html}/ - -For various reasons texts may only be provided in this form (such as this one which is short), though most are also provided as segmented texts. - -"Scroll" is a reference to the historical scroll, a single long document/ parchment, and also no doubt to what you will have to do to get to the bottom of the text.~{ Scrolling is not however necessarily confined to full length documents as you will have to scroll to get to the bottom of any long segment (eg. chapter) of a segmented text. }~ - -!_ The Segmented Text - -The text divided into segments (such as articles or chapters depending on the text)~{ CISG http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980
    The Unidroit Principles http://www.jus.uio.no/lm/unidroit.contract.principles.1994
    The Autonomous Contract http://www.jus.uio.no/sisu/the.autonomous.contract.2000.amissah or
    WTA 1994 http://www.jus.uio.no/lm/wta.1994 }~ As a rule the files they are saved in are named: /toc/ and /index/ or more precisely /{toc.html}/ and /{index.html}/ - -If you know exactly what you are looking for, loading a segment of text is faster (the segments being smaller). Occasionally longer documents such as the WTA 1994 http://www.jus.uio.no/lm/wta.1994/toc are only provided in segmented form. - -!_ Cascading Style Sheet, and Table based html - -SiSU outputs html, two current standard forms available are: - -{ css based }http://www.jus.uio.no/sisu/SiSU/toc.html - -and - -table based [largely discontinued]~{ formatting possibility still exists in code tree but maintenance has been largely discontinuted. }~ - -!_ The html is tested across several browsers - -I like to remind you that there are other excellent browsers out there, many of which have long supported practical features like tabbing. - -The html is tested across several browsers, including: - -_* {~^ *Firefox* (Mozilla-Firefox) }http://www.mozilla.org/products/firefox/ - -_* {~^ Kazehakase }http://kazehakase.sourceforge.jp/ - -_* {~^ Konqueror }http://www.konqueror.org/ - -_* {~^ Mozilla }http://www.mozilla.org/ - -_* {~^ MS Internet Explorer }http://www.microsoft.com/windows/ie/default.asp - -_* {~^ Netscape }http://home.netscape.com/comprod/mirror/client_download.html - -_* {~^ Opera }http://www.opera.com/ - -Also lighter weight graphical browsers: - -_* {~^ Dillo }http://www.dillo.org/ - -_* {~^ *Epiphany* }http://www.gnome.org/projects/epiphany/ - -_* {~^ *Galeon* }http://galeon.sourceforge.net/ - -And for console/text browsing: - -_* {~^ *elinks* }http://elinks.or.cz/ - -_* {~^ *links2* }http://links.twibright.com/ - -_* {~^ *w3m* }http://w3m.sourceforge.net/ - -The html tables output is rendered more accurately across a wider variety set and older versions of browsers (than the html css output). - -3~ XML - -SiSU generates well formed XML, and multiple versions. An XML SAX version with a flat/shallow structure, and XML DOM version with a deeper (embedded) structure. There is also a released working xhtml module. Examples of SAX and DOM versions are provided within this document. - -3~ ODT:ODF, Open Document Format - ISO/IEC 26300:2006 - -SiSU generates Open Document Output format. - -3~ PDF - portrait and landscape, (through the generation of LaTeX output which is then transformed to pdf) - -SiSU outputs LaTeX if required which is easily transformed to PDF.~{ LaTeX and pdf features introduced 18^th^ June 2001, Landscape and portrait pdfs introduced 7^th^ October 2001., Lout is a more recent addition 22^th^ April 2003 }~ PDF documents are generated on the site from the same source files and Ruby program that produce html. Landscape oriented pdf introduced, providing easier screen viewing, they are also (paper saving, being currently) formatted to have fewer pages than their portrait equivalents. - -_* {~^ Adobe Reader }http://www.adobe.com/products/acrobat/readstep2.html - -_* {~^ *Evince* }http://www.gnome.org/projects/evince/ - -_* {~^ xpdf }http://www.foolabs.com/xpdf/ - -3~ Search - loading/populating of relational database while retaining document structure information, object citation numbering and other features (currently PostgreSQL and/or SQLite) - -SiSU (from the same markup input file) automatically feeds into PostgreSQL~{ http://www.postgresql.org/
    http://advocacy.postgresql.org/
    http://en.wikipedia.org/wiki/Postgresql }~ and/or SQLite~{ http://www.hwaci.com/sw/sqlite/
    http://en.wikipedia.org/wiki/Sqlite }~ database (could be any other of the better relational databases)~{ Relational database features retaining document structure and citation introduced 15^th^ July 2002 }~ - together with all additional information related to document structure, and the alternative ways in which it is generated on the site retained. As regards scaling of the database, it is as scalable as the database (here Postgresql or SQLite) and hardware allow. I will prune the images later. - -This is one of the more interesting output forms, as all the structural data for the documents are retained (though can be ignored by the user of the database should they so choose). All site texts/documents are (currently) streamed to four pgsql database tables: - -_1* one containing semantic (and other) headers, including, title, author, subject, (the Dublin Core...); - -_1* another the substantive texts by individual "paragraph" (or object) - along with structural information, each paragraph being identifiable by its paragraph number (if it has one which almost all of them do), and the substantive text of each paragraph quite naturally being searchable (both in formatted and clean text versions for searching); and - -_1* a third containing endnotes cross-referenced back to the paragraph from which they are referenced (both in formatted and clean text versions for searching). - -_1* a fourth table with a one to one relation with the headers table contains full text versions of output, eg. pdf, html, xml, and ascii. - -There is of course the possibility to add further structures. - -At this level SiSU loads a relational database with documents broken in to their smallest logical structurally constituent parts, as text objects, with their object citation number and all other structural information needed to construct the structured document. Text is stored (at this text object level) with and without elementary markup tagging, the stripped version being so as to facilitate ease of searching. - -Because the document structure of sites created is clearly defined, and the text object citation system is available for all forms of output, it is possible to search the sql database, and either read results from that database, or just as simply map the results to the html output, which has richer text markup. - -The combination of the SiSU citation system with a relational database is pretty powerful, giving rise to several possibilities. As individual text objects of a document stored (and indexed) together with object numbers, and all versions of the document have the same numbering, complex searches can be tailored to return just the locations of the search results relevant for all available output formats, with live links to the precise locations in the database or in html/xml documents; or, the structural information provided makes it possible to search the full contents of the database and have headings in which search content appears, or to search only headings etc. (as the Dublin Core is incorporated it is easy to make use of that as well). - -This is a larger scale project, (with little development on the front end largely ignored), though the "infrastructure" has been in place since 2002. - -3~ Search - database frontend sample, utilising database and SiSU features, including object citation numbering (backend currently PostgreSQL) *~search - -{~^ Sample search frontend }http://search.sisudoc.org -A small database and sample query front-end (search from) that makes use of the citation system, _{object citation numbering}_ to demonstrates functionality.~{ (which could be extended further with current back-end). As regards scaling of the database, it is as scalable as the database (here Postgresql) and hardware allow. }~ - -SiSU can provide information on which documents are matched and at what locations within each document the matches are found. These results are relevant across all outputs using object citation numbering, which includes html, XML, LaTeX, PDF and indeed the SQL database. You can then refer to one of the other outputs or in the SQL database expand the text within the matched objects (paragraphs) in the documents matched. - -(further work needs to be done on the sample search form, which is rudimentary and only passes simple booleans correctly at present to the SQL engine) - -A few canned searches, showing object numbers. Search for: - -{ English documents matching Linux OR Debian }http://search.sisudoc.org?s1=Linux%2BOR%2BDebian&lang=En&db=SiSU_sisu&view=index&a=1 - -{ GPL OR Richard Stallman }http://search.sisudoc.org?s1=GPL%2BOR%2BRichard%2BStallman&lang=En&db=SiSU_sisu&view=index&a=1 - -{ invention OR innovation in English language }http://search.sisudoc.org?s1=invention%2BOR%2Binnovation&lang=En&db=SiSU_sisu&view=index&a=1 - -{ copyright in English language documents }http://search.sisudoc.org?s1=copyright&lang=En&db=SiSU_sisu&view=index&a=1 - -Note that the searches done in this form are case sensitive. - -Expand those same searches, showing the matching text in each document: - -{ English documents matching Linux OR Debian }http://search.sisudoc.org?s1=Linux%2BOR%2BDebian&lang=En&db=SiSU_sisu&view=text&a=1 - -{ GPL OR Richard Stallman }http://search.sisudoc.org?s1=GPL%2BOR%2BRichard%2BStallman&lang=En&db=SiSU_sisu&view=text&a=1 - -{ invention OR innovation in English language }http://search.sisudoc.org?s1=invention%2BOR%2Binnovation&lang=En&db=SiSU_sisu&view=text&a=1 - -{ copyright in English language documents }http://search.sisudoc.org?s1=copyright&lang=En&db=SiSU_sisu&view=text&a=1 - -Note you may set results either for documents matched and object number locations within each matched document meeting the search criteria; or display the names of the documents matched along with the objects (paragraphs) that meet the search criteria.~{ of this feature when demonstrated to an IBM software innovations evaluator in 2004 he said to paraphrase: this could be of interest to us. We have large document management systems, you can search hundreds of thousands of documents and we can tell you which documents meet your search criteria, but there is no way we can tell you without opening each document where within each your matches are found. }~ - -!_ OCN index mode, -(object citation number) the numbers displayed are relevant (and may be used to reference the match) in any sisu generated rendition of the text~{ OCN are provided for HTML, XML, pdf ... though currently omitted in plain-text and opendocument format output }~ the links provided are to the locations of matches within the html generated by SiSU. - -!_ Paragraph mode, -you may alternatively display the text of each paragraph in which the match was made, again the object/paragraph numbers are relevant to any SiSU generated/published text. - -Several options for output - select database to search, show results in index view (links to locations within text), show results with text, echo search in form, show what was searched, create and show a "canned url" for search, show available search fields. Also shows counters number of documents in which found and number of locations within documents where found. [could consider sorting by document with most occurrences of the search result]. - -Earlier version of the search frontend - Simple search, results with files in which search found, and locations where found within files. - -Simple search, results with files in which search found, and text object (paragraph or endnote) where found within files. - -3~ Other forms - -There are other forms as well, YAML file, Ruby Marshal dumps, document pre-processing (processing of documents prior to the steps described here, to produce input suitable for the program) snap in a new module as required/desired, well formed XML, no problem. - -2~ Concordance / Word Map or rudimentary index - -Concordance /WordMaps:~{ Concordance/ WordMaps introduced 15^th^ August 2002 }~ SiSU produces a rudimentary index based on the words within the text, making use of paragraph numbers to identify text locations. This is generated in html and hyper-linked but identifies these words locations in the other document formats. Though it is possible to search using a search engine, this is a means for browsing an alphabetical list of words which may suggest other useful content. - -% Concordance files may be built using a document vocabulary. The vocabulary to be used may be specified on a per document basis. - -2~ Managed (document) directory, database, or site structure - -SiSU builds the web site (or more generically provides a suitable directory structure) - placing various output texts in the hierarchy of the web-site (or db), which (for directories) is a sub-directory with the name of the text file. - -2~ Batch processing - -SiSU is a batch processing tool, handling and transforming multiple (or individual) documents (in many ways) with a single instruction. - -2~ Integration to superior Gnu/Linux and Unix tools - -As should have been noted by the above description of SiSU, it makes use of existing programs found on Gnu/Linux and Unix, amongst those already mentioned include the LaTeX to pdf converters and the database PostgreSQL or SQLite. - -3~ Backup and version control - -Unix provides many tools for version control. For documents Subversion, CVS and even the old RCS are useful for the per-document histories they provide. - -For writing code superior (more recent) version control system exist. These can also be used for documents though they tend to take stamps of changes across the repository as a whole, rather than for each individual file that is tracked, (as CVS and RCS do). My personal preference is for distributed systems such as Git, Mercurial or Darcs, of which I use Git for both code and documents. - -Several backup tools exist. At the base level I tend to use rdiff. - -3~ Editor support - -SiSU documents are prepared / marked up in utf-8 text _{you are free to use the text editor of your choice.}_ - -Syntax highlighting for a number of editors are provided. Amongst them Vim, Kwrite, Kate, Gedit and diakonos. These may be found with configuration instructions at http://www.jus.uio.no/sisu/syntax_highlight. {~^ Vim }http://www.vim.org/ as of version 7 has built in sytax highlighting for SiSU. - -2~ Modular design, need something new add a module - -Need a new output format that does not already exist, write a new module. - -Prefer a new input syntax, you could write a new syntax matching the existing design, though my personal preference is some uniformity in entry appearance. If necessary has been fairly easy to extend the design parameters. It is intended to incorporate some additional basic semantic tagging, (book, article, author etc.) However, keeping the requirements for input minimal, and relatively simple has been a design goal. - diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/sisu_manifest.html b/data/doc/manuals_generated/sisu_manual/sisu_description/sisu_manifest.html deleted file mode 100644 index ea1a588e..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/sisu_manifest.html +++ /dev/null @@ -1,197 +0,0 @@ - - - - -SiSU manifest: SiSU - - - - - - - - - - -
    SiSU -->

    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/>

    - -
    - - -
    - - - idx - txt - -
    -
    -
    - -

    SiSU - Description

    -

    Ralph Amissah

    - -

    SiSU manifest

    - - - - - - - - - - - - - - -

    description

    filename

    file size

    (kB)

    TOC linked table of contents (for segmented HTML text)

    toc.html

    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/toc.html>

    19.9

    Full Text full length document (HTML)

    doc.html

    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/doc.html>

    187.7

    PDF portrait full length document (PDF portrait / vertical - recommended for printing)

    portrait.pdf

    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/portrait.pdf>

    214.1

    PDF landscape full length document (PDF landscape / horizontal - recommended for screen viewing)

    landscape.pdf

    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/landscape.pdf>

    209.7

    ODF/ODT full length document (ODF:ODT - Open Document Format)

    opendocument.odt

    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/opendocument.odt>

    27.8

    full length document (XHTML)

    scroll.xhtml

    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/scroll.xhtml>

    90.3

    full length document (XML SAX)

    sax.xml

    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/sax.xml>

    92.9

    full length document (XML DOM)

    dom.xml

    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/dom.xml>

    100.0

    full length document (plaintext Unix (UTF-8) with footnotes)

    plain.txt

    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/plain.txt>

    59.0

    Concordance file (HTML - wordmap, alphabetical wordlist)

    concordance.html

    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/concordance.html>

    322.0

    Digest/DCC - Document Content Certificate (md5)

    digest.txt

    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/digest.txt>

    31.2

    - - -

    Markup (SiSU source)

    sisu_description.sst   b89ccdad9f6d9c2260d8d383d6b35ccc
    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/sisu_description.sst>

    53.4

    -

    SiSU doc (zip)

    sisupod.zip   5e579282a37a5b125996e11b95c888ab
    <http://www.jus.uio.no/sisu/sisu_manual/sisu_description/sisupod.zip>

    20.1

    -

    SiSU manifest of document metadata

    - - - - - - - - - - - - - - - - - - - - - - - -

    metadata

    description

    DC Title:

    SiSU - Description

    Title:

    SiSU

    Subtitle:

    Description

    DC Creator:

    Ralph Amissah

    DC Publisher:

    SiSU http://www.jus.uio.no/sisu (this copy)

    DC Subject:

    ebook, epublishing, electronic book, electronic publishing, electronic document, electronic citation, data structure, citation systems, search

    DC Type:

    information

    DC Rights:

    Copyright (C) Ralph Amissah 2007, part of SiSU documentation, License GPL 3

    DC Date:

    2007-08-30

    DC Date created:

    2002-11-12

    DC Date issued:

    2002-11-12

    DC Date available:

    2002-11-12

    DC Date modified:

    2007-08-30

    DC Language:

    English

    DC Identifier:

    http://www.jus.uio.no/sisu/sisu_manual/sisu_description/toc.html

    Sourcefile:

    sisu_description.sst

    Word Count approximate:

    7681

    Sourcefile Digest (md5):

    b89ccdad9f6d9c2260d8d383d6b35ccc

    Document (metaverse) last generated:

    Tue Sep 25 02:54:06 +0100 2007

    Generated by:

    SiSU 0.59.1 2007w39/2 (2007-09-25)

    Ruby version:

    ruby 1.8.6 (2007-06-07 patchlevel 36) [i486-linux]

    - - - - - - - - - - - - -

    metadata suggested links:

    SiSU Manual

      <http://www.jus.uio.no/sisu/sisu_manual/>

    Book Samples and Markup Examples

      <http://www.jus.uio.no/sisu/SiSU/2.html>

    SiSU @ Wikipedia

      <http://en.wikipedia.org/wiki/SiSU>

    SiSU @ Freshmeat

      <http://freshmeat.net/projects/sisu/>

    SiSU @ Ruby Application Archive

      <http://raa.ruby-lang.org/project/sisu/>

    SiSU @ Debian

      <http://packages.qa.debian.org/s/sisu.html>

    SiSU Download

      <http://www.jus.uio.no/sisu/SiSU/download.html>

    SiSU Changelog

      <http://www.jus.uio.no/sisu/SiSU/changelog.html>

    SiSU help

      <http://www.jus.uio.no/sisu/sisu_manual/sisu_help/>

    SiSU help sources

      <http://www.jus.uio.no/sisu/sisu_manual/sisu_help_sources/>

    -

    Document Language Versions, manifests

    - - - -

    filename

    description

     

    - - - - - - - - -
    - -
    -

    - - SiSU - -

    -
    - -

    - Output generated by - - SiSU - - 0.59.1 2007-09-25 (2007w39/2) -
    - - SiSU - - Copyright © Ralph Amissah - 1997, current 2007. - All Rights Reserved. -
    - - SiSU - - is software for document structuring, publishing and search, -
    - - www.jus.uio.no/sisu - - and - - www.sisudoc.org - -
    - w3 since October 3 1993 - - ralph@amissah.com - -

    -
    -

    - SiSU using: -
    Standard SiSU markup syntax, -
    Standard SiSU meta-markup syntax, and the -
    Standard SiSU object citation numbering and system, (object/text positioning system) -
    - Copyright © Ralph Amissah 1997, current 2007. - All Rights Reserved. -

    -
    -

    - - GPLv3 - -

    -
    -

    - - SiSU - - is released under - GPLv3 - or later, - <http://www.gnu.org/licenses/gpl.html> -

    -
    -

    - SiSU, developed using - - Ruby - - on - - Debian/Gnu/Linux - - software infrastructure, - with the usual GPL (or OSS) suspects. -
    - Better - "performance, reliability, scalability, security & total cost of ownership" - [not to mention flexibility & choice] use of and adherence to open standards (where practical and fair) and it is software libre. -
    - Get With the Future - - Way Better! - -

    -
    - - diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/sisupod.zip b/data/doc/manuals_generated/sisu_manual/sisu_description/sisupod.zip deleted file mode 100644 index 060565dd..00000000 Binary files a/data/doc/manuals_generated/sisu_manual/sisu_description/sisupod.zip and /dev/null differ diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/sitemap.xml b/data/doc/manuals_generated/sisu_manual/sisu_description/sitemap.xml deleted file mode 100644 index c842adac..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/sitemap.xml +++ /dev/null @@ -1,30 +0,0 @@ - - - - - - http://www.jus.uio.no/sisu/sisu_manual/sisu_description/toc.html - 2007-08-30 - monthly - 0.7 - - - http://www.jus.uio.no/sisu/sisu_manual/sisu_description/doc.html - 2007-08-30 - 0.5 - - - http://www.jus.uio.no/sisu/sisu_manual/sisu_description/sisu_manifest.html - 2007-08-30 - 0.5 - - diff --git a/data/doc/manuals_generated/sisu_manual/sisu_description/toc.html b/data/doc/manuals_generated/sisu_manual/sisu_description/toc.html deleted file mode 100644 index dcdd4375..00000000 --- a/data/doc/manuals_generated/sisu_manual/sisu_description/toc.html +++ /dev/null @@ -1,704 +0,0 @@ - - - - - - SiSU - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
    - - - - - -
    - -
    SiSU -->
    -
    - - - - - - - - - - - - - - - -
    - - TOC linked  toc  - - - - - Full Text  scroll  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
    -
    -   - next >> -   -
    -

    - -

    - -
    - - -
    - -

    SiSU -

    - -

    Description -

    - -

    Ralph Amissah -

    - -copy @ - - SiSU - - -
    - - - -

    - SiSU - Description,
    Ralph Amissah -

    - -

    - SiSU an attempt to describe -

    - -

    - - 1. Description - -

    - -
    - - 1.1 Outline - -
    - -
    - - 1.2 Short summary of features - -
    - -
    - - 1.3 How it works - -
    - -
    - - 1.4 Simple markup - -
    - -
    - - 1.4.1 Sparse markup requirement, try to get the most out of markup - -
    - -
    - - 1.4.2 Single markup file provides multiple output formats - -
    - -
    - - 1.4.3 Syntax relatively easy to read and remember - -
    - -
    - - 1.4.4 Kept simple by having a limited publishing feature set, and features identified as most important, are available across several document types - -
    - -
    - - 1.5 Designed with usability in mind - -
    - -
    - - 1.6 Code separate from content - -
    - -
    - - 1.7 Object citation numbering, a text or object positioning / citation system - "paragraph" (or text object) numbering, that remains same and usable across all output formats by people and machine - -
    - -
    - - 1.8 Handling of Dublin Core meta-tags making use of the Resource Description Framework - -
    - -
    - - 1.9 Easy directory management - -
    - -
    - - 1.10 Document Version Control Information - -
    - -
    - - 1.11 Table of contents - -
    - -
    - - 1.12 Auto-numbering of headings - -
    - -
    - - 1.13 Numbering and cross-hyperlinking of endnotes - -
    - -
    - - 1.14 "Skinnable" - -
    - -
    - - 1.15 Multiple Outputs - -
    - -
    - - 1.15.1 html - several presentations: full length & segmented; css & table based - -
    - -
    - - 1.15.2 XML - -
    - -
    - - 1.15.3 ODT:ODF, Open Document Format - ISO/IEC 26300:2006 - -
    - -
    - - 1.15.4 PDF - portrait and landscape, (through the generation of LaTeX output which is then transformed to pdf) - -
    - -
    - - 1.15.5 Search - loading/populating of relational database while retaining document structure information, object citation numbering and other features (currently PostgreSQL and/or SQLite) - -
    - -
    - - 1.15.6 Search - database frontend sample, utilising database and SiSU features, including object citation numbering (backend currently PostgreSQL) - -
    - -
    - - 1.15.7 Other forms - -
    - -
    - - 1.16 Concordance / Word Map or rudimentary index - -
    - -
    - - 1.17 Managed (document) directory, database, or site structure - -
    - -
    - - 1.18 Batch processing - -
    - -
    - - 1.19 Integration to superior Gnu/Linux and Unix tools - -
    - -
    - - 1.19.1 Backup and version control - -
    - -
    - - 1.19.2 Editor support - -
    - -
    - - 1.20 Modular design, need something new add a module - -
    - -

    - - Endnotes - -

    - - - Document Information (metadata) - - -

    Rights: Copyright ©  Ralph Amissah 2007, part of SiSU documentation, License GPL 3

    -

    - -

    - - - - - -
    - -
    SiSU -->
    -
    - - - - - - - - - - - - - - - -
    - - TOC linked  toc  - - - - - Full Text  scroll  - - - - - - PDF portrait -   pdf  - - - - - - PDF landscape -   pdf  - - - - - - ODF/ODT  odt  - - - - - - -   A-Z  - - - - - - Document Manifest  - - - - - @ -
    -
    -   - next >> -   -
    -

    - -

     

    - - - - - - - - -
    - -
    -

    - - SiSU - -

    -
    - -

    - Output generated by - - SiSU - - 0.59.1 2007-09-25 (2007w39/2) -
    - - SiSU - - Copyright © Ralph Amissah - 1997, current 2007. - All Rights Reserved. -
    - - SiSU - - is software for document structuring, publishing and search, -
    - - www.jus.uio.no/sisu - - and - - www.sisudoc.org - -
    - w3 since October 3 1993 - - ralph@amissah.com - -

    -
    -

    - SiSU using: -
    Standard SiSU markup syntax, -
    Standard SiSU meta-markup syntax, and the -
    Standard SiSU object citation numbering and system, (object/text positioning system) -
    - Copyright © Ralph Amissah 1997, current 2007. - All Rights Reserved. -

    -
    -

    - - GPLv3 - -

    -
    -

    - - SiSU - - is released under - GPLv3 - or later, - <http://www.gnu.org/licenses/gpl.html> -

    -
    -

    - SiSU, developed using - - Ruby - - on - - Debian/Gnu/Linux - - software infrastructure, - with the usual GPL (or OSS) suspects. -
    - Better - "performance, reliability, scalability, security & total cost of ownership" - [not to mention flexibility & choice] use of and adherence to open standards (where practical and fair) and it is software libre. -
    - Get With the Future - - Way Better! - -

    -
    - - - - - - - -
    - - - - - - -- cgit v1.2.3