SiSU -->
[ document manifest ]
<< previous TOC next >>
< ^ >

SiSU - SiSU information Structuring Universe - Structured information, Serialized Units,
Ralph Amissah

Structured information, Serialized Units

SiSU - from less markup than the most elementary equivalent html, you can have more

1. Description

1.1 Outline
1.2 Short summary of features
1.3 How it works
1.4 Simple markup
1.4.1 Sparse markup requirement, try to get the most out of markup
1.4.2 Single markup file provides multiple output formats
1.4.3 Syntax relatively easy to read and remember
1.4.4 Kept simple by having a limited publishing feature set, and features identified as most important, are available across several document types
1.5 Designed with usability in mind
1.6 Code separate from content
1.7 Object citation numbering, a text or object positioning / citation system - "paragraph" (or text object) numbering, that remains same and usable across all output formats by people and machine
1.8 Handling of Dublin Core meta-tags making use of the Resource Description Framework
1.9 Easy directory management
1.10 Document Version Control Information
1.11 Table of contents
1.12 Auto-numbering of headings
1.13 Numbering and cross-hyperlinking of endnotes
1.14 "Skinnable"
1.15 Multiple Outputs
1.15.1 html - several presentations: full length & segmented; css & table based
1.15.2 EPUB
1.15.3 XML
1.15.4 ODT:ODF, Open Document Format - ISO/IEC 26300:2006
1.15.5 PDF - portrait and landscape, (through the generation of LaTeX output which is then transformed to pdf)
1.15.6 Search - loading/populating of relational database while retaining document structure information, object citation numbering and other features (currently PostgreSQL and/or SQLite)
1.15.7 Search - database frontend sample, utilising database and SiSU features, including object citation numbering (backend currently PostgreSQL)
1.15.8 Other forms
1.16 Concordance / Word Map or rudimentary index
1.17 Managed (document) directory, database, or site structure
1.18 Batch processing
1.19 Integration to superior Gnu/Linux and Unix tools
1.19.1 Backup and version control
1.19.2 Editor support
1.20 Modular design, need something new add a module

2. Markup and Output Examples

2.1 Markup examples
2.2 A few book (and other) examples
2.2.1 "Viral Spiral", David Bollier
"The Wealth of Networks", Yochai Benkler
"Two Bits", Christopher Kelty
"Free Culture", Lawrence Lessig
"CONTENT", Cory Doctorow
"Democratizing Innovation", by Eric von Hippel
"Free as in Freedom: Richard Stallman's Crusade for Free Software", by Sam Williams
"Free For All: How Linux and the Free Software Movement Undercut the High Tech Titans", by Peter Wayner
"The Cathedral and the Bazaar", by Eric S. Raymond
"Down and out in the Magic Kingdom", Cory Doctorow
"Little Brother", Cory Doctorow
"For the Win", Cory Doctorow
"Accelerando", Charles Stross
"Tainaron", Leena Krohn
"Sphinx or Robot", Leena Krohn
"War and Peace", Leo Tolstoy, PG Etext 2600
"Don Quixote", Miguel de Cervantes [Saavedra], translated by John Ormsby, PG Etext 996
"Gulliver's Travels", Jonathan Swift, transcribed from the 1892 George Bell and Sons edition by David Price, PG Etext 829
"Alice's Adventures in Wonderland", Lewis Carroll, PG Etext 11
"Through The Looking-Glass", Lewis Carroll, PG Etext 12
"Alice's Adventures in Wonderland" and "Through The Looking-Glass", Lewis Carroll, PG Etexts 11 and 12
"Gnu Public License 2", (GPL 2) Free Software Foundation
"Gnu Public License v3 - Third discussion draft", (GPLv3) Free Software Foundation
"Debian Social Contract"
"Debian Constitution v1.3", (simple/default markup)
"Debian Constitution v1.3", (markup adjusted for output to more closely match the original)
"Debian Constitution v1.2", (simple/default markup)
"Debian Constitution v1.2", (markup adjusted for output to more closely match the original)
"A Uniform Sales Terminology", Vikki Rogers and Albert Kritzer
"The Autonomous Contract" 1997 - markup sample
"The Autonomous Contract Revisited" - markup sample
"United Nations Convention on Contracts for the International Sale of Goods"
/PECL/ the "Principles of European Contract Law"
2.3 SQL - PostgreSQL, SQLite
2.4 Lex Mercatoria as an example
2.5 For good measure the markup for a document with lots of (simple) tables
2.6 And a link to the output of a reported case

3. A Checklist of Output Features

4. Introduction to SiSU Markup  114 

4.1 Summary
4.2 Markup Examples
4.2.1 Online
4.2.2 Installed

5. Markup of Headers

5.1 Sample Header
5.2 Available Headers

6. Markup of Substantive Text

6.1 Heading Levels
6.2 Font Attributes
6.3 Indentation and bullets
6.4 Footnotes / Endnotes
6.5 Links
6.5.1 Naked URLs within text, dealing with urls
6.5.2 Linking Text
6.5.3 Linking Images
6.6 Grouped Text
6.6.1 Tables
6.6.2 Poem
6.6.3 Group
6.6.4 Code
6.7 Book index

7. Composite documents markup

Markup Syntax History

8. Notes related to Files-types and Markup Syntax

9. Commands Summary

9.1 Description
9.2 Document Processing Command Flags

10. command line modifiers

11. database commands

12. Shortcuts, Shorthand for multiple flags

12.1 Command Line with Flags - Batch Processing

Technical Information

13. Technical notes

13.1 See abandoned U.S. Provisional Patent Application

14. Diagram / Chart

14.1 The Chart
14.2 I/O
14.3 The Program
14.4 Software utilised
14.4.1 SiSU
14.4.2 SiSU Modules

15. SiSU development environment and technologies of interest, including data formats

15.1 Development environment, Debian
15.2 Programming language, Ruby
15.3 SGML & XML Family
15.3.1 SGML
15.3.2 XML Family
15.4 TeX Family
15.5 Pdf
15.6 Relational Databases, SQL
15.7 Other Databases
15.8 Text Search
15.9 Character Encoding, Unicode
15.10 Information Visualization
15.11 Metadata - semantic
15.12 Syndication, Web feed formats
15.13 Other
15.14 Editors
15.15 Version Control
15.16 Licenses

A Summary of notable events

16. A history of SiSU and its outputs including search

A Chronological history of developments on SiSU

1993

1994

1995

1996

1997

1998

1999

2000

2001

2002

2003

January
February
March
April
June
July
August
September
November
December

2004

January
February
March
April
May
June
July
August
September
October
November
December

2005

January
February
March
April
May
June
July
August
September
October
November
December

2006

January
February
March
April
May
June
July
August
September
October
November
December

2007

January
February
March
April
May
June
July
August
September
November
December

2008

January
February
April
June
September
October
November
December

2009

January
December

2010

March

2010

March

FAQ, Howto, Installation, etc.

HowTo

17. Getting Help

17.1 SiSU "man" pages
17.2 SiSU built-in help
17.3 Command Line with Flags - Batch Processing

18. Setup, initialisation

18.1 initialise output directory
18.1.1 Use of search functionality, an example using sqlite
18.2 misc
18.2.1 url for output files -u -U
18.2.2 toggle screen color
18.2.3 verbose mode
18.2.4 quiet mode
18.2.5 maintenance mode intermediate files kept -M
18.2.6 start the webrick server
18.3 remote placement of output

19. Configuration Files

20. Markup

20.1 Headers
20.2 Font Face
20.2.1 Bold
20.2.2 Italics
20.2.3 Underscore
20.2.4 Strikethrough
20.3 Endnotes
20.4 Links
20.5 Number Titles
20.6 Line operations
20.7 Tables
20.8 Grouped Text
20.9 Composite Document

21. Change Appearance

21.1 Skins
21.2 CSS

Extracts from the README

22. README

22.1 Online Information, places to look
22.2 Installation
22.2.1 Debian
22.2.2 RPM
22.2.3 Source package .tgz
22.2.4 to use setup.rb
22.2.5 to use install (prapared with "Rake")
22.2.6 to use install (prapared with "Rant")
22.3 Dependencies
22.4 Quick start
22.5 Configuration files
22.6 Use General Overview
22.7 Help
22.8 Directory Structure
22.9 Configuration File
22.10 Markup
22.11 Additional Things
22.12 License
22.13 SiSU Standard

Extracts from man 8 sisu

23. Post Installation Setup

23.1 Post Installation Setup - Quick start
23.2 Document markup directory
23.2.1 Configuration files
23.2.2 Debian INSTALLATION Note
23.2.3 Document Resource Configuration
23.2.4 Skins

24. FAQ - Frequently Asked/Answered Questions

24.1 Why are urls produced with the -v (and -u) flag that point to a web server on port 8081 ?
24.2 I cannot find my output, where is it?
24.3 I do not get any pdf output, why?
24.4 Where is the latex (or some other interim) output?
24.5 Why isn't SiSU markup XML
24.6 LaTeX claims to be a document preparation system for high-quality typesetting. Can the same be said about SiSU?
24.7 Can the SiSU markup be used to prepare for a LaTex automatic building of an index to the work?
24.8 Can the conversion from SiSU to LaTeX be modified if we have special needs for the LaTeX, or do we need to modify the LaTeX manually?
24.9 How do I create GIN or GiST index in Postgresql for use in SiSU
24.10 Are there some examples of using Ferret Search with a SiSU repository?
Have you had any reports of building SiSU from tar on Mac OS 10.4?
24.12 Where is version 1?
24.13 What is the difference between version 1 and 2?

Installation

25. Installation

25.1 Debian
25.2 Other Unix / Linux
25.2.1 source tarball

26. SiSU Components, Dependencies and Notes

26.1 sisu
26.2 sisu-complete
26.3 sisu-examples
26.4 sisu-pdf
26.5 sisu-postgresql
26.6 sisu-remote
26.7 sisu-sqlite

27. Quickstart - Getting Started Howto

27.1 Installation
27.1.1 Debian Installation
27.1.2 RPM Installation
27.1.3 Installation from source
27.2 Testing SiSU, generating output
27.2.1 basic text, plaintext, html, XML, ODF, EPUB
27.2.2 LaTeX / pdf
27.2.3 relational database - postgresql, sqlite
27.3 Getting Help
27.3.1 The man pages
27.3.2 Built in help
27.3.3 The home page
27.4 Markup Samples

28. SiSU Components, Dependencies and Notes

29. Breakage and Fixes

31st October 2006 - SiSU < 0.48.3 break against Ruby > 1.8.5-3, break on cyclic include; Fixed SiSU: >=0.48.3 (see notes)
21st September 2005 - Avoid ruby-1.8.3 (2005-09-21) and (2005-10-12), Ruby Segfaults; Fixed: later versions of Ruby (see notes)

License, Standard

30. License

31. Things SiSU Standard

Download information

Download information

32. Download SiSU - Linux/Unix

SiSU Current Version - Linux/Unix
Source (tarball tar.gz)
Git (source control management)
Debian
RPM

Changelog - sisu

33. SiSU Version Manifest / changelog

Current version
3.0
Previous versions
2.7
2.6
2.5
2.4
2.3
2.2
2.1
2.0
1.0
0.71
0.70
0.69
0.68
0.67
0.66
0.65
0.64
0.63
0.62
0.61
0.60
0.59
0.58
0.57
0.56
0.55
0.54
0.53
0.52
0.51
0.50
0.49
0.48
0.47
0.46
0.45
0.44
0.43
0.42
0.41
0.40
0.39
0.38
0.37
0.36
0.35
0.34
0.33
0.32
0.31
0.30
0.29
0.28
0.27
0.26
0.25
0.24
0.23
0.22
0.21
0.20
0.18
0.16
0.14
0.12
0.10
0.8
0.6
0.4
0.2
0.1
Release

Changelog - sisu-markup-samples

34. Version Manifest / changelog - SiSU Markup Samples

Current version
2.0
1.1
1.0

Method for providing digital documents including a common citation structure

[SiSU Provisional Patent Application of 2004 based on much older idea and work on SiSU, Abandoned]

The 'Invention' described (and diagrams) by Ralph Amissah.
Provisional patent application text prepared by Stephan Filipek of Winston & Strawn LLP

35. 1. Background

36. 2. Definitions

37. 3. Brief Descriptions of the Drawings

38. 4. Detailed Description of the Preferred Embodiments

39. 5. Document Processing, examples of subsequent steps

40. 6. Advantages of the Invention

41. 7. THE CLAIMS

Post Filing Appendix

42. Post Filing Appendix: Reasons for Abandonment of Patent Process Claim

Endnotes

Endnotes

Metadata

SiSU Metadata, document information

Manifest

SiSU Manifest, alternative outputs etc.

SiSU - SiSU information Structuring Universe - Structured information, Serialized Units,
Ralph Amissah

Structured information, Serialized Units

A Chronological history of developments on SiSU

2004

January
February
March
April
May
June
July
August
September
October
November
December

January

1st January 2004 Happy New Year.

February

7-8th February 2004 It appears the Ruby regular expression engine was replaced last night in Debian Sid (I update daily)  336  which resulted in SiSU to sputtering and made necessary an unexpected code updating of SiSU... no more sputtering SiSU is back in alignment, and (taking into account the possibility of a few remaining bugs as yet undetected) under control.

Around this time I saw AniMatrix ... my favourite movies have remained stable for a while, this perhaps makes the list. The Usual Suspects (clever, fun), 12 Monkeys (science fiction, dark, disturbing, it works), Being John Malkovich (illogical and extremely clever, a quirky combination that works), The Matrix   337 

10th February 2004 SiSU man page added (other forms of help are also provided).

25th February 2004 Bought and read "RELAX NG" by Eric van der Vlist.  338  Started "modelling" SiSU XML structures. Glad to have waited to look at XML.

March

4th March 2004 Bought March issue of the Linux Journal, issue 119 for articles on "Standard Office Document Format: Oasis"  339  and "Manipulating OOo Documents with Ruby" 340  Glad to have waited this long to look at XML.

4th March 2004 SiSU's online help improved, introduced SiSU help prompt (other forms of help are provided including a man page).

6th March 2004 SiSU generates well formed XML, first tests passed. A way to go... Glad to have waited this long to look into generating XML. XML output is likely to be modified. Will add some semantic XML markup options to the SiSU's input/markup syntax... (e.g. titles, authors etc. for books cited within text). Bear in mind, the system is to be kept uncomplicated.

7th March 2004 Added /darcs/  341  to Ratchet. Ratchet being a simple graphical front-end for repetitive backup and version control commit tasks.  342 

18th March 2004 Very easy to poll semantic data (Dublin Core info set) in SiSU marked up files to produce /RSS/ feeds, of for example all documents on a given subject, keyword, or by a given author, or whatever should be included in the feed. The feed is built based on your document set.

23nd March 2004 A current thumbnail and screenshot, some help output from SiSU and ratchet in its present form (if you go for the screenshot, it may be necessary to scroll right to see ratchet):


ratchet, in the right corner of the screen, and some help output from SiSU

March / April 2004 Ralph Amissah - Workshop on Advanced Structured Trade Finance, focusing on Legal Documentation of Trade Finance in Africa organised by the African Export Import Bank, part of panel on "Opportunities and Challenges in Documenting Trade Finance Deals in Africa".

April

3rd April 2004 Vim color syntax files, greys: slate  343  or dusk, and shades (also gold)

9th April 2004 ascii output... finally got round to producing acceptable stripped ascii output for SiSU

11th April 2004 "Free Culture", by Lawrence Lessig  344  (largely) marked up, a prelude to reading it (have not actually seen the book, it is not yet published in the UK). Marked up "Free For All", by Peter Wayner  345  back in 2002 (and emailed it to the author). Will place both these books as output and markup examples (and will offer these versions to the authors).

12th April 2004 As am testing SiSU markup requirements, and suitability for basic literature text selected a text from the Gutenberg Project, and did a transform. The text chosen: "War and Peace" by Leo Tolstoy, (it came to mind as a largish volume) which I shamefully confess not to have read. The markup transforms required are trivial. Of interest, in this instance I am saved by having alternative modes for marking up endnotes... as can do a simple search and replace to make suitable for SiSU using the older endnote markup style.

and the markup:  363 

Part of help output from SiSU ...


help output from SiSU and in the right corner of the screen ratchet

16th April 2004 Minor aesthetic improvements to ascii output. Some further cleanup of output required, but done with general presentation of ascii for now. (Should organise meta-data... semantic information at the end of the ascii file better). Starting to notice and read about several "smart ASCIIs", primarily for ascii to html conversion, reasonable looking ascii with minimal markup requirements for conversion to html (other formats) their goal.  364  Might make sure at some future date that SiSU ascii output is some form of (existing) valid Smart ascii.

17th April 2004 Bored with the old anagram, SiSU as "simple - information structuring unit", or "sisu is structuring unit"... how about "*semantic* and information structuring universe" ;-)  365  well fun anyway, will work a bit on minimalistic semantic ideas, but as they say KISS!  366  ok, "simple - information structuring universe" then.

24th April 2004 Bugfix, SiSU once again permits footnotes/endnotes in headings.  367 

Got my hands on a new edition of "Guide to LaTeX" 4th Edition, by Kopka, H. and Daly, P.W., 2004 Addison Wesley, ISBN 0-321-17385-6.  368 

28-29th April 2004 Having the new edition of Kopka and Daly gave me the excuse I needed to look at LaTeX output. I have used it surprisingly little, but it gave the motivation, and it is a good reference. Knowing that it is there provides some form of information management guarantee, I have the confidence to look at issues, knowing the book is on hand and is likely to provide support should I need it, most of the time it is not needed, but it would have been a waste of time to work on the issues if I had needed it. Lots of small, niggly outstanding issues sorted, things that irked me, like my indented text had too much space between paragraphs (book not needed), and the text within table columns did not wrap (book used). Still some cleaning to be done.

May

2nd May 2004 A host of subtle improvements to LaTeX/pdf output, from page breaking to cover page.

6th May 2004 /Ratchet's/ interface simplified by taking out /arch/ centricity. Have looked a bit more closely at /darcs/ as suggested by Ion's creator. /Ratchet/ is just a slapped together, custom built front-end for my version control and backup tasks, and it would not have been written if it had not been for my unfamiliarity with, and the relative complexity of /arch/ commands.  369 


ratchet on the right; the program text for ratchet in gvim (with my slate color scheme), on ion

wmi  370  the "vim of window managers" spotted in a discussion on the #ion channel. Yet another window manager, this one I like lots, I feel at home here.


ratchet to the right; the program text for ratchet in gvim (with my slate color scheme), on wmi

Ion  371  is my preferred window manager..., but wmi is tempting.

7th May 2004 /SiSU's/ syntax for the time being SiSU permits the other smart ascii markup style without curly brackets, for single words only. This leads to serious consideration of having multiple input syntaxes depending on document type, the current default suits a wide variety of documents from literature to law..., but a different subset of rules would be appropriate for technical writing.

Could not resist another screenshot, SiSU gVim programming work environment, and my /"slate"/ syntax highlighting color scheme  372 


SiSU project files, gvim on wmi


SiSU project files, gvim on ion

First read about /glark/  373  and switched to using it.  374 

June

5th June 2004 Batch conversion to SiSU file format (text markup) from MS Word97, using wvWare  375  and a some additional (Ruby) processing, works pretty well, does the initial processing. Potentially very useful...

"Ralph

Good to meet with you today, I was very impressed with your software.

[colleague's name]  377  - in summary - Ralph has built an application that runs on linux and takes ASCII documents and pulls them apart in to the smallest constituent parts, storing them as XML, PDF and HTML, the HTML are hyperlinked up so the document can be browsed in its full form. the format and text data created is stored in a database.

This has potential in any place that needs the power of full text search whilst holding the structural concepts of the document i.e. legal, pharma, education, research.. which ones we need to figure out, ..."  378 

Special interest was expressed in the search implications of SiSU. To paraphrase, the company has document management systems dealing with hundreds of thousands of texts, these tell you which documents match your search criteria, but cannot inform you where within a text these matches were found without opening the documents. This, is incidentally a feature was implemented a number of years earlier, and part of the even earlier design goal, made possible by powerful implications of the simple concept - object numbering (object citation numbering, ocn).

A follow-up was posted a year later see note on abandoned patent process posted on 5th June 2005   379 

20th June 2004 A note on future work directions, where effort is likely to go. *(a)* Filters, for conversion between SiSU markup and the subset of Open Office "markup" that are supported, giving the possibility to do most document preparation in Open office and possibly further to use SiSU capabilities from Open Office documents. *(b)* Greater effort on the database frills. The backend structural logic and working functionality is in place, but there are some loose ends to tie, also there is added functionality that should be provided using SiSU that is implicit from an SQL DB's design (and the logical structure of tables that are in use). The front end, needs work. *(c)* [not to forget April 17] Semantic tags within text, (for use in for example in XML conversions) keep simple.

June-July 2004 Much work on html conversion tools.

July

10th July 2004 Touched relational db  380  frontend example, works as expected.

Simple search, results with files in which search found, and locations where found within files.


SiSU a frontend to postgresql

Simple search, results with files in which search found, and text object (paragraph or endnote) where found within files.


SiSU a frontend postgresql

Searches can be complex booleans.

14th July 2004 (version control) Ended up making a console (tui) tool out of ratchet. With that the gui version becomes virtually redundant, as this is what I use instead.


Ratchet console/tui version

16th July 2004 Syntax changes for headers and headings.  381  Filenames also shortened, as the extra digit is not used. The name changes restart the cvs ids.  382 

August

9th August Feature freeze on development branch for next generation of code.

30th August - 30th September 9th 2004 Transition, a new generation of code. Primarily concerned with internals, and passing of information between objects. [lexmercatoria.org will continue to use previous generation code though]

September

8th September 2004 Output directory rules simplified, output directory mirrors source directory "stub" name. You would place all associated documents in a directory of a name that signifies this association, and the output directory would be to a sub-directory of the same name in the web serving directory.

A similar rule is used for the sql database into which files are placed.

9th September 2004

1. Directory file association, skins and special image management, made simpler.

The last part of the name of the work directory in which markup is being done, or rather from where SiSU is run in order to generate document output, is used in determining the sub-directory name for output files, that is created in the document output directory. This provides a rather easy way to associate documents e.g. of a given subject, or by owner.

  /www/docs
      /intellectual_property
      /arbitration
      /contract_law

  /www/docs
      /ralph
      /sisu

all are placed in their own directories within the directory structure created. Similar rules are used in the creation of sql type databases (though they can be overridden).

There are a couple of further associations with these directories.

Directory wide skins.

Directory specific images.

2. If there is a "directory skin", that is a skin of the same name as the directory, it is used in the generation of the documents within it, rather than the default skin, unless the document has a specific skin associated with it.

a. default skin (always available)

b. directory skin (precedence over default if exists)

c. document skin (takes precedence wherever document requests a specific skin)

Skins are defined in the document skin directory and if a directory association is desired a softlink made to the relevant skin. Skins (directory association auto load) auto load skin if a directory skin exists of same name as directory stub, (and there is no specific doc skin)

3. If the working directory has within it a sub-directory called image_local, the images within that directory are used for references to images, that are not part of the default site build.

14th September 2004 Difficulty deciding on which file extension to use for sisu text files, currently ".r", ".s", ".t", ".rs" (and ".lm") with the digits 0-3 following the letter, [.er is dropped].

28th September 2004 Further modification to file naming conventions, in future will use an underscore in preference to a dot for separation, e.g. eu_contract_principles_parts_1_to_3_2002.sst rather than eu.contract.principles.parts.1.to.3.2002.sst - old naming will also continue to exist.

October

1st October 2004 The second edition of "Programming Ruby" aka Pickaxe II - my (first) copy arrives, a pdf.  384  it is almost 4 years since the first edition of the Pickaxe,  385  made my considerable Perl book library obsolete, overnight.  386  The book original Pickaxe is pretty good, and available online, (the second edition is better still) but what was fantastic for me about the first Pickaxe which I was able to get my hands on in November 2000, was it was the gateway to Ruby (that I had been waiting for, since I had targeted evaluating the use of Ruby instead of Perl in February 2000, but had no wish to attempt to do that from scratch, i.e. without an English text on the subject).


Ruby takes over, November 2000

With Ruby's POLS,  387  that type of book collecting became redundant... and the gap between idea and implementation narrowed... and code became more reliable/dependable/maintainable and more easily "flexed"... beauty, elegance... Happy coding.  388 

Around this time, I am not sure exactly when I also stumbled upon and read "A Hacker Manifesto" by McKenzie Wark, as I recall an excellent Marxist based analysis of the struggle over intellectual property rights in software development.


A Hacker Manifesto, McKenzie Wark

  389 

November

13th November 2004 Ion window manager, style and the most powerful functionality  390  and Screen  391  - screenshot of my current window manager and "term multiplexor", computer interface:


Ion3 - window 7 of 10

A bit contrived as I would seldom need that much clutter in the float window (left side of monitor), but it gives the general idea, and of how my Ion frames (for window 7) and term /screen(s)/ are set up.

December

8th December 2004 Tentative steps on DB (postgresql) front-end, ie search form.

10th December 2004 Finally got to look at a copy of the second edition of "The LaTeX Companion" 392  Fantastic. I ordered a copy from Amazon on November 26th, along with several other books, for some reason (could it be Christmas) Amazon is taking ages over posting my order! [finally shipped December 22nd arrived December 30th]

12th December 2004 Front end for Postgresql  393  database looking pretty good. Much work on the front end to the database, and now that I finally get round to looking at it, I wonder what took me so long. One thing is the fact that from the technology perspective it is the back-end database and its structure that really matter. Front-end(s) can then be provided in any number of ways, (though being what an end user sees is no less important).

Several options for output - select database to search, show results in index view (links to locations within text), show results with text, echo search in form, show what was searched, create and show a "canned url" for search, show available search fields. Also shows counters number of documents in which found and number of locations within documents where found. [could consider sorting by document with most occurrences of the search result].

Other things not done, a back-end issue with Postgresql, the default field/cell maximum size set by Postgresql for indexing on VARCHARS, is too small, would have to set that differently before production use.

I am fairly pleased with it. This sampling on a small database demonstrates functionality (which could be extended further with current back-end). I will prune the images later. Click on thumbnail for larger image:


SiSU search frontend to Postgresql the form when empty


SiSU search frontend to Postgresql available fields


SiSU search frontend to Postgresql author and text fields


SiSU search frontend to Postgresql title and text search, index view


SiSU search frontend to Postgresql title and text search, index view


SiSU search frontend to Postgresql title and regex text search, index view


SiSU search frontend to Postgresql simple search, index view


SiSU search frontend to Postgresql regex search, index view


SiSU search frontend to Postgresql regex search, text view


SiSU search frontend to Postgresql regex search, title War and Peace, text view


SiSU search frontend to Postgresql simple search, title restriction, text view

20th December 2004 Change of revision control representation, using some recommendations from the RPA (Ruby Production Archive)  394  team. Appears in Document Information under generated by, at this time SiSU Sabaki 0.1.0-6 of 2004w51/1 (20041220)  395  Sabaki is a Japanese word that is used in the board game Go (Chinese Weichi, Korean Badok) to refer to shape, (the previous working name was Scribe).

31st December 2004 Look at license and copyright notices. Problem, have opposed interests, one being my preference for the GPL  396  with regard to program licensing, the other being to be able to maintain a standard for markup and for the object citation numbering system, and also have some recognition for what amounts to the results of my several years of research into electronic document publishing, as represented by SiSU and described in this document. I have ended up with a GPL license for the program, but claiming copyright over (i) the document markup syntax and meta-markup syntax, and more importantly, (ii) the object citation numbering system, a very simple idea that makes so much possible, from document format independent citation, to sql and other database search indexes, with citation references for matches. At present the license information reads as follows:

Generated by SiSU www.jus.uio.no/sisu/SiSU/ [ SiSU Sabaki 0.1.0-8 2004w51/4 ]

Using SiSU Standard markup and meta-markup syntax and SiSU's object citation numbering system © Ralph Amissah 1997, current 2004. All Rights Reserved.

SiSU is released under GPL 2 or later ( ‹http://www.gnu.org/copyleft/gpl.html› ) and is alternatively available under special license terms from Ralph Amissah.

I will take advice and recognise that it may have to be changed.


SiSU thanks Ruby and Gnu Debian Linux

Also start using more neutral icons. If I were not using Debian/Gnu/Linux Sid,  397  and satisfied with the package manager, range of applications and social contract,  398  I would probably be using one of the BSD's, Dragonfly  399  or FreeBSD.  400  Also I guess despite not judging the success of a project by its adoption, over promotion of a distro should not be placed as a disincentive for use on other Unix/Linux platforms.


SiSU Sabaki

End of year ToDo:

Release SiSU early 2005. Dual license, one of them "GPL 2 or later", other, reserved, proprietary.

Documentation. There is some, but the presentation is nowhere near as digestible as it should be.

Unicode UTF-8. The biggest single todo is Unicode processing. LaTeX and Postgresql support UTF-8 so that is what it is most likely to be. My excuse for not having looked at it yet ... need to date, and not having configured my environment for it. I do however recognise this as a need.

Redo sqlite module ... one of the more interesting to have. Get the Sqlite module working again. Similar to the Postgresql module, fell out of maintenance, when I found Sqlite to be a bit of a pain to install on Debian, (and was prioritising Postgresql), once upon a time the modules were in sync, and I hope to have them that way again someday.

Code cleaning. Much code cleaning ... this project has developed over several years, and there have been many changes in how things are done, without rigorous removal of dead code.

Simplify Installation, and test across other Unix and Gnu/Linux platforms. (Mac OSX too? I wonder...)

Object Citation Numbering for non-substantive objects. OCN is currently done only for substantive text and other objects (such as images), a secondary numbering will eventually be implemented for non-substantive items.

Images. Decide what to do with images and tables in XML and in relational database.

Marshalled/PStored Metaverse. As an alternative (not replacement) to the current ordinary text based SiSU meta-markup state.

Additional Syntax hi-lighters. The current syntax hi-lighter, and folds are for vim. Additional syntax highlighters for SiSU markup would be extremely welcome, they don't need to be as complete as the vim highligter. Emacs would obviously be nice, but the ruby editors, and less geeky editors are of much interest. Not sure that I will do this, after all I do use Vim, we'll see.

Take your time, "release" apart, no idea when any of this can/will be done. I love it, it is great for what it does.




 336. ruby 1.8.1 (2004-02-03) [i386-linux]

 337. I, watched II it was such a disappointment I did not see III

 338. RELAX NG by Eric van der Vlist, O'Reilly isbn 0596004214. Online version at

http://books.xmlschemata.org/relaxng/› or

http://books.xmlschemata.org/relaxng/page2.html

The home page for RELAX NG is ‹http://www.relaxng.org/› orignally designed by Murata Makoto and James Clark and now by the RELAX NG technical committee of OASIS

 339. Linux Journal, 119, March 2004, pg 64. Online at

http://www.linuxjournal.com/article.php?sid=7159

 340. Linux Journal, 119, March 2004, pg 78. Online at

http://www.linuxjournal.com/article.php?sid=7236

 341.http://www.abridgegame.org/darcs/

 342.http://localhost/reserved/sisu/1#h1.18.1› a bit on ratchet

 343. screenshots: ‹http://www.jus.uio.no/sisu/_sisu/image_local/snap_gvim_wmi200405.png

http://www.jus.uio.no/sisu/_sisu/image_local/snap_sisu_sample_text_gvim200405.png

 344.http://www.free-culture.cc/

 345.http://www.wayner.org/books/ffa/

 346.http://www.jus.uio.no/sisu/free_for_all.peter_wayner

http://www.jus.uio.no/sisu/free_for_all.peter_wayner/toc.html› (the same)

 347.http://www.jus.uio.no/sisu/free_for_all.peter_wayner/portrait.pdf

 348.http://www.jus.uio.no/sisu/free_for_all.peter_wayner/landscape.pdf

 349.http://www.jus.uio.no/sisu/free_for_all.peter_wayner/plain.txt

 350.http://www.jus.uio.no/sisu/free_for_all.peter_wayner/sax.xml

 351.http://www.jus.uio.no/sisu/free_for_all.peter_wayner/dom.xml

 352.http://www.jus.uio.no/sisu/free_for_all.peter_wayner/concordance.html

 353.http://www.jus.uio.no/sisu/free_culture.lawrence_lessig

 354.http://www.jus.uio.no/sisu/free_culture.lawrence_lessig/portrait.pdf

 355.http://www.jus.uio.no/sisu/free_culture.lawrence_lessig/landscape.pdf

 356.http://www.jus.uio.no/sisu/free_culture.lawrence_lessig/plain.txt

 357.http://www.jus.uio.no/sisu/free_culture.lawrence_lessig/sax.xml

 358.http://www.jus.uio.no/sisu/free_culture.lawrence_lessig/concordance.html

 359.http://www.jus.uio.no/sisu/war_and_peace.leo_tolstoy

 360.http://www.jus.uio.no/sisu/war_and_peace.leo_tolstoy/portrait.pdf

 361.http://www.jus.uio.no/sisu/war_and_peace.leo_tolstoy/landscape.pdf

 362.http://www.jus.uio.no/sisu/war_and_peace.leo_tolstoy/plain.txt› ascii text is produced either with footnotes following paragraph that contains them, or with endnotes at the end of text under each section/"chapter" heading.

 363. I did a stylesheet/skin for the Gutenberg Project, ... I may have to remove. The markup transforms required are trivial. Of interest, in this instance I am saved by having alternative syntaxes/(structural modes) for marking up endnotes... as it is possible to do a simple search and replace to make Project Gutenberg ascii presentations suitable for SiSU using the older endnote markup style. There is very little markup required after the document headers and Project Gutenberg legal notices. As I presume the legal notices are similar in Project Gutenberg documents, (and I could not bear to think of preparing the same legal notices twice), I moved those to the "skin" for the Project, and these are now represented in the markup by the <:insert1> and <:insert2> markers and the legal notices are available for similar insertion into the next Project Gutenberg text prepared for SiSU, should there be one.

 364.http://webseitz.fluxent.com/wiki/SmartAscii› Smart ascii

http://www.textism.com/tools/textile/› Textile by Mark Pilgrim

http://www.whytheluckystiff.net/ruby/redcloth/› /RedCloth/ (Ruby Textile implementation, by _why)

http://daringfireball.net/projects/markdown/› Markdown by John Gruber

http://bluecloth.rubyforge.org/› /BlueCloth/ (Ruby Markdown implementation, by Michael Granger and the FaerieMUD Consortium)

 365. A little universe it may be, but semantic you may have a hard time getting away with, given the meaning the word has taken on with markup. On a document wide basis semantic information may be provided, which can be really useful, (and meaningful, especially) if you have a large document set, and use this with rss feeds or in an sql database etc. On a markup level, I have little inclination to add semantic markup formally beyond references, title, author [Dublin Core entities? addresses?] etc. Actually this deserves a bit of thought possibly use letter tags (including letter alias/synonyms for font faces) to create a small set of default semantic tags, with the possibility for per document adjustments. Will seek to permit XML entity tagging, within SiSU markup and have that ignored/removed by the parts of the program that have no use for it.

 366. keep it simple stupid

 367. A report on a document not behaving as it should led to the discovery of this problem, too much effort to discover exactly when it must have arisen.

 368. From which I understand, amongst other things, that the LaTeX to pdf package (used by SiSU), hyperref, by Sebastian Rahtz is now maintained by Heiko Oberdiek.

 369.http://localhost/reserved/sisu/1#h1.18.1› a bit on ratchet

 370.http://wmi.berlios.de/

 371.http://iki.fi/tuomov/ion/

 372. slate of April 3rd 2004

 373.http://glark.sourceforge.net/› mentioned in comp.lang.ruby newsgroup

 374. so in the screenshot the grep tla help command (with the search tuned and expanded as required) should rather be:

tla help | glark -N -B 0 -A 0 "*\s+"

 375.http://wvware.sourceforge.net/› ‹http://sourceforge.net/projects/wvware

 376. meeting with an IBM Software Innovations evaluator, IBM, London

This meeting came about through a chance encounter with an IBM manager at a Linux Expo, who was curious about my interest in Gnu/Linux with my legal background... on hearing that I also wrote software, he suggested, maybe IBM should have a look at it. I was interested, the meeting was set up...

 377. also posted to an IBM colleague

 378. Software Innovations evaluator, IBM, London

 379.http://www.jus.uio.no/sisu/SiSU/2005.html#ppa

 380. July 2002

 381.http://www.jus.uio.no/sisu/sisu_markup_table/doc.html#h200306› updated to include header and heading changes

 382. SiSU markup syntax, an incomplete summary: ‹http://www.jus.uio.no/sisu/sisu_markup_table/doc.html#h200306

Visual check of elementary font face modifiers: *bold* bold emphasis italics underscore strikethrough superscript subscript

 383.http://www.jus.uio.no/sisu/diagram/sisu_provisional_patent_application_diagram_200408.pdf

MD5(sisu_provisional_patent_application_diagram_200408.pdf)= 1cd98065251fa421e1b722bd7a7a78c2

 384.http://www.pragmaticprogrammer.com/titles/ruby/"Programming Ruby" aka Pickaxe II, Dave Thomas and Andrew Hunt

 385.http://www.ruby-doc.org/docs/ProgrammingRuby/› text available online

 386. Regular expressions (pattern matching) are probably the biggest single thing that came and stayed with me from Perl.

 387. "principle of least surprise" to Matz, Ruby's creator author.

 388.http://www.ruby-lang.org/en/› Ruby

 389. "A Hacker Manifesto", McKenzie Wark

http://www.amazon.com/exec/obidos/ASIN/0674015436

 390.http://iki.fi/tuomov/ion/› Ion "... search for a graphical usable interface"

 391.http://c2.com/cgi/wiki?ScreenMultiplexor

http://packages.debian.org/cgi-bin/search_packages.pl?searchon=names&version=all&exact=1&keywords=screen

 392. "The LaTeX Companion" 2nd ed. by Frank Mittelbach, Michel Goossens, Johannes Braams, David Carlisle, Chris Rowley, 2004 ISBN: 0201362996

 393.http://www.postgresql.org/

http://advocacy.postgresql.org/

http://en.wikipedia.org/wiki/Postgresql

 394.http://rpa-base.rubyforge.org/wiki/wiki.cgi?

 395. previous string would have been from arch tla revision control.

 396.http://www.gnu.org/copyleft/gpl.html

 397.http://www.debian.org/

 398.http://www.debian.org/social_contract

 399.http://www.dragonflybsd.org/main/

 400.http://www.freebsd.org/


[ document manifest ]
<< previous TOC next >>
< ^ >



SiSU


Viral Spiral - How the Commoners Built a Digital Republic of Their Own

David Bollier

2009


The Wealth of Networks - How Social Production Transforms Markets and Freedom

Yochai Benkler

2006


Free Culture - How Big Media Uses Technology and the Law to Lock Down Culture and Control Creativity

Lawrence Lessig

2004


CONTENT - Selected Essays on Technology, Creativity, Copyright and the Future of the Future

Cory Doctorow

2008


Democratizing Innovation

Eric von Hippel

2005


Free As In Freedom - Richard Stallman's Crusade for Free Software

Sam Williams

2002


Two Bits - The Cultural Significance of Free Software

Christopher Kelty

2008


Free For All - How Linux and the Free Software Movement Undercut the High Tech Titans

Peter Wayner

2002


The Cathedral & the Bazaar - Musings on Linux and Open Source by an Accidental Revolutionary

Erik S. Raymond

1999


Little Brother

Cory Doctorow

2008


Down and Out in the Magic Kingdom

Cory Doctorow

2003


For the Win

Cory Doctorow

2008


Free Software Foundation - FSF