HTML2TXT Version 1.0
Copyright (C) Thomas Sahlin 1997
HTML2TXT can convert any HTML (HyperText Markup Language) file into a
standard ASCII text file. The text file can then be imported by most
text editors, word processors, DTP packages etc.
Once you have selected a HTML file you wish to convert, HTML2TXT will
load in the file, remove all tags, and format the output text file to
match the original HTML file.
This version of HTML2TXT does not support tables correctly, nor does it
support numbered or nested lists, however it does support
centered text like this,
right justified text like this,
indented text like this,
pre-formated
text
like
this,
- and basic
- lists
- like this.
The text file you're reading right now was actually created using
HTML2TXT! The "HTML2TXT.HTM" is the original HTML file behind
this text file.
If you have CAB you could try and convert the "EXAMPLE.HTM"
file to get a clear picture of what HTML2TXT can and can not do.
HTML2TXT will not convert each and every character from ISO Latin-1
to match the Atari character set, however it supports most of the
special characters that are constructed using the & and ;
characters. For example É becomes É and
é becomes é, this is in most cases enough.
See Appendix A for all supported special characters.
Step by step guide to converting a HTML file
- Double-klick on the "HTML2TXT.PRG" file. The HTML2TXT
window appears and it asks you to press any key, do so.
- Now choose a HTML file you wish to convert from the file-selector
and press OK, HTML files have the suffix "HTM".
- The file you selected has now been added to the batch list, you can
select another one by pressing <SPACE>, or begin the conversion
by pressing <ENTER>. Press <ENTER>.
- HTML2TXT will now ask you how many columns you want the text file to
be formated for (this is the max number of characters one row can hold).
Type in your preferred value and press <ENTER>.
- You will now be given the question if discreet mode is to be used,
if you're using a preemptive multitasking system like MagiC, this option
could be useful, this is also the closest to GEM compatibility you can get
with HTML2TXT. You can test this later, for now press <N>.
- Now you can watch the files you've selected earlier be loaded in and
converted one by one. Pressing <UNDO> or <ESC> will abort
the conversion. HTML2TXT will make some noice to let you now when the
conversion is completed.
- Now press any key to quit HTML2TXT. Check the "HTML2TXT.LOG"
file for details on how the conversion proceeded. It should be located
in the destination directory you selected.
Installation
HTML2TXT does not require any installation, just put the program file were
ever you think it's appropriate to keep it.
Distribution
HTML2TXT is FREEWARE, this means that you should feel free to spread the
program as much as you like under the condition that you do not charge
anyone for it, and that you make sure to pass this manual along with
the program file.
You are not allowed to make any changes to the program or any of it's
contents.
If you would like contact me, if you have any complains about the software,
or if you simply want to keep your local post-office in business, you can
send me a letter at:
Thomas Sahlin
Myrstigen 20
151 60 S”dert„lje
SWEDEN
Appendix A - special characters that are supported by
HTML2TXT
In alphabetical order:
Capital characters:
À | À
|
Æ | Æ
|
Å | Å
|
à | Ã
|
Ä | Ä
|
Ç | Ç
|
É | É
|
Ñ | Ñ
|
Ø | Ø
|
Õ | Õ
|
Ö | Ö
|
Þ | Þ
|
Ü | Ü
|
Small characters:
á | á
|
â | â
|
æ | æ
|
à | à
|
ã | ã
|
ä | ä
|
ç | ç
|
é | é
|
ê | ê
|
è | è
|
ð | ð
|
ë | ë
|
í | í
|
î | î
|
ì | ì
|
ï | ï
|
ñ | ñ
|
ó | ó
|
ô | ô
|
ò | ò
|
ø | ø
|
õ | õ
|
ö | ö
|
ß | ß
|
þ | þ
|
ú | ú
|
û | û
|
ù | ù
|
ü | ü
|
ÿ | ÿ
|
Numeric and special graphic characters:
& | &
|
> | >
|
< | <
|
" | "
|
It's also possible to state the ASCII character number.
Example: ! becomes ! and " becomes ".