Hello all! Today I've downloaded all Amiga, Commodore-Amiga and Escom patents that Mark sought from IBM's server earlier. I'll probably leave the Commodore patents (which are probably the most interesting ones) for next Sunday. Downloading the files was a bit tricky, since IBM has removed the undocumented possibility of downloading the master TIFF file of which the GIF images will be generated. So I had to use the "for" command of Bash combined with "lynx -dump" and "lynx -sourcedump" to download the scanned pages as GIF files. I thought that I would ask your help in transforming the HTML files to something more readable and in composing the GIF files back to multi-page TIFF files. Mark already promised to help me in the latter task, but someone should write a Perl script that strips the HTML mark-up from the abstract texts and the claim lists. Those files should be converted to plain text. After the conversions are finished, I can move the patent documents to ftp.funet.fi. There should be no copyright problems, since patent documents are public. If you can assist me with the project, contact me so that the technical details can be arranged. I'm looking for a Perl script solution to the conversion problem, and I would write the script myself if I had time. Marko - This message was sent through the cbm-hackers mailing list. To unsubscribe: echo unsubscribe | mail cbm-hackers-request@dot.tcm.hut.fi.
Archive generated by hypermail 2.1.1.