From billlinux at rogers.com Fri Jun 5 20:19:57 2015 Content-Type: multipart/mixed; boundary="===============1303510965665845348==" MIME-Version: 1.0 From: Bill Case To: users at lists.fedoraproject.org Subject: Converting html to text ?!? Date: Fri, 05 Mar 2010 09:51:19 -0500 Message-ID: <1267800679.1787.26.camel@CASE> --===============1303510965665845348== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Hi; I have been using Notecase for a couple of years. I would like to keep it but it seems it is no longer being maintained as open source. =E2=88=B4 I would like to use Lyx as my large note taker or draft writer application. The problem is Notecase uses an *.ncd suffix which is unrecognized by Lyx. When I examine the files I want to import from Notecase to Lyx they are designated in the header as: ... etc. The markup is almost certainly html. How can I convert these files into *.txt? I have tried several variations of: ]$ html2text -o ~/UMLC.txt file:///home/bill/NoteCaseDocs/UMLC. *Cannot open input file "file:///home/bill/NoteCaseDocs/UMLC.*". Any suggestions greatly appreciated on how to import these (I have several) *.ncd files into Lyx. = -- = Regards Bill Fedora 12, Gnome 2.28 Evo.2.28, Emacs 23.1.1 --===============1303510965665845348==-- From bmr at redhat.com Fri Jun 5 20:19:57 2015 Content-Type: multipart/mixed; boundary="===============1896731903242804259==" MIME-Version: 1.0 From: Bryn M. Reeves To: users at lists.fedoraproject.org Subject: Re: Converting html to text ?!? Date: Fri, 05 Mar 2010 15:06:17 +0000 Message-ID: <1267801577.23868.1645.camel@localhost> In-Reply-To: 1267800679.1787.26.camel@CASE --===============1896731903242804259== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable On Fri, 2010-03-05 at 09:51 -0500, William Case wrote: > How can I convert these files into *.txt? I have tried several > variations of: > = > ]$ html2text -o ~/UMLC.txt file:///home/bill/NoteCaseDocs/UMLC. > *Cannot open input file "file:///home/bill/NoteCaseDocs/UMLC.*". > = > Any suggestions greatly appreciated on how to import these (I have > several) *.ncd files into Lyx. YMMV but I'd do something like: for F in foo/*.ncd; do links -dump $F > $(echo $F|sed 's/\.ncd$/.txt/'); done "for each file matching foo/*.ncd convert the dump the content as plain text using links into a file with the same name but the extension .ncd replaced with .txt" I happen to like links but this should work with any html-text converter. Just watch out that it does the right things for what you want with the file names. Regards, Bryn. --===============1896731903242804259==-- From tom.horsley at att.net Fri Jun 5 20:19:58 2015 Content-Type: multipart/mixed; boundary="===============5208297258605482341==" MIME-Version: 1.0 From: Tom Horsley To: users at lists.fedoraproject.org Subject: Re: Converting html to text ?!? Date: Fri, 05 Mar 2010 10:01:28 -0500 Message-ID: <20100305100128.0ec0491f@tomh.ccur.com> In-Reply-To: 1267800679.1787.26.camel@CASE --===============5208297258605482341== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable "lynx" (not lyx :-) is a text mode web browser. With sufficient command line options, you can get it to render html as text and save it in a file in an automated fashion. --===============5208297258605482341==-- From billlinux at rogers.com Fri Jun 5 20:19:58 2015 Content-Type: multipart/mixed; boundary="===============7501070798602825779==" MIME-Version: 1.0 From: Bill Case To: users at lists.fedoraproject.org Subject: Re: [Bulk] Re: Converting html to text ?!? Date: Fri, 05 Mar 2010 10:25:45 -0500 Message-ID: <1267802745.1787.34.camel@CASE> In-Reply-To: 20100305100128.0ec0491f@tomh.ccur.com --===============7501070798602825779== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Hi Tom; On Fri, 2010-03-05 at 10:01 -0500, Tom Horsley wrote: > "lynx" (not lyx :-) is a text mode web browser. With sufficient > command line options, you can get it to render html as text > and save it in a file in an automated fashion. We are talking about different applications. LyX is a type setting program based on LaTex. It can be found in the Fedora 12 repo. = "Lynx is a fully-featured World Wide Web (WWW) client for users running cursor-addressable, character-cell display devices (e.g., vt100 terminals, vt100 emulators running on PCs or Macs, or any other character-cell display)." -- = Regards Bill Fedora 12, Gnome 2.28 Evo.2.28, Emacs 23.1.1 --===============7501070798602825779==-- From pocallaghan at gmail.com Fri Jun 5 20:19:58 2015 Content-Type: multipart/mixed; boundary="===============4954132307897450100==" MIME-Version: 1.0 From: Patrick O'Callaghan To: users at lists.fedoraproject.org Subject: Re: Converting html to text ?!? Date: Fri, 05 Mar 2010 11:12:26 -0430 Message-ID: <1267803746.31856.0.camel@bree.homelinux.com> In-Reply-To: 1267800679.1787.26.camel@CASE --===============4954132307897450100== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable On Fri, 2010-03-05 at 09:51 -0500, William Case wrote: > Hi; > = > I have been using Notecase for a couple of years. I would like to keep > it but it seems it is no longer being maintained as open source. > = > =E2=88=B4 I would like to use Lyx as my large note taker or draft writer > application. The problem is Notecase uses an *.ncd suffix which is > unrecognized by Lyx. When I examine the files I want to import from > Notecase to Lyx they are designated in the header as: > = > > > > > http-equiv=3D"Content-Type"> > > > ... etc. > = > The markup is almost certainly html. > = > How can I convert these files into *.txt? I have tried several > variations of: > = > ]$ html2text -o ~/UMLC.txt file:///home/bill/NoteCaseDocs/UMLC. > *Cannot open input file "file:///home/bill/NoteCaseDocs/UMLC.*". > = > Any suggestions greatly appreciated on how to import these (I have > several) *.ncd files into Lyx. For a smallish number of files, the easiest is probably to open them in Firefox and use Save As ... text. poc --===============4954132307897450100==-- From ignored_mailbox at yahoo.com.au Fri Jun 5 20:20:01 2015 Content-Type: multipart/mixed; boundary="===============7348504029002275497==" MIME-Version: 1.0 From: Tim To: users at lists.fedoraproject.org Subject: Re: [Bulk] Re: Converting html to text ?!? Date: Sat, 06 Mar 2010 17:45:04 +1030 Message-ID: <1267859704.3881.1.camel@suspishus.lan.cameratim.com> In-Reply-To: 1267802745.1787.34.camel@CASE --===============7348504029002275497== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable On Fri, 2010-03-05 at 10:25 -0500, William Case wrote: > We are talking about different applications. LyX is a type setting > program based on LaTex. It can be found in the Fedora 12 repo. = > = > "Lynx is a fully-featured World Wide Web (WWW) client for users > running cursor-addressable, character-cell display devices (e.g., > vt100 terminals, vt100 emulators running on PCs or Macs, or any other > character-cell display)." You didn't read what he said about using Lynx to turn HTML into plain text... -- = [tim(a)localhost ~]$ uname -r 2.6.27.25-78.2.56.fc9.i686 Don't send private replies to my address, the mailbox is ignored. I read messages from the public lists. --===============7348504029002275497==-- From billlinux at rogers.com Fri Jun 5 20:20:03 2015 Content-Type: multipart/mixed; boundary="===============4320144590321568613==" MIME-Version: 1.0 From: Bill Case To: users at lists.fedoraproject.org Subject: Re: [Bulk] Re: Converting html to text ?!? Date: Sat, 06 Mar 2010 10:16:42 -0500 Message-ID: <1267888602.1867.2.camel@CASE> In-Reply-To: 20100305100128.0ec0491f@tomh.ccur.com --===============4320144590321568613== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Hi Tom; On Fri, 2010-03-05 at 10:01 -0500, Tom Horsley wrote: > "lynx" (not lyx :-) is a text mode web browser. With sufficient > command line options, you can get it to render html as text > and save it in a file in an automated fashion. Tim was right. I mis-read your advice last night. Downloaded lynx this morning and it seems to do what I want. I have to learn to play around with saving to text etc. but all the needful stuff seems to be there. -- = Regards Bill Fedora 12, Gnome 2.28 Evo.2.28, Emacs 23.1.1 --===============4320144590321568613==-- From billlinux at rogers.com Fri Jun 5 20:20:04 2015 Content-Type: multipart/mixed; boundary="===============2732857705742706258==" MIME-Version: 1.0 From: Bill Case To: users at lists.fedoraproject.org Subject: Re: Converting html to text ?!? [SOLVED] -- but me or a bug. Date: Sat, 06 Mar 2010 12:22:21 -0500 Message-ID: <1267896141.1867.14.camel@CASE> In-Reply-To: 1267800679.1787.26.camel@CASE --===============2732857705742706258== Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Hi; On Fri, 2010-03-05 at 09:51 -0500, William Case wrote: = > Hi; > = > I have been using Notecase for a couple of years. I would like to keep > it but it seems it is no longer being maintained as open source. > = > =E2=88=B4 I would like to use Lyx as my large note taker or draft writer > application. The problem is Notecase uses an *.ncd suffix which is > unrecognized by Lyx. When I examine the files I want to import from > Notecase to Lyx they are designated in the header as: > = > > > > > http-equiv=3D"Content-Type"> > > > ... etc. > = > The markup is almost certainly html. > = > How can I convert these files into *.txt? I have tried several > variations of: > = > ]$ html2text -o ~/UMLC.txt file:///home/bill/NoteCaseDocs/UMLC.txt > *Cannot open input file "file:///home/bill/NoteCaseDocs/UMLC.*". > = > Any suggestions greatly appreciated on how to import these (I have > several) *.ncd files into Lyx. > = The real solution was: ]$ html2text ~/NoteCaseDocs/UMLC.ncd >> ~/NoteCaseDocs/UMLC.txt Then importing .txt files into LyX. Worked like a charm! However, shouldn't have = ]$ html2text ~/NoteCaseDocs/UMLC.ncd -o ~/NoteCaseDocs/UMLC.txt worked as well. Or, have I misunderstood the use of the '-o' option ?!? -- = Regards Bill Fedora 12, Gnome 2.28 Evo.2.28, Emacs 23.1.1 --===============2732857705742706258==--