OASIS Open Document Format for Office Applications (OpenDocument) TC

  • 1.  Archives of ODF files?

    Posted 12-28-2015 21:31
    -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Greetings! I have been extracting the content.xml files from ODF files in an effort to gauge the variety of the XML being produced by ODF applications. Since Google has lacked the foresight to enable limiting search results by ODF file types, :-(, I thought members of the TC might know of existing ODF file collections that I could repurpose? I want to avoid canned example documents because they are unlikely to reflect the experience in the "wild" as it were with ODF documents. Thanks! Hope everyone is having a great holiday season! Patrick - -- Patrick Durusau patrick@durusau.net Technical Advisory Board, OASIS (TAB) OpenDocument Format TC (OASIS), Project Editor ISO/IEC 26300 Co-Editor 13250-5 (Topic Maps) Another Word For It (blog): http://tm.durusau.net Homepage: http://www.durusau.net Twitter: patrickDurusau -----BEGIN PGP SIGNATURE----- Version: GnuPG v1 iQIcBAEBAgAGBQJWgan1AAoJEFPGsgi3MgycQVQQAJsfpX+m83HEZC+KVL3MugYB 9RVwhXp2gDqlLMuzw1eYLJPZWVJkLA/xfbK2EEngBhzDL8MCT2synSP9D8ljjKzP 5HyEVr3kF0gkmSJkHenFYWj4NX5VeBcHuemUN+/6fpcpDbmICxeY/lKtElvmPEOk mCbM2v6D8OL8ZrgILS5H1IGIaKE7ivioj64pqmtmo4JfGNZPx6CFAvAmW+x3k96i ljUVBK3PXy4mE5MMyPLuNo5Iv6bJh5KSGnAqedGOpyb5r/Ov035xNaoPiHNoUuzT g4e6pimZaT1QjBS8jkd+nrsjGQ+msDWCKJ8qqhuyxnD23bxJkP8lNEfd5AAQ6943 RqdKRS9ludAOVFHWmT62ry9VPcRcdE75DVmmtc7jn7+KqzZQBbLwUIww2kUFMNlp +qwJQ1dIDzuConESF5VU7iu0QS2AxrHT1E55+PkuKgIOlZUwsrB22trLdTbhZG1x FqPjunm0mzplaKhWpyFXy5+4SW/uaHUvij7QT2xgrqJHW2ZDrgL0YLaoQhpMzTAy mUs2CU5p4+2Do7WO8uLJAx26Zzg5F4F5Fbs0jcsYhCG3XBh7TVC7BwugA+XsF4Vb 05eIsbbcDnPcnInCm7HXDYnEwALQF8s1vmR5UJnsUsBx+2ao1xMN47ymJwqkNpKE DDrTkAGzx+1/cC9XiM6x =4/XT -----END PGP SIGNATURE-----


  • 2.  Re: [office] Archives of ODF files?

    Posted 12-29-2015 01:47
    -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Greetings! Regina advises that despite the lack of odf as a filetype in Google advance search menu that: filetype:odt works. Of course I had to try it for spreadsheets: http://thentao.com/Dungeon-Index.ods was one result. Which made me think of Svante for some reason. ;-) Cool! Patrick PS: But at 237,000 results for .ods, that sounds low to me. You? Same result from Googel.de. I wonder if Google is indexing for the filetype:ods or odt? Is there an obligation to correctly report the file extensions encountered? Maybe we should ask the EU. ;-) PPS: Does make me even more curious why Advanced Google Search doesn't have the ODF file extensions? On 12/28/2015 04:30 PM, Patrick Durusau wrote: > Greetings! > > I have been extracting the content.xml files from ODF files in an > effort to gauge the variety of the XML being produced by ODF > applications. > > Since Google has lacked the foresight to enable limiting search > results by ODF file types, :-(, I thought members of the TC might > know of existing ODF file collections that I could repurpose? > > I want to avoid canned example documents because they are unlikely > to reflect the experience in the "wild" as it were with ODF > documents. > > Thanks! > > Hope everyone is having a great holiday season! > > Patrick > > > --------------------------------------------------------------------- > > To unsubscribe from this mail list, you must leave the OASIS TC that > generates this mail. Follow this link to all your TCs in OASIS > at: > https://www.oasis-open.org/apps/org/workgroup/portal/my_workgroups.php > > > - -- Patrick Durusau patrick@durusau.net Technical Advisory Board, OASIS (TAB) OpenDocument Format TC (OASIS), Project Editor ISO/IEC 26300 Co-Editor 13250-5 (Topic Maps) Another Word For It (blog): http://tm.durusau.net Homepage: http://www.durusau.net Twitter: patrickDurusau -----BEGIN PGP SIGNATURE----- Version: GnuPG v1 iQIcBAEBAgAGBQJWgeX0AAoJEFPGsgi3MgycFgkQAMmeE6EruV8uM8Y2sbV+3Wzn 2Vqg7qjiClthEHLPbO8VoDRaxQishvLTepXWqrie/wgBLR0CAEH9f5zyeGY7U+B8 jmdSRmLEq4iqAWIxK5KyFpFXJVJ/BCWoPcogd3tjoJqzmOURrDLQ349OM+NQJrIu f4fpsDxxX7eZRRx6UFeWO4OGkMr4FCWuC325PVM5GvawAoqOhv3/6K4tNpC/mp9o uzxPHq5PVlsAFSxOawQTww3lZKtNv7zDw6EqkkzHy8UJQThTY3Sdqe1Bd2QIzbt/ fNIHqSM13XDlset9en2dkhh8cl4Uo0HeSie+qs+jS/BUbi1YFoO9f9qDC8HNKmVf /d8VjEnpVOQqexkA41TtVeBiapUWGDX8fqx1lfSLbqSkChgAHBVSMJtsx/qWj5fJ Go1T0PzDoewko5u7IrFUIuHUF+jyh6j4bnknVORsnPsa3PxHgnIz0lYwE4ynAkUH J/gD9nKIKrKLFIls4dRSfaJeoPoHWK/F2cIH9Dna4prQgoOu4h+PY3etMxmJKcc6 sEbOJ4FWCBfCIyhoGbWwv8dxln2jb0wC8SQVL0EFudFBRLbBK9ML2o0+Vyu5PFUr KoOpvDx7lFCjPJiIhAbyDhKZuIQDFbYHnz9Sf7VhGMuY9rTBBw3qJzU5Qe5Abg2L qry039qEcaUVOklmZDd9 =njk5 -----END PGP SIGNATURE-----


  • 3.  Re: [office] Archives of ODF files?

    Posted 12-31-2015 15:14
    On Monday 28 December 2015 16:30:29 Patrick Durusau wrote: > Greetings! > > I have been extracting the content.xml files from ODF files in an > effort to gauge the variety of the XML being produced by ODF > applications. > > Since Google has lacked the foresight to enable limiting search > results by ODF file types, :-(, I thought members of the TC might know > of existing ODF file collections that I could repurpose? > > I want to avoid canned example documents because they are unlikely to > reflect the experience in the "wild" as it were with ODF documents. > Here is a script to download many ODF files at once. The original is here: https://quickgit.kde.org/?p=calligra.git&a=tree&f=devtools%2Fscripts Despite the name it also works for odt, ods and odp. You need to have the perl modules LWP::Protocol::https and LWP:;UserAgent installed. To download a 100 odp files about protein, run perl downloadMSOfficeDocuments.pl 100 protein odp Cheers, Jos Attachment: downloadMSOfficeDocuments.pl Description: Perl program


  • 4.  Re: [office] Archives of ODF files?

    Posted 01-02-2016 15:45
    -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Jos, Thanks! This should prove to be quite useful. Hope you are at the start of a Happy New Year! Patrick On 12/31/2015 10:13 AM, Jos van den Oever wrote: > On Monday 28 December 2015 16:30:29 Patrick Durusau wrote: >> Greetings! >> >> I have been extracting the content.xml files from ODF files in >> an effort to gauge the variety of the XML being produced by ODF >> applications. >> >> Since Google has lacked the foresight to enable limiting search >> results by ODF file types, :-(, I thought members of the TC might >> know of existing ODF file collections that I could repurpose? >> >> I want to avoid canned example documents because they are >> unlikely to reflect the experience in the "wild" as it were with >> ODF documents. >> > > Here is a script to download many ODF files at once. The original > is here: > https://quickgit.kde.org/?p=calligra.git&a=tree&f=devtools%2Fscripts > > Despite the name it also works for odt, ods and odp. You need to > have the perl modules LWP::Protocol::https and LWP:;UserAgent > installed. > > To download a 100 odp files about protein, run > > perl downloadMSOfficeDocuments.pl 100 protein odp > > Cheers, Jos > > > > > --------------------------------------------------------------------- > > To unsubscribe from this mail list, you must leave the OASIS TC that > generates this mail. Follow this link to all your TCs in OASIS > at: > https://www.oasis-open.org/apps/org/workgroup/portal/my_workgroups.php > > - -- Patrick Durusau patrick@durusau.net Technical Advisory Board, OASIS (TAB) OpenDocument Format TC (OASIS), Project Editor ISO/IEC 26300 Co-Editor 13250-5 (Topic Maps) Another Word For It (blog): http://tm.durusau.net Homepage: http://www.durusau.net Twitter: patrickDurusau -----BEGIN PGP SIGNATURE----- Version: GnuPG v1 iQIcBAEBAgAGBQJWh/CUAAoJEFPGsgi3Mgyc51MQAM1hs4gE+LW2Gh73w8OOybGA /CS+gINP8vW5KvP9kNzUPJ4SZiRavPs7lH/051MUxATiCc8N0/XbZRbp/KEmdAVn /BLSUkWcPW/2avASRqt4VrKxxxU8csPUg0n77B3QCsZ1PTtFEjHZvfBRcesGc354 g/SxRSJGLjbCEC/Ndfd2qlBfZ3EDQKc+WtheTdEhgnjAql7r6NQlFhs/i3KpGqVl FebJgCxN06hkX3JlK1K/MMbyaL1Gt1MYGZKgh4i0solN4/83NL0kzp+Bqm7cov6f UnKstwsLBiW0c20aJY0Gfs2KrRAU0l9nBWViCDnkgn8VqWm5ifi8FuqveDDwb9KC HZlCicLdtUdtqi8mTTxba0tk35rpgnA6vpYA1dw/CZX2gFsnOoIrSVuLoH3E4/N0 2rUy5z+ZDUEnuA2yGPxVkTJpTgfeCiVwO/yeWo2HvvfDLMLSKRd9ebNEUxPNUcQp gZVNBt1wOAve4pwOZjLJvdLCzdZODGRgkg7FtURbToA5VdzT8854CGUwMGDbRDUv 5fIn3C4oEdo5JPXncbxouZk4tqvJxF7vheGP4/x45GQqh6UYYr+PSFdHgEqj2FMu +8iOWDPYlSQ8/Yar4GhN/CfUKi6B8YI2LXYOHUxxXjanLJJiCUjWN8/ycBYu8E25 maaiHdCxaRpmpflyi50W =4Zx0 -----END PGP SIGNATURE-----