[Solved] LibreOffice File format error found at SAXParse

Help with installation and general system troubleshooting questions concerning the office suite LibreOffice.

[Solved] LibreOffice File format error found at SAXParse

Postby chrisrw » Sat Dec 24, 2016 7:19 am

hello,
Please help me i have the same issue with my files.i will really appreciate if you fix it for me.
here is the link you can access it from.https://wetransfer.com/downloads/059862 ... 712/9fafa3
or https://drive.google.com/drive/my-drive the file is assignment ICT4D
thank you
chris
Last edited by Hagar Delest on Sat Jan 14, 2017 12:46 pm, edited 1 time in total.
Reason: tagged [Solved].
libreoffice5.2.4.2
ubuntu 14.04 LTS 64BIT
chrisrw
 
Posts: 2
Joined: Fri Dec 23, 2016 9:35 am

Re: [Solved] LibreOffice File format error found at SAXParse

Postby chrisrw » Sat Dec 24, 2016 7:59 am

Hello,
Please help me.i have the same issue with my file and i have to submit it not later than today.
the fil is too big so you can access it on this link https://we.tl/khVn4jDjRg

I will appreciate your help.
thank you
chris
libreoffice5.2.4.2
ubuntu 14.04 LTS 64BIT
chrisrw
 
Posts: 2
Joined: Fri Dec 23, 2016 9:35 am

Re: [Solved] LibreOffice File format error found at SAXParse

Postby RoryOF » Sat Dec 24, 2016 10:47 am

chrisrw wrote:hello,
Please help me i have the same issue with my files.i will really appreciate if you fix it for me.
here is the link you can access it from.https://wetransfer.com/downloads/059862 ... 712/9fafa3
or https://drive.google.com/drive/my-drive the file is assignment ICT4D
thank you
chris


Try opening this using AbWord.
Apache OpenOffice 4.1.10 on Xubuntu 20.04.2 (mostly 64 bit version) and very infrequently on Win2K/XP
User avatar
RoryOF
Moderator
 
Posts: 32715
Joined: Sat Jan 31, 2009 9:30 pm
Location: Ireland

Re: [Solved] LibreOffice File format error found at SAXParse

Postby John_Ha » Sat Dec 24, 2016 1:20 pm

Try this file https://dl.dropboxusercontent.com/u/100 ... UTLINE.odt.

I merely opened it with Apache OpenOffice and saved it. Please let us know if any content was lost so we can recommend this method to others if it works OK.
LO 6.4.4.2, Windows 10 Home 64 bit

See the Writer Guide, the Writer FAQ, the Writer Tutorials and Writer for students.

Remember: Always save your Writer files as .odt files. - see here for the many reasons why.
John_Ha
Volunteer
 
Posts: 8492
Joined: Fri Sep 18, 2009 5:51 pm
Location: UK

Re: [Solved] LibreOffice File format error found at SAXParse

Postby RoryOF » Sat Dec 24, 2016 1:29 pm

I got nine pages in the file with OpenOffice, thirteen in Abiword (from extra content). There was no error reported on examination of the .xml file.
Apache OpenOffice 4.1.10 on Xubuntu 20.04.2 (mostly 64 bit version) and very infrequently on Win2K/XP
User avatar
RoryOF
Moderator
 
Posts: 32715
Joined: Sat Jan 31, 2009 9:30 pm
Location: Ireland

Re: [Solved] LibreOffice File format error found at SAXParse

Postby John_Ha » Sat Dec 24, 2016 2:42 pm

chrisw

Your method of formatting leaves a lot to be desired. Go View > Non printing characters (or click the ¶ icon) to see what you are doing.

Ideally you should be using Styles, but if you want to apply direct formatting, you should be using things like Format > Paragraph; or Align left/centre/right; and/or using the built in tabs or setting your own left/centre and right tabs. Some text is justified (stretched to reach both margins) some is not. You are using multiple different fonts. Some paragraphs start outside the page margins. Different paragraphs have Format > Paragraph > Indents and spacing > Gap after paragraph etc set to different values.

See the uploaded file here where I have removed all formatting and set all paragraphs to 2mm gap below. Now apply your formatting - I doubt it will take you more than 10 minutes and it will look much more professional. Learn about the Wrap (don't use multiple new paragraphs!!) and Anchor options for the images. Right click an image for the options including adding a caption.

Can I suggest you read the up to date Writer guide in my footer - it is written for students.

Clipboard01.png
Poor formatting
Attachments
unformatted.odt
File with images removed to reduce size for uploading
(31.24 KiB) Downloaded 180 times
Last edited by John_Ha on Sun Dec 25, 2016 1:26 am, edited 1 time in total.
LO 6.4.4.2, Windows 10 Home 64 bit

See the Writer Guide, the Writer FAQ, the Writer Tutorials and Writer for students.

Remember: Always save your Writer files as .odt files. - see here for the many reasons why.
John_Ha
Volunteer
 
Posts: 8492
Joined: Fri Sep 18, 2009 5:51 pm
Location: UK

Re: [Solved] LibreOffice File format error found at SAXParse

Postby John_Ha » Sat Dec 24, 2016 4:30 pm

chrisw

See this file - it contains all the text from your file I downloaded but has no formatting.

I got all the text out by:

1 Extract \word\document.xml by un-ZIPping the .docx file
2 Open document.xml with Notepad++
3 Go Search > Replace ..., with search argument <[^>]+> and replace argument is blank. Be sure to tick Regular Expressions. Click Replace All.

This removes all the XML tags and leaves just the text.

Clipboard01.png
Notepad++ Search using Replace to remove all XML tags

You cannot do this in Writer because the XML contains more than 64k (actually 65,536) characters all in one single paragraph. You can paste the entire XML into Writer but as soon as you then try to do anything, Writer crashes because you have exceeded the Writer 64k paragraph limit. If you want to use Writer you need to split document.xml into parts where each part has less than 64k characters.
Attachments
text only.odt
document.xml with all XML tags removed.
(36.71 KiB) Downloaded 185 times
LO 6.4.4.2, Windows 10 Home 64 bit

See the Writer Guide, the Writer FAQ, the Writer Tutorials and Writer for students.

Remember: Always save your Writer files as .odt files. - see here for the many reasons why.
John_Ha
Volunteer
 
Posts: 8492
Joined: Fri Sep 18, 2009 5:51 pm
Location: UK


Return to LibreOffice

Who is online

Users browsing this forum: No registered users and 1 guest