Um, this isn't an XML file. An XML file should look something like this: > <?xml version="1.0" encoding="utf-8" ?> ........Regards, > Richie. > Mathematical Sciences Unit > HSL
It is for sure little complicated then a plain XML file. The format of binary file is according to XML schema. I have been able to get C parser going to get information from binary with one caveat - I have to manually read the XML schema and figure out "which byte means what in binary" and then code it in C. I do not know if it will be of interest or worthwhile to incorporate capability in R to interpret schema and decode binary file based on schema interpretation. Or may be this already exists and it is just that I do not know how to do it. Anyway thanks for looking into it. - Kulwinder Banipal > To: kbani...@hotmail.com > CC: dun...@wald.ucdavis.edu; r-help@r-project.org; > r-help-boun...@r-project.org > Subject: Re: [R] XML parse error > From: richard.cot...@hsl.gov.uk > Date: Thu, 21 May 2009 11:05:02 +0100 > >> I am trying to parse XML file ( binary hex) but get an error. >> Code I am using is: >> xsd = xmlTreeParse(system.file("exampleData", "norel.xsd", package = >> "XML"), isSchema =TRUE) doc = xmlInternalTreeParse(system. >> file("exampleData", "LogCallSummary.bin", package = "XML")) Start >> tag expected, '<' not found >> >> xmlParse command results in same error as well: >> f = system.file("exampleData", "LogCallSummary.bin", package = >> "XML")> doc = xmlParse(f)Start tag expected, '<' not found >> I am at beginner level with XML and will appreciate any help with >> this error or general guidance. >> Thanks >> Kulwinder Banipal >> >> file is: >> 0000000: 0281 0001 0201 0098 c1d5 c000 0000 0000 ................ >> 0000010: 000a c0a8 db35 0055 6000 00af 0001 0001 .....5.U`....... >> 0000020: 5f00 2200 4530 0000 4411 2233 4455 0f08 _.".E0..D."3DU.. >> 0000030: 0123 4567 8901 2340 0000 04d2 0000 0000 .#eg...@........ >> 0000040: 0000 0000 0002 0100 0001 0003 0303 0000 ................ >> 0000050: 0000 0000 0100 0000 6400 0000 0100 0000 ........d....... >> 0000060: 6401 0300 0900 00fe fe00 012f 0001 1111 d........../.... >> 0000070: 0101 0001 1111 0000 0001 0000 2200 0033 ............".. >> 30000080: 3306 0000 3333 0022 0000 1100 0000 0000 3...33."........ >> 0000090: 0033 3400 2300 0011 0000 0001 0000 3335 .34.#......... >> 3500000a0: 0024 0000 1100 0000 0200 0033 3600 2500 .$.........36.%. >> 00000b0: 0011 0000 0003 0000 3337 0026 0000 1100 ........37.&.... >> 00000c0: 0000 0400 0033 3800 2700 0011 0000 0005 .....38.'....... >> 00000d0: 5504 7700 8800 0044 4406 0000 2323 0099 U.w....DD...##.. >> 00000e0: 0100 0200 0000 0000 0023 2400 9901 0002 .........#$..... >> 00000f0: 0000 0001 0000 2325 0099 0100 0200 0000 ......#%........ >> 0000100: 0200 0023 2600 9901 0002 0000 0003 0000 ...#&........... >> 0000110: 2327 0099 0100 0200 0000 0400 0023 2800 #'...........#(. >> 0000120: 9901 0002 0000 0005 0102 0008 0100 0066 ............... >> f0000130: 6600 0055 5533 0000 0000 3400 0000 0a35 f..UU3....4.... >> 50000140: 0000 0014 3600 0000 1e37 0000 0028 3800 ....6....7...(8. >> 0000150: 0000 3239 0000 003c 3a00 0000 463b 0000 ..29...<:...F;.. >> 0000160: 0050 3c00 0000 5a00 0088 8800 0077 7744 .P<...Z...... >> wwD0000170: 0000 0000 4500 0000 0a46 0000 0014 4700 ....E....F.... >> G.0000180: 0000 1e48 0000 0028 4900 0000 324a 0000 ...H...(I...2J.. >> 0000190: 003c 4b00 0000 464c 0000 0050 4d00 0000 .<K...FL...PM... >> 00001a0: 5a02 2207 7766 6604 0500 0000 1100 0088 Z.".wff......... >> 00001b0: 8800 0000 0106 0000 0011 0000 8889 0000 ................ >> 00001c0: 0011 0700 0000 1100 0088 8a00 0000 2108 ..............!. >> 00001d0: 0000 0011 0000 888b 0000 0031 0405 0000 ...........1.... >> 00001e0: 0022 0000 0044 0000 0001 0600 0000 2200 ."...D........". >> 00001f0: 0000 4500 0000 1107 0000 0022 0000 0046 ..E........"... >> F0000200: 0000 0021 0800 0000 2200 0000 4700 0000 ...!...."...G... >> 0000210: 3106 0000 0001 0002 0003 0004 0005 0200 1............... >> 0000220: 0033 4400 0055 6609 0101 0202 0303 0404 .3D..Uf......... >> 0000230: 0505 0606 0707 0808 0909 0405 0000 0011 ................ >> 0000240: 0000 0044 0000 0022 0000 0088 0500 0000 ...D..."........ >> 0000250: 1200 0000 4500 0000 2300 0000 8905 0000 ....E...#....... >> 0000260: 0013 0000 0046 0000 0024 0000 008a 0500 .....F...$...... >> 0000270: 0000 1400 0000 4700 0000 2500 0000 8bfa > ......G...%.....0000280: ae > > Um, this isn't an XML file. An XML file should look something like this: > > <?xml version="1.0" encoding="utf-8" ?> > <tag> > <subtag>value</subtag> > </tag> > > The wikipedia entry on XML gives a reasonable intro to the format. > http://en.wikipedia.org/wiki/Xml > > Regards, > Richie. > > Mathematical Sciences Unit > HSL > > > ------------------------------------------------------------------------ > ATTENTION: > > This message contains privileged and confidential info...{{dropped:26}} ______________________________________________ R-help@r-project.org mailing list https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.