Hello Monks, I am trying to convert a text file into xml using following code:
#!/usr/bin/perl use strict; use warnings; use XML::Writer; use XML::Simple; use XML::LibXML; my $out; my $xml = XML::Writer->new(OUTPUT => \$out, DATA_MODE => 1, DATA_INDEN +T => ' '); $xml->xmlDecl(); $xml->startTag('doc'); my $check_1 = 0; open(my $fh, "<", "20150625163139.xml") or die "Failed to open file: $!\n"; while(<$fh>) { chomp; next if !length; my ($string1, $string2, $subscript_name, $subscript_value) = / ^(.*?):: ([^\s]+) \.([^\s]+)\s+= \s(.*) /x; if ( $check_1 == 0 ) { $xml->startTag($string1); $check_1 += 1; } $xml->startTag($string2); $xml->dataElement($subscript_name => $subscript_value); $xml->endTag(); } $xml->endTag(); $xml->endTag(); $xml->end(); print $out; close $fh;
The file 20150625163139.xml contains 366 lines with format:
GI-eSTB-MIB-NPH::eSTBGeneralErrorCode.0 = INTEGER: 0 GI-eSTB-MIB-NPH::eSTBGeneralConnectedState.0 = INTEGER: true(1) GI-eSTB-MIB-NPH::eSTBGeneralPlatformID.0 = INTEGER: 2076 GI-eSTB-MIB-NPH::eSTBGeneralFamilyID.0 = INTEGER: 25 GI-eSTB-MIB-NPH::eSTBGeneralModelID.0 = INTEGER: 60436 GI-eSTB-MIB-NPH::eSTBGeneralUnitAddressID.0 = STRING: 000-00802-49393- +076 GI-eSTB-MIB-NPH::eSTBGeneralSettopMac.0 = STRING: b8:16:19:28:18:f3 GI-eSTB-MIB-NPH::eSTBGeneralRemodChan.0 = INTEGER: 3 GI-eSTB-MIB-NPH::eSTBGeneralSettopTime.0 = INTEGER: 1119302620 GPS GI-eSTB-MIB-NPH::eSTBPurchaseStatusUnsentPurchases.0 = INTEGER: 0 GI-eSTB-MIB-NPH::eSTBPurchaseStatusUnackPurchases.0 = INTEGER: 0 GI-eSTB-MIB-NPH::eSTBPurchaseStatusLastSeqNumPurchases.0 = INTEGER: 0 GI-eSTB-MIB-NPH::eSTBPurchaseStatusLastReportBackTimePurchases.0 = INT +EGER: 1118516578 GI-eSTB-MIB-NPH::eSTBPurchaseStatusIppvStatus.0 = INTEGER: false(2) GI-eSTB-MIB-NPH::eSTBOobFrequency.0 = INTEGER: 75250000 GI-eSTB-MIB-NPH::eSTBOobCarrierLock.0 = INTEGER: true(1) GI-eSTB-MIB-NPH::eSTBOobLostLockCount.0 = Counter32: 0 GI-eSTB-MIB-NPH::eSTBOobDataPresent.0 = INTEGER: true(1) GI-eSTB-MIB-NPH::eSTBOobEMMDataPresent.0 = INTEGER: false(2) GI-eSTB-MIB-NPH::eSTBOobSNRValue.0 = INTEGER: 24.9 GI-eSTB-MIB-NPH::eSTBOobSNRState.0 = INTEGER: good(4) GI-eSTB-MIB-NPH::eSTBOobAGCValue.0 = INTEGER: 16 GI-eSTB-MIB-NPH::eSTBOobAGCState.0 = INTEGER: good(4) GI-eSTB-MIB-NPH::eSTBOobNetworkPid.0 = INTEGER: 1911 GI-eSTB-MIB-NPH::eSTBOobEMMPid.0 = INTEGER: 5379 GI-eSTB-MIB-NPH::eSTBOobEMMProviderID.0 = INTEGER: 1 GI-eSTB-MIB-NPH::eSTBInBandNumberOfTuners.0 = INTEGER: 2 GI-eSTB-MIB-NPH::eSTBTunerIndex.1 = INTEGER: 1 GI-eSTB-MIB-NPH::eSTBTunerIndex.2 = INTEGER: 2 GI-eSTB-MIB-NPH::eSTBInBandTunerModulationMode.1 = INTEGER: qam256(3) GI-eSTB-MIB-NPH::eSTBInBandTunerModulationMode.2 = INTEGER: qam256(3) GI-eSTB-MIB-NPH::eSTBInBandTunerCarrierLock.1 = INTEGER: true(1) GI-eSTB-MIB-NPH::eSTBInBandTunerCarrierLock.2 = INTEGER: true(1) GI-eSTB-MIB-NPH::eSTBInBandTunerPCRLock.1 = INTEGER: true(1) GI-eSTB-MIB-NPH::eSTBInBandTunerPCRLock.2 = INTEGER: true(1) GI-eSTB-MIB-NPH::eSTBInBandTunerDataLock.1 = INTEGER: true(1) GI-eSTB-MIB-NPH::eSTBInBandTunerDataLock.2 = INTEGER: true(1) GI-eSTB-MIB-NPH::eSTBInBandTunerEMMDataPresent.1 = INTEGER: true(1) GI-eSTB-MIB-NPH::eSTBInBandTunerEMMDataPresent.2 = INTEGER: true(1) GI-eSTB-MIB-NPH::eSTBInBandTunerFrequency.1 = INTEGER: 195000000 GI-eSTB-MIB-NPH::eSTBInBandTunerFrequency.2 = INTEGER: 501000000 GI-eSTB-MIB-NPH::eSTBInBandTunerAGCValue.1 = INTEGER: 0 GI-eSTB-MIB-NPH::eSTBInBandTunerAGCValue.2 = INTEGER: 0 GI-eSTB-MIB-NPH::eSTBInBandTunerAGCState.1 = INTEGER: poor(2) GI-eSTB-MIB-NPH::eSTBInBandTunerAGCState.2 = INTEGER: poor(2) GI-eSTB-MIB-NPH::eSTBInBandTunerSNRValue.1 = INTEGER: 42.0
When I run the above code for this file, I get error:
Code point \u0016 is not a valid character in XML at ./<script_name>.p +l line 34

Where line 34 is

$xml->dataElement($subscript_name => $subscript_value);

When I remove all the lines from the file 20150625163139.xml and keep only these two lines

GI-eSTB-MIB-NPH::eSTBGeneralErrorCode.0 = INTEGER: 0 GI-eSTB-MIB-NPH::eSTBGeneralConnectedState.0 = INTEGER: true(1)

The same code runs fine and generates following xml

<?xml version="1.0"?> <doc> <GI-eSTB-MIB-NPH> <eSTBGeneralErrorCode> <0>INTEGER: 0</0> </eSTBGeneralErrorCode> <eSTBGeneralConnectedState> <0>INTEGER: true(1)</0> </eSTBGeneralConnectedState> </GI-eSTB-MIB-NPH> </doc>
I searched for error: "Code point \u0016 is not a valid character in XML at ./Call_to_snmpwalk.pl line 32" It seems that this error is being generated due to control characters present in the text which are not allowed in xml. So I have two options:

1) Remove these control characters from the file and then print: I have tried this using

perl -pe's/\x08//g' <20150625163139.xml >20150625163139.xml

But this gives error: Bad name after g' at <script_name>.pl line 13.

2) To actually generate an xml (actual .xml file) from code and put the text that is converted in xml into this file and then read it. Do anyone have any suggestions on point 1 or 2?


In reply to [Solved]:How to remove error: Code point \u0016 is not a valid character in XML by Perl300

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.