Uploaded image for project: 'HTML Form Entry Module'
  1. HTML Form Entry Module
  2. HTML-325

Improve parser so that it can handle   and other HTML-specific character codes within a form

    XMLWordPrintable

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: TBD
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: HTML Form Entry 3.7.0
    • Labels:
    • Complexity:
      Low

      Description

      The HFE module parses an HTML Form under the the assumption that the content of the form is xml data (that is, htmlform.getXhtmlData() must be valid XML).

      However, the common HTML character code,

       

      is NOT valid XML. Therefore, whenever the HtmlFormEntryUtil.stringToDocument() method is used to parse a form with this character code, an exception is thrown.

      The workaround is to use the ascii character code for a blank space:

       

      instead of

       

      However, it would be nice to allow the HFE module to be handle forms that contain common character codes.

      Possible ways to handle this:

      1) Manually do a search-and-replace to change

       

      into

       

      2) Figure out how to modify the DocumentBuidler created in HtmlFormEntryUtil.stringToDocument() so that it recognizes the

       

      entity... ie., modify the DTD that the parser uses for verification so that the equivalent of

      <!DOCTYPE some_name [ <!ENTITY nbsp "&#160;"> ]>
      

      is included in the DTD.

        Attachments

          Issue Links

            Activity

              People

              Assignee:
              piyush9620 Piyush Kundnani
              Reporter:
              mogoodrich Mark Goodrich
              Votes:
              3 Vote for this issue
              Watchers:
              3 Start watching this issue

                Dates

                Created:
                Updated:
                Resolved:

                  Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0 minutes
                  0m
                  Logged:
                  Time Spent - 1 hour
                  1h