User Guide

Table Of Contents
Processing a request in ColdFusion 411
Determining the character encoding of a ColdFusion page
When a request for a ColdFusion page occurs, ColdFusion opens the page, processes the content,
and returns the results back to the browser of the requestor. In order to process the ColdFusion
page, though, ColdFusion has to interpret the page content.
One piece of information used by ColdFusion is the Byte Order Mark (BOM) in a ColdFusion
page. The BOM is a special character at the beginning of a text stream that specifies the order of
bytes in multibyte characters used by the page. The following table lists the common BOM
values:
To insert a BOM character in a CFML page easily, your editor must support BOM characters.
Many web page development tools support insertion of these characters, including Macromedia
Dreamweaver MX, which automatically sets the BOM based on the Page Properties Document
Encoding selection.
If your page does not contain a BOM, you can use the
cfprocessingdirective tag to set the
character encoding of the page. If you insert the
cfprocessingdirective tag on a page that has
a BOM, the information specified by the
cfprocessingdirective tag must be the same as for
the BOM; otherwise, ColdFusion issues an error.
The following procedure describes how ColdFusion recognizes the encoding format of a
ColdFusion page.
To determine the page encoding (performed by ColdFusion):
1.
Use the BOM, if specified on the page.
Macromedia recommends that you use BOM characters in your files.
2.
Use the pageEncoding attribute of the cfprocessingdirective tag, if specified. For detailed
information on how to use this attribute, see the
cfprocessingdirective tag in CFML
Reference.
3.
Default to the JVM default file character encoding. By default, this is the operating system
default character encoding. To specify the JVM default file character encoding, use the
-Dfile.encoding= switch in the JVM Arguments field of the ColdFusion MX Administrator
Java and JVM Settings page.
Encoding BOM signature
UTF-8 EF BB BF
UTF-16 Big Endian FE FF
UTF-16 Little Endian FF FE