# swift-html-entities

**Repository Path**: tlisp/swift-html-entities

## Basic Information

- **Project Name**: swift-html-entities
- **Description**: No description available
- **Primary Language**: Unknown
- **License**: Apache-2.0
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2022-01-24
- **Last Updated**: 2022-01-24

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# HTMLEntities

[![Build Status - Master](https://api.travis-ci.org/Kitura/swift-html-entities.svg?branch=master)](https://travis-ci.org/Kitura/swift-html-entities)
![macOS](https://img.shields.io/badge/os-macOS-green.svg?style=flat)
![Linux](https://img.shields.io/badge/os-linux-green.svg?style=flat)
![Apache 2](https://img.shields.io/badge/license-Apache2-blue.svg?style=flat)
[![codecov](https://codecov.io/gh/Kitura/swift-html-entities/branch/master/graph/badge.svg)](https://codecov.io/gh/Kitura/swift-html-entities)
[![Carthage compatible](https://img.shields.io/badge/Carthage-compatible-4BC51D.svg?style=flat)](https://github.com/Carthage/Carthage)

## Summary

A pure Swift utility for encoding and decoding HTML. Includes support for HTML5 named character references. You can find the list of all 2231 HTML5 named character references [here](https://www.w3.org/TR/html5/syntax.html#named-character-references).

`HTMLEntities` can escape ALL non-ASCII characters as well as the characters `<`, `>`, `&`, `"`, `'`, as these five characters are part of the HTML tag and HTML attribute syntaxes.

In addition, `HTMLEntities` can unescape encoded HTML text that contains decimal, hexadecimal, or HTML5 named character references.

## API Documentation

API documentation for `HTMLEntities` is located [here](https://kitura.github.io/swift-html-entities/).

## Features

* Supports HTML5 named character references (`&NegativeMediumSpace;` etc.)
* HTML5 spec-compliant; strict parse mode recognizes [parse errors](https://www.w3.org/TR/html5/syntax.html#tokenizing-character-references)
* Supports decimal and hexadecimal escapes for all characters
* Simple to use, as the functions are added by extending the standard `String` type
* Minimal dependencies; implementation is completely self-contained

## Version Info

The latest release of `HTMLEntities` requires Swift 4.0 or higher.

## Installation

### Via Swift Package Manager

Add `HTMLEntities` to your `Package.swift`:

```swift
import PackageDescription

let package = Package(
  name: "<package-name>",
  ...
  dependencies: [
    .package(url: "https://github.com/Kitura/swift-html-entities.git", from: "3.0.0")
  ]
  // Also, make sure to add HTMLEntities to your package target's dependencies
)
```

### Via CocoaPods

Add `HTMLEntities` to your `Podfile`:

```
target '<project-name>' do
  pod 'HTMLEntities', :git => 'https://github.com/Kitura/swift-html-entities.git'
end
```

### Via Carthage

Add `HTMLEntities` to your `Cartfile`:

```
github "Kitura/swift-html-entities"
```

## Usage

```swift
import HTMLEntities

// encode example
let html = "<script>alert(\"abc\")</script>"

print(html.htmlEscape())
// Prints "&#x3C;script&#x3E;alert(&#x22;abc&#x22;)&#x3C;/script&#x3E;"

// decode example
let htmlencoded = "&lt;script&gt;alert(&quot;abc&quot;)&lt;/script&gt;"

print(htmlencoded.htmlUnescape())
// Prints "<script>alert(\"abc\")</script>"
```

## Advanced Options

`HTMLEntities` supports various options when escaping and unescaping HTML characters.

### Escape Options

#### `allowUnsafeSymbols`

Defaults to `false`. Specifies whether the unsafe ASCII characters (`<`, `>`, `&`, `"`, `'`) should be left unescaped.

```swift
import HTMLEntities

let html = "<p>\"café\"</p>"

print(html.htmlEscape())
// Prints "&#x3C;p&#x3E;&#x22;caf&#xE9;&#x22;&#x3C;/p&#x3E;"

print(html.htmlEscape(allowUnsafeSymbols: true))
// Prints "<p>\"caf&#xE9;\"</p>"
```

#### `decimal`

Defaults to `false`. Specifies whether decimal character escapes should be used instead of hexadecimal ones whenever a numeric character escape is emitted (i.e., it does not affect named character reference escapes). The use of hexadecimal character escapes is recommended.

```swift
import HTMLEntities

// The two "한" and the two "ế" look alike but are built from different Unicode scalars
let text = "한, 한, ế, ế, 🇺🇸"

print(text.htmlEscape())
// Prints "&#x1112;&#x1161;&#x11AB;, &#xD55C;, &#x1EBF;, e&#x302;&#x301;, &#x1F1FA;&#x1F1F8;"

print(text.htmlEscape(decimal: true))
// Prints "&#4370;&#4449;&#4523;, &#54620;, &#7871;, e&#770;&#769;, &#127482;&#127480;"
```

#### `encodeEverything`

Defaults to `false`. Specifies whether all characters should be escaped, even those that are safe. If `true`, overrides the setting for `allowUnsafeSymbols`.

```swift
import HTMLEntities

let text = "A quick brown fox jumps over the lazy dog"

print(text.htmlEscape())
// Prints "A quick brown fox jumps over the lazy dog"

print(text.htmlEscape(encodeEverything: true))
// Prints "&#x41;&#x20;&#x71;&#x75;&#x69;&#x63;&#x6B;&#x20;&#x62;&#x72;&#x6F;&#x77;&#x6E;&#x20;&#x66;&#x6F;&#x78;&#x20;&#x6A;&#x75;&#x6D;&#x70;&#x73;&#x20;&#x6F;&#x76;&#x65;&#x72;&#x20;&#x74;&#x68;&#x65;&#x20;&#x6C;&#x61;&#x7A;&#x79;&#x20;&#x64;&#x6F;&#x67;"

// `encodeEverything` overrides `allowUnsafeSymbols`
print(text.htmlEscape(allowUnsafeSymbols: true, encodeEverything: true))
// Prints "&#x41;&#x20;&#x71;&#x75;&#x69;&#x63;&#x6B;&#x20;&#x62;&#x72;&#x6F;&#x77;&#x6E;&#x20;&#x66;&#x6F;&#x78;&#x20;&#x6A;&#x75;&#x6D;&#x70;&#x73;&#x20;&#x6F;&#x76;&#x65;&#x72;&#x20;&#x74;&#x68;&#x65;&#x20;&#x6C;&#x61;&#x7A;&#x79;&#x20;&#x64;&#x6F;&#x67;"
```

#### `useNamedReferences`

Defaults to `false`. Specifies whether named character references should be used whenever possible. Set to `false` to always use numeric character references, e.g., for compatibility with older browsers that do not recognize named character references.

```swift
import HTMLEntities

let html = "<script>alert(\"abc\")</script>"

print(html.htmlEscape())
// Prints "&#x3C;script&#x3E;alert(&#x22;abc&#x22;)&#x3C;/script&#x3E;"

print(html.htmlEscape(useNamedReferences: true))
// Prints "&lt;script&gt;alert(&quot;abc&quot;)&lt;/script&gt;"
```

#### Set Escape Options Globally

HTML escape options can be set globally so that you don't have to pass them every time you escape a string. The options are managed in the `String.HTMLEscapeOptions` struct.
```swift
import HTMLEntities

// set `useNamedReferences` to `true` globally
String.HTMLEscapeOptions.useNamedReferences = true

let html = "<script>alert(\"abc\")</script>"

// Now, the default behavior of `htmlEscape()` is to use named character references
print(html.htmlEscape())
// Prints "&lt;script&gt;alert(&quot;abc&quot;)&lt;/script&gt;"

// And you can still go back to using numeric character references only
print(html.htmlEscape(useNamedReferences: false))
// Prints "&#x3C;script&#x3E;alert(&#x22;abc&#x22;)&#x3C;/script&#x3E;"
```

### Unescape Options

#### `strict`

Defaults to `false`. Specifies whether HTML5 parse errors should be thrown or silently passed over.

**Note**: `htmlUnescape()` is a throwing function if `strict` is passed as an argument (regardless of whether it is set to `true` or `false`); `htmlUnescape()` is NOT a throwing function if no argument is provided.

```swift
import HTMLEntities

// numeric character references with the trailing semicolons missing
let text = "&#4370&#4449&#4523"

print(text.htmlUnescape())
// Prints "한"

print(try text.htmlUnescape(strict: true))
// Throws a `ParseError.MissingSemicolon` instance

// a throwing function because `strict` is passed in argument
// but no error is thrown because `strict: false`
print(try text.htmlUnescape(strict: false))
// Prints "한"
```

## Acknowledgments

`HTMLEntities` was designed to support some of the same options as [`he`](https://github.com/mathiasbynens/he), a popular JavaScript HTML encoder/decoder.

## License

Apache 2.0