noMisleadingCharacterClass

import { Tabs, TabItem } from '@astrojs/starlight/components';

<Tabs> <TabItem label="JavaScript (and super languages)" icon="seti:javascript"> ## Summary - Rule available since: `v1.5.0` - Diagnostic Category: [`lint/suspicious/noMisleadingCharacterClass`](/reference/diagnostics#diagnostic-category) - This rule is **recommended**, meaning it is enabled by default. - This rule has a [**safe**](/linter/#safe-fixes) fix. - The default severity of this rule is [**error**](/reference/diagnostics#error). - Sources: - Same as [`no-misleading-character-class`](https://eslint.org/docs/latest/rules/no-misleading-character-class)

How to configure

json

{
	"linter": {
		"rules": {
			"suspicious": {
				"noMisleadingCharacterClass": "error"
			}
		}
	}
}

Description

Disallow characters made with multiple code points in character class syntax.

Unicode includes the characters which are made with multiple code points. e.g. Á, 🇯🇵, 👨‍👩‍👦. A RegExp character class /[abc]/ cannot handle characters with multiple code points. For example, the character ❇️ consists of two code points: ❇ (U+2747) and VARIATION SELECTOR-16 (U+FE0F). If this character is in a RegExp character class, it will match to either ❇ or VARIATION SELECTOR-16 rather than ❇️. This rule reports the regular expression literals which include multiple code point characters in character class syntax.

Examples

Invalid

/^[Á]$/u;

<pre class="language-text"><code class="language-text">code-block.js:1:4 <a href="https://biomejs.dev/linter/rules/no-misleading-character-class">lint/suspicious/noMisleadingCharacterClass</a> ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ✖ A character class cannot match a character and a combining character. > 1 │ /^[Á]$/u; │ ^ 2 │ ℹ A character and a combining character forms a new character. Replace the character class with an alternation. </code></pre>

/^[❇️]$/u;

<pre class="language-text"><code class="language-text">code-block.js:1:4 <a href="https://biomejs.dev/linter/rules/no-misleading-character-class">lint/suspicious/noMisleadingCharacterClass</a> ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ✖ A character class cannot match a character and a combining character. > 1 │ /^[❇️]$/u; │ ^ 2 │ ℹ A character and a combining character forms a new character. Replace the character class with an alternation. </code></pre>

/^[👶🏻]$/u;

<pre class="language-text"><code class="language-text">code-block.js:1:4 <a href="https://biomejs.dev/linter/rules/no-misleading-character-class">lint/suspicious/noMisleadingCharacterClass</a> ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ✖ A character class cannot match an emoji with a skin tone modifier. > 1 │ /^[👶🏻]$/u; │ ^^^^ 2 │ ℹ Replace the character class with an alternation. </code></pre>

/^[🇯🇵]$/u;

<pre class="language-text"><code class="language-text">code-block.js:1:4 <a href="https://biomejs.dev/linter/rules/no-misleading-character-class">lint/suspicious/noMisleadingCharacterClass</a> ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ✖ A character class cannot match a pair of regional indicator symbols. > 1 │ /^[🇯🇵]$/u; │ ^^ 2 │ ℹ A pair of regional indicator symbols encodes a country code. Replace the character class with an alternation. </code></pre>

/^[👨‍👩‍👦]$/u;

<pre class="language-text"><code class="language-text">code-block.js:1:4 <a href="https://biomejs.dev/linter/rules/no-misleading-character-class">lint/suspicious/noMisleadingCharacterClass</a> ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ✖ A character class cannot match a joined character sequence. > 1 │ /^[👨‍👩‍👦]$/u; │ ^^^^ 2 │ ℹ A zero width joiner composes several emojis into a new one. Replace the character class with an alternation. </code></pre>

/^[👍]$/; // surrogate pair without u flag

<pre class="language-text"><code class="language-text">code-block.js:1:4 <a href="https://biomejs.dev/linter/rules/no-misleading-character-class">lint/suspicious/noMisleadingCharacterClass</a> FIXABLE ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ ✖ A character class cannot match a surrogate pair. Add the 'u' unicode flag to match against them. > 1 │ /^[👍]$/; // surrogate pair without u flag │ ^^ 2 │ ℹ A surrogate pair forms a single codepoint, but is encoded as a pair of two characters. Without the unicode flag, the regex matches a single surrogate character. ℹ Safe fix: Add unicode u flag to regex 1 │ /^[👍]$/u;·//·surrogate·pair·without·u·flag │ + </code></pre>

Valid

/^[abc]$/;
/^[👍]$/u;
/^[\q{👶🏻}]$/v;

</TabItem> </Tabs>

How to configure

Description

Examples

Invalid

Valid

Related links