Text Processing
flags: g
Non-ASCII Character
Match runs of non-ASCII characters (anything outside U+0000–U+007F).
Pattern
regex (engine-agnostic)
[^\x00-\x7F]+ (flags: g)
Raw source: [^\x00-\x7F]+
How it works
[^\x00-\x7F] is a negated character class: it matches any character NOT in the ASCII range 0x00–0x7F. The trailing + extends each match over consecutive non-ASCII characters, so a whole run is returned as a single match. In `café` the match is `é` and in `naïve` it is `ï`; a longer run such as `日本語` is returned as one match rather than three. Useful for finding accented characters, emoji, CJK, and other Unicode in otherwise-ASCII source.
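The behaviour described above can be sketched in Python. This is a minimal illustration, not part of the original pattern entry: the helper name `non_ascii_runs` is hypothetical, and `re.findall` stands in for the `g` flag because Python has no such flag. It assumes accented letters are stored as single precomposed code points.

```python
import re

# Same pattern as in the entry. findall() returns every match,
# which plays the role of the g flag in engines that have one.
NON_ASCII_RUN = re.compile(r"[^\x00-\x7F]+")

def non_ascii_runs(text: str) -> list:
    """Return each run of consecutive non-ASCII characters in text.

    Hypothetical helper, named here for illustration only.
    """
    return NON_ASCII_RUN.findall(text)

# Assumes the accented letters are precomposed (e.g. U+00E9 for é).
print(non_ascii_runs("Hello, caf\u00e9!"))
print(non_ascii_runs("plain ascii here"))
# Three adjacent CJK characters come back as one run, not three.
print(non_ascii_runs("\u65e5\u672c\u8a9e text"))
```

If the input uses decomposed forms (a plain `e` followed by a combining accent), only the combining mark is non-ASCII, so the match would be the mark alone rather than the visible letter.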
Examples
Input
Hello, café!
Matches
é
Input
naïve résumé 🎉
Matches
ï, é, é, 🎉
Input
plain ascii here
No match
Common use cases
- Auditing source code for non-ASCII identifiers
- Encoding-bug detection in legacy data
- Building i18n test cases
- Migrating between charsets
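As one way to approach the source-code audit use case, the sketch below reports where each non-ASCII run occurs. It is an illustration under stated assumptions: the function name `audit_non_ascii` and the sample text are hypothetical, and positions are counted in code points, 1-based.

```python
import re

NON_ASCII_RUN = re.compile(r"[^\x00-\x7F]+")

def audit_non_ascii(source: str):
    """Yield (line_number, column, run) for each non-ASCII run.

    Hypothetical helper. Lines are split on "\n" only, so unusual
    non-ASCII line separators stay visible to the pattern instead
    of being consumed as line breaks.
    """
    for line_no, line in enumerate(source.split("\n"), start=1):
        for match in NON_ASCII_RUN.finditer(line):
            yield line_no, match.start() + 1, match.group()

# Made-up sample input, written with escapes to keep it unambiguous.
sample = "name = 'ok'\nna\u00efve = 1\n# caf\u00e9 \u2615\n"
for line_no, col, run in audit_non_ascii(sample):
    print(f"{line_no}:{col}: {run!r}")
```

To audit a real file, read it with an explicit encoding first (for example `open(path, encoding="utf-8").read()`); decoding with the wrong charset would itself produce spurious non-ASCII characters.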
Related patterns
Non-Capturing Group (Image URL)
Text Processing: Use non-capturing groups `(?:...)` to alternate without polluting the captured-groups list.
Tab Character
Text Processing: Match literal tab characters, the regex behind every formatter / linter that yells about indentation.
Hashtag
Text Processing: Match hashtags (# followed by word characters) in social media posts, including accented Latin characters.
Lazy / Non-Greedy Quantifier
Text Processing: Demonstrate lazy quantifiers `+?` by matching the SHORTEST HTML-like tag rather than the longest greedy span.