Pi-hole regex extensions¶
Only match specific query types¶
You can amend the regular expressions by special keywords added at the end to fine-tune regular expressions to match only specific query types. In contrast to the description of OTHER
as being deprecated for storing queries in the database, it is still supported for regular expressions and will match all queries that are not explicitly covered by the other query types (see also example below).
Example:
abc;querytype=AAAA
will block
dig AAAA abc
but not
dig A abc
This allows you to do query type based denylisting and allowlisting.
Some user-provided examples are:
-
.*;querytype=!A
A regex denylist entry for blocking
AAAA
(in fact, everything else thanA
, call it "anti-A
") requests for all clients assigned to the same group. This has been mentioned to be beneficial for devices like Chromecast. You may want to fine-tune this further to specific domains. -
.*;querytype=PTR
A regex allowlist entry used to permit
PTR
lookups with the above "anti-A
" regex -
.*;querytype=ANY
A regex denylist entry to block
ANY
request network wide. -
.*;querytype=OTHER
A regex denylist entry to block
OTHER
request network wide. This rule will match, for instance, proprietary DNS requests using custom query types in the reserved range or queries for seldom used DNS record types likeIXFR
orAXFR
.
Note that multiple (comma-separated) query types can be specified at the same time, e.g., .*;querytype=A,AAAA
will match both A
and AAAA
requests. In a similar fashion, an inverted (!
modifier) list, e.g., .*;querytype=!A,AAAA
will match everything except A
and AAAA
requests.
Invert matching¶
Sometimes, it may be useful to be able to invert a regular expression altogether. Hence, we added the keyword ;invert
to achieve exactly this.
For instance,
^abc$;querytype=AAAA;invert
will not block abc
with type AAAA
(but everything else) for the clients assigned to the same groups. This inversion is independent for the query type, e.g.
^abc$;invert
will block not block abc
but everything else.
Specify reply type¶
Pi-hole allows you to configure the reply it serves when a regular expression matches a query. This can be controlled via the ;reply
keyword.
Valid options are:
;reply=nodata
(an empty answer will be provided);reply=nxdomain
("no such domain" will be provided, can cause unintended side-effects);reply=refused
(the query will be refused);reply=none
(the query will be silently dropped);reply=ip
(the Pi-hole's IP address if not overwritten by configuration options:reply.blocking.IPv4
/reply.blocking.IPv6
);reply=1.2.3.4
(any valid IPv4 address);reply=fe80::1234
(any valid IPv6 address)
Only one option should be specified. An exception to this rule are the last two options which may be specified at the same time to configure both an IPv4 and an IPv6 address:
- IPv4 only:
myregex;reply=1.2.3.4
will result in A 1.2.3.4
and AAAA ::
- IPv6 only:
myregex;reply=fe80::1234
will result in A 0.0.0.0
and AAAA fe80:1234
- IPv4 and IPv6:
myregex;reply=1.2.3.4;reply=fe80::1234
will result in A 1.2.3.4
and AAAA fe80:1234
Comments¶
You can specify comments within your regex using the syntax
(?#some comment here)
The comment can contain any characters except for a closing parenthesis )
(for the sole reason being the terminating element). The text in the comment is completely ignored by the regex parser and it used solely for readability purposes.
$ pihole-FTL regex-test "doubleclick.net" "(^|\.)doubleclick\.(?#TODO: We need to maybe support more than just .net here)net$"
FTL Regex test:
Domain: "doubleclick.net"
Regex: "(^|\.)doubleclick\.(?#TODO: We need to maybe support more than just .net here)net$"
Step 1: Compiling regex filter...
Compiled regex filter in 0.167 msec
Step 2: Checking domain...
Done in 0.032 msec
MATCH
Back-references¶
A back reference is a backslash followed by a single non-zero decimal digit d
. It matches the same sequence of characters matched by the d
th parenthesized subexpression.
Example:
"cat.foo.dog---cat%dog!foo" is matched by "(cat)\.(foo)\.(dog)---\1%\3!\2"
Another (more complex example is):
(1234|4321)\.(foo)\.(dog)--\1
MATCH: 1234.foo.dog--1234
MATCH: 4321.foo.dog--4321
NO MATCH: 1234.foo.dog--4321
Mind that the last line gives no match as \1
matches exactly the same sequence the first character group matched. And 4321
is not the same as 1234
even when both are valid replies for (1234|4321)
Back references are not defined for POSIX EREs (for BREs they are, surprisingly enough). We add them to ERE in the BRE style.
$ pihole-FTL regex-test "someverylongandmaybecomplexthing.foo.dog--someverylongandmaybecomplexthing" "(someverylongandmaybecomplexthing|somelesscomplexitem)\.(foo)\.(dog)--\1"
FTL Regex test:
Domain: "someverylongandmaybecomplexthing.foo.dog--someverylongandmaybecomplexthing"
Regex: "(someverylongandmaybecomplexthing|somelesscomplexitem)\.(foo)\.(dog)--\1"
Step 1: Compiling regex filter...
Compiled regex filter in 0.563 msec
Step 2: Checking domain...
Done in 0.031 msec
MATCH
More character classes for bracket expressions¶
A bracket expression specifies a set of characters by enclosing a nonempty list of items in brackets. Normally anything matching any item in the list is matched. If the list begins with ^
the meaning is negated; any character matching no item in the list is matched.
- Multiple characters:
[abc]
matchesa
,b
, andc
. - Character ranges:
[0-9]
matches any decimal digit. - Character classes:
[:alnum:]
alphanumeric characters[:alpha:]
alphabetic characters[:blank:]
blank characters[:cntrl:]
control characters[:digit:]
decimal digits (0 - 9)[:graph:]
all printable characters except space[:lower:]
lower-case letters (FTL matches case-insensitive by default)[:print:]
printable characters including space[:punct:]
printable characters not space or alphanumeric[:space:]
white-space characters[:upper:]
upper case letters (FTL matches case-insensitive by default)[:xdigit:]
hexadecimal digits
Furthermore, there are two shortcuts for some character classes:
\d
- Digit character (equivalent to[[:digit:]]
)\D
- Non-digit character (equivalent to[^[:digit:]]
)