New extended selectors: matches-attr, matches-path, upward #4931

chrmod · 2025-06-09T08:08:01Z

No description provided.

packages/adblocker-extended-selectors/src/eval.ts

philipp-classen · 2025-06-10T08:51:31Z

packages/adblocker-extended-selectors/src/eval.ts

+      // Convert the pattern to a RegExp
+      // Escape special characters except for regex patterns
+      const escapedPattern = pattern.replace(/[.*+?^${}()|[\]\\]/g, '\\$&');
+      const regex = new RegExp(escapedPattern);


(same comment about caching compiled regular expressions)

packages/adblocker-extended-selectors/src/eval.ts

remusao · 2025-06-11T12:55:40Z

packages/adblocker-extended-selectors/src/eval.ts

+      : findAncestorBySelector(c, argument);
+
+    if (ancestor === null) {
+      return [];


Should we keep processing other candidates instead of doing an early return here?

that was a great catch! test to cover that case added

remusao · 2025-06-11T12:57:43Z

packages/adblocker-extended-selectors/src/eval.ts

+
+      const path = globalThis.window.location.pathname;
+
+      const pattern = argument.replace(/^\/|\/$/g, '');


Should we handle cases where patterns might end with /i for case-insensitive matching for example?

this was replaced with a simpler approach to detect a pair of wrapping slashes

remusao · 2025-06-11T12:58:11Z

packages/adblocker-extended-selectors/src/eval.ts

+        return false;
+      }
+
+      const escapedPattern = pattern.replace(/[.*+?^${}()|[\]\\]/g, '\\$&');


I think we have a utility for this already in network filters parsing code, maybe it's worth sharing?

by looking at existing filters:

href=/[a-zA-Z0-9]{100,}/ /^on/=/event/ /-h?ref/ href=/https:\/\/thehackernews\.uk\/[a-zA-Z0-9]{4,}/ href=/^\/[-a-z]+\?[a-z]{2,}=/ href=/(^|audiobookbay\.lu)\/[-a-z0-9]+$/ class=/^[a-zA-Z]{2}$/ href="/__cft__\[0\]=[-\w]{265,}/" class=/^[a-zA-Z]{2}$/ alt="/.*\shttps:\/\/t\.co\/[\w]{10}$/" id=/[a-zA-Z]{40,}/ href="/__cft__\[0\]=[-\w]{290,}/" href=/[a-zA-Z0-9]{100,}/

all of them should be handled as regexp without a need for escaping. New implementation does not have the escaping.

philipp-classen · 2025-06-11T15:19:08Z

packages/adblocker-extended-selectors/src/eval.ts

+        for (const attr of element.attributes) {
+          if (regex.test(attr.name)) {
+            attrName = attr.name;
+            break;


At this point, there is a match. But if there is no match, then arguably returning false would be clearer. As I understand, now it will lookup the regular expression (e.g. /foo/) as the attribute. I think, it is guaranteed to find nothing (getAttribute will return null), since it is no valid attribute key, so it is technically correct.

But the straightforward way would be to look for a matching attribute. And if there is none, immediately exit.

packages/adblocker-extended-selectors/src/eval.ts

Co-authored-by: Philipp Claßen <philipp.classen@posteo.de>

philipp-classen · 2025-06-12T11:33:59Z

packages/adblocker-extended-selectors/src/types.ts

@@ -35,7 +35,7 @@ export type PseudoClass = Base & {
  type: 'pseudo-class';
  name: string;
  argument: string | undefined;
-  subtree: AST | undefined;
+  subtree?: AST | undefined;


Suggested change

subtree?: AST | undefined;

subtree?: AST;

Also, argument?: string; above (for consistency)

reverted to old code - this one should never have been committed

remusao · 2025-06-14T13:33:06Z

packages/adblocker-extended-selectors/src/eval.ts

+        return false;
+      }
+
+      const path = globalThis.window.location.pathname;


Nit. Wondering if we're fine accessing globals from this function or if we want to provide everything through arguments. Not sure if relied on globals previously.

remusao · 2025-06-14T13:34:12Z

packages/adblocker-extended-selectors/src/eval.ts

+        return false;
+      }
+
+      const path = globalThis.window.location.pathname;


The documentation from uBO specifies that this should match on path + query but here we only consider path. See: https://github.com/gorhill/uBlock/wiki/Procedural-cosmetic-filters#subjectmatches-patharg

remusao · 2025-06-14T13:35:35Z

packages/adblocker-extended-selectors/src/eval.ts

+      const path = globalThis.window.location.pathname;
+
+      let pattern = argument;
+      if (pattern.startsWith('/') && pattern.endsWith('/')) {


I think you mentioned it wasn't used in any filters out there but in case there is any modifier like i (e.g. /foo/i) do we want to handle or maybe detect such cases in filters builder (or in unit tests)? Otherwise they would silently fail.

remusao · 2025-06-14T13:36:55Z

packages/adblocker-extended-selectors/src/eval.ts

+      }
+
+      const indexOfEqual = argument.indexOf('=');
+      let namePattern, valuePattern;


I think we need to handle cases where the name is quoted as well here.

A declaration in the form name="value" or "name"="value",

From: https://github.com/gorhill/uBlock/wiki/Procedural-cosmetic-filters#subjectmatches-attrarg

From the parsing algorithm of uBO, it looks like quotes wrapping both name and value are optional. It basically shares algorithm with other scriptlet functions, so we need to support the following forms: "literal", 'literal', /regex/, "/regexLiteral/", '/regexLiteral/'.

packages/adblocker-extended-selectors/src/eval.ts

philipp-classen · 2025-06-16T16:07:19Z

packages/adblocker-extended-selectors/src/eval.ts

+        valuePattern = argument.slice(indexOfEqual + 1);
+      }
+
+      namePattern = stripsWrappingQuotes(namePattern);


If we handle quoted keys, do we have to handle arbitrary strings like that?

"x=y"=z

(If so, we cannot split by "=" in the first step, but have to strip the quotes in the same step)

philipp-classen · 2025-06-16T16:20:52Z

packages/adblocker-extended-selectors/src/eval.ts

+    }
+
+    const distance = parseInt(argument, 10);
+    const ancestor = !Number.isNaN(distance)


Best to guard the parsing with Number.isInteger(argument) before.

Currently, "x3" will run in the NaN path, while "3x" will succeed (parsed as "3").

packages/adblocker-extended-selectors/src/eval.ts

philipp-classen · 2025-06-16T18:01:08Z

packages/adblocker-extended-selectors/src/eval.ts

+    if (ancestor === null) {
+      continue;
+    }
+    ancestors.add(ancestor);


Suggested change

if (ancestor === null) {

continue;

}

ancestors.add(ancestor);

if (ancestor !== null) {

ancestors.add(ancestor);

}

Co-authored-by: Philipp Claßen <philipp.classen@posteo.de>

philipp-classen · 2025-06-17T13:37:13Z

packages/adblocker-extended-selectors/src/eval.ts

+
+  if (after.length > 0) {
+    if (after[0].type === 'pseudo-class' && after[0].name === 'upward') {
+      return Array.from(ancestors).flatMap((a) =>


Arguably, we should also filter out duplicates here.

packages/adblocker-extended-selectors/src/eval.ts

seia-soto · 2025-06-18T03:22:57Z

packages/adblocker-extended-selectors/src/eval.ts

+    } else if (selector.name === 'upward') {
+      // :upward is handled in querySelectorAll
+      return false;


We don't need this section, we can just fall to line 77

seia-soto · 2025-06-18T03:23:49Z

packages/adblocker-extended-selectors/src/eval.ts

+  let ancestor: Element | null = element.parentElement;
+  while (ancestor !== null) {
+    if (ancestor.matches(selector)) {
+      return ancestor;


I wonder if transforming this into selector: e.g. documentElement.querySelectorAll would make sense. This way, we don't need to loop.

chrmod added 4 commits June 9, 2025 10:07

WIP: additional extended selectors

bc134f8

upward

598119a

cleanup

3879bc4

cleanup

9e86f62

chrmod changed the title ~~WIP: additional extended selectors~~ New extended selectors: matches-attr, matches-path, upward Jun 9, 2025

chrmod marked this pull request as ready for review June 9, 2025 09:27

chrmod requested a review from remusao as a code owner June 9, 2025 09:27

chrmod added the PR: New Feature 🚀 Increment minor version when merged label Jun 9, 2025

philipp-classen reviewed Jun 10, 2025

View reviewed changes

fix test

975714b

chrmod force-pushed the extended-selectors branch from d2892e2 to 975714b Compare June 11, 2025 11:00

remusao reviewed Jun 11, 2025

View reviewed changes

Improve matches-attr

45cd55d

philipp-classen reviewed Jun 11, 2025

View reviewed changes

chrmod added 2 commits June 11, 2025 21:02

upward: fix orphan candidates

176558a

matches-attr: quick exit

4b2ad41

philipp-classen reviewed Jun 12, 2025

View reviewed changes

packages/adblocker-extended-selectors/src/eval.ts Outdated Show resolved Hide resolved

philipp-classen reviewed Jun 12, 2025

View reviewed changes

packages/adblocker-extended-selectors/src/eval.ts Outdated Show resolved Hide resolved

philipp-classen reviewed Jun 12, 2025

View reviewed changes

packages/adblocker-extended-selectors/src/eval.ts Outdated Show resolved Hide resolved

Cleanup

f7b1286

philipp-classen reviewed Jun 12, 2025

View reviewed changes

packages/adblocker-extended-selectors/src/eval.ts Outdated Show resolved Hide resolved

chrmod and others added 3 commits June 12, 2025 13:00

Uncomment pending test

ea41a73

Update packages/adblocker-extended-selectors/src/eval.ts

904b362

Co-authored-by: Philipp Claßen <philipp.classen@posteo.de>

Update packages/adblocker-extended-selectors/src/eval.ts

756531b

Co-authored-by: Philipp Claßen <philipp.classen@posteo.de>

chrmod requested review from remusao and philipp-classen June 12, 2025 11:13

philipp-classen reviewed Jun 12, 2025

View reviewed changes

Cleanup

8e495a1

philipp-classen approved these changes Jun 12, 2025

View reviewed changes

remusao reviewed Jun 14, 2025

View reviewed changes

chrmod added 7 commits June 16, 2025 17:15

parse regexp

644f5ba

matches-attr: strip wrapping quotes

5613239

Some more types

83423c5

matches-attr: handle multiple arguments matching name pattern

b4f468c

Cleanup

f100b76

upward: ignore duplicates

86be409

Cleanup

47c32e5

philipp-classen reviewed Jun 16, 2025

View reviewed changes

packages/adblocker-extended-selectors/src/eval.ts Outdated Show resolved Hide resolved

philipp-classen reviewed Jun 16, 2025

View reviewed changes

packages/adblocker-extended-selectors/src/eval.ts Show resolved Hide resolved

philipp-classen self-requested a review June 16, 2025 17:38

philipp-classen reviewed Jun 16, 2025

View reviewed changes

packages/adblocker-extended-selectors/src/eval.ts Outdated Show resolved Hide resolved

philipp-classen reviewed Jun 16, 2025

View reviewed changes

chrmod and others added 2 commits June 16, 2025 21:02

Update packages/adblocker-extended-selectors/src/eval.ts

55a96b7

Co-authored-by: Philipp Claßen <philipp.classen@posteo.de>

Update packages/adblocker-extended-selectors/src/eval.ts

9220a5a

Co-authored-by: Philipp Claßen <philipp.classen@posteo.de>

philipp-classen reviewed Jun 17, 2025

View reviewed changes

packages/adblocker-extended-selectors/src/eval.ts Outdated Show resolved Hide resolved

seia-soto reviewed Jun 18, 2025

View reviewed changes

more tests

6026af5

seia-soto mentioned this pull request Jun 22, 2025

adblocker-extended-selectors: general improvements #4965

Open

chrmod mentioned this pull request Jun 23, 2025

refactor: :upward #4953

Merged

seia-soto and others added 2 commits June 23, 2025 11:02

refactor: :upward (#4953)

5b445dd

Cleanup

42f5f5a

chrmod requested a review from seia-soto June 23, 2025 09:46

seia-soto approved these changes Jun 23, 2025

View reviewed changes

philipp-classen approved these changes Jun 23, 2025

View reviewed changes

chrmod merged commit abd2894 into master Jun 23, 2025
4 checks passed

chrmod deleted the extended-selectors branch June 23, 2025 09:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

New extended selectors: matches-attr, matches-path, upward #4931

New extended selectors: matches-attr, matches-path, upward #4931

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!


		const path = globalThis.window.location.pathname;

		const pattern = argument.replace(/^\/\|\/$/g, '');

New extended selectors: matches-attr, matches-path, upward #4931

New extended selectors: matches-attr, matches-path, upward #4931

Conversation

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!