Improve using `//` in XPath performance #249

naitoh · 2025-04-26T09:53:56Z

When using // in XPath, the deeper the tag hierarchy, the slower it becomes due to the namespace acquisition process.
Caching namespace information improves performance when using // with XPath.

Benchmark (Comparison with rexml 3.4.1)

$ benchmark-driver benchmark/xpath.yaml
Calculating -------------------------------------
                                                     rexml 3.4.1      master  3.4.1(YJIT)  master(YJIT)
REXML::XPath.match(REXML::Document.new(xml), 'a//a')      29.215     234.909      108.945       492.410 i/s -     100.000 times in 3.422925s 0.425697s 0.917898s 0.203083s

Comparison:
             REXML::XPath.match(REXML::Document.new(xml), 'a//a')
                                        master(YJIT):       492.4 i/s
                                              master:       234.9 i/s - 2.10x  slower
                                         3.4.1(YJIT):       108.9 i/s - 4.52x  slower
                                         rexml 3.4.1:        29.2 i/s - 16.85x  slower

YJIT=ON : 4.52x faster
YJIT=OFF : 8.04x faster

lib/rexml/element.rb

tompng · 2025-04-27T10:12:21Z

I have some concerns about maintainability because cache invalidation bug is hard to debug.
Namespace can be modified through add_attribute/delete_attribute and through attr_reader :attributes. It can bypass cache invalidation mechanism.

Here are some ideas of cache strategy.

Use a temporary cache and pass it as a method parameter

def node_test
  namespaces_cache = {}
  # Cache scope is enclosed in this method.
  # There is no need to worry about cache invalidation.
  call node.namespaces(namespaces_cache) to use the cache
end

def namespaces(cache = nil)
  if cache
    cache[self] ||= calculate_namespaces(cache)
  else
    calculate_namespaces
  end
end

Automatically invalidate cache with immutable structure

def namespaces
  parent_namespaces = parent&.namespaces
  # If parent.namespaces or @attributes is changed, cache is automatically invalidated
  return @namespaces if parent_namespaces.equal?(@parent_namespaces) && @attrubutes.frozen?

  @attributes.freeze # Need to unfreeze before modifying @attributes
  @parent_namespaces = parent_namespaces
  @namespaces = @parent_namespaces.merge(@attributes).freeze
end

lib/rexml/element.rb

naitoh · 2025-04-27T22:41:02Z

I have some concerns about maintainability because cache invalidation bug is hard to debug. Namespace can be modified through add_attribute/delete_attribute and through attr_reader :attributes. It can bypass cache invalidation mechanism.

Here are some ideas of cache strategy.
1. Use a temporary cache and pass it as a method parameter
def node_test
  namespaces_cache = {}
  # Cache scope is enclosed in this method.
  # There is no need to worry about cache invalidation.
  call node.namespaces(namespaces_cache) to use the cache
end

def namespaces(cache = nil)
  if cache
    cache[self] ||= calculate_namespaces(cache)
  else
    calculate_namespaces
  end
end

For maintainability, I have changed my mind about adopting policy 1 above.
Thanks!

kou · 2025-04-28T01:35:49Z

I have a bit concern about adding a cache (positional) argument to only some public namespace related APIs.

I feel that it may a bit ad-hoc approach. I'm not object this approach but can we improve our API?

For example, how about keeping a cache per document?

class XPathParser
  def parse path, nodeset
    path_stack = @parser.parse( path )
    nodeset.first.document.enable_cache do
      # New document level cache is created and available in this block.
      # This API is thread unsafe. Users can't change this document in this block.
      match( path_stack, nodeset )
    end
  end
end

naitoh · 2025-04-28T07:57:19Z

I add Document#enable_namespaces_cache method to control whether caching is enabled or disabled.
What do you think?

lib/rexml/element.rb

lib/rexml/document.rb

lib/rexml/xpath_parser.rb

naitoh · 2025-04-29T00:06:15Z

I have corrected the points you pointed out.
What do you think?

lib/rexml/document.rb

lib/rexml/xpath_parser.rb

naitoh · 2025-04-30T14:33:39Z

I have corrected the points you pointed out.
What do you think?

naitoh · 2025-05-03T07:23:31Z

I have corrected the points you pointed out.
What do you think?

kou · 2025-05-03T08:18:11Z

Let's revisit this after #252. This implementation will be changed by #252.

#252) `XPath.match`, `XPath.first`, `XPath.each`, `XPathParser#parse` and `XPathParser#match` accepted nodeset as element. This pull request changes the first parameter of these method to be an element instead of nodeset. Passing nodeset will be deprecated. ```ruby # Documented usage. OK REXML::XPath.match(element, xpath) # Undocumented usage. Deprecate in this pull request nodeset = [element] REXML::XPath.match(nodeset, xpath) ``` ### Background #249 will introduce a temporary cache. ```ruby def parse path, nodeset path_stack = @parser.parse( path ) nodeset.first.document.send(:enable_cache) do match( path_stack, nodeset ) end end ``` But the signature `XPathParser#match(path, nodeset)` does not guarantee that all nodes in the nodeset has the same root document. So cache does not work in the code below. It's still slow. ```ruby REXML::XPath.match(2.times.map { REXML::Document.new('<a>'*400+'</a>'*400) }, 'a//a') ``` The interface is holding our back, so I propose to drop accepting array as element. This change is a backward incompatibility, but it just drops undocumented feature. I think only the test code was unintentionally using this feature. ### XPath.match with array XPath.match only traverse the first element of the array for some selectors. ```ruby nodeset = [REXML::Document.new("<a><b/></a>"), REXML::Document.new("<a><c/></a>")] REXML::XPath.match(nodeset, "a/*") #=> [<b/>, <c/>] REXML::XPath.match(nodeset, "//a/*") #=> [<b/>] # I expect [<b/>, <c/>] but the second document is ignored ``` It indicates that `XPath.match` is not designed to search inside multiple nodes/documents. --------- Co-authored-by: Sutou Kouhei <kou@cozmixng.org>

naitoh · 2025-05-07T20:13:58Z

#252 merged, so I rebased this PR and updated.
What do you think?

lib/rexml/attribute.rb

lib/rexml/document.rb

lib/rexml/element.rb

lib/rexml/xpath_parser.rb

naitoh · 2025-05-08T15:27:20Z

I have corrected the points you pointed out.
What do you think?

test/xpath/test_base.rb

naitoh · 2025-05-08T21:37:52Z

I have corrected the points you pointed out.
What do you think?

test/test_core.rb

lib/rexml/xpath_parser.rb

naitoh · 2025-05-11T13:42:57Z

I have corrected the points you pointed out.
What do you think?

test/xpath/test_base.rb

## Why? for fix get_namespace performance > # FIXME: This DOUBLES the time XPath searches take Co-authored-by: tomoya ishida <tomoyapenguin@gmail.com> Co-authored-by: Sutou Kouhei <kou@clear-code.com>

## Benchmark (Comparison with rexml 3.4.1) ``` $ benchmark-driver benchmark/xpath.yaml Calculating ------------------------------------- rexml 3.4.1 master 3.4.1(YJIT) master(YJIT) REXML::XPath.match(REXML::Document.new(xml), 'a//a') 29.215 234.909 108.945 492.410 i/s - 100.000 times in 3.422925s 0.425697s 0.917898s 0.203083s Comparison: REXML::XPath.match(REXML::Document.new(xml), 'a//a') master(YJIT): 492.4 i/s master: 234.9 i/s - 2.10x slower 3.4.1(YJIT): 108.9 i/s - 4.52x slower rexml 3.4.1: 29.2 i/s - 16.85x slower ``` - YJIT=ON : 4.52x faster - YJIT=OFF : 8.04x faster

naitoh · 2025-05-11T21:34:04Z

I have corrected the points you pointed out.
What do you think?

kou · 2025-05-12T01:22:16Z

Thanks.

naitoh marked this pull request as ready for review April 26, 2025 10:00

naitoh requested a review from kou April 26, 2025 10:00

tompng reviewed Apr 26, 2025

View reviewed changes

lib/rexml/element.rb Outdated Show resolved Hide resolved

tompng reviewed Apr 26, 2025

View reviewed changes

lib/rexml/element.rb Outdated Show resolved Hide resolved

naitoh force-pushed the fix_get_namespace_performance branch from 5bf858e to 119ed65 Compare April 27, 2025 01:11

naitoh requested a review from tompng April 27, 2025 01:23

tompng reviewed Apr 27, 2025

View reviewed changes

lib/rexml/element.rb Outdated Show resolved Hide resolved

kou reviewed Apr 27, 2025

View reviewed changes

lib/rexml/element.rb Outdated Show resolved Hide resolved

naitoh force-pushed the fix_get_namespace_performance branch from 119ed65 to 34cca4c Compare April 27, 2025 22:23

naitoh requested review from tompng and kou April 27, 2025 22:42

naitoh force-pushed the fix_get_namespace_performance branch from 34cca4c to cea9bb8 Compare April 28, 2025 07:46

kou reviewed Apr 28, 2025

View reviewed changes

naitoh force-pushed the fix_get_namespace_performance branch from cea9bb8 to ffe6f9a Compare April 28, 2025 11:00

naitoh requested a review from kou April 28, 2025 11:16

kou reviewed Apr 30, 2025

View reviewed changes

lib/rexml/document.rb Outdated Show resolved Hide resolved

lib/rexml/xpath_parser.rb Outdated Show resolved Hide resolved

naitoh force-pushed the fix_get_namespace_performance branch from ffe6f9a to 24cfc1b Compare April 30, 2025 14:26

naitoh requested a review from kou April 30, 2025 14:33

tompng mentioned this pull request Apr 30, 2025

Deprecate accepting array as an element in XPath.match, first and each #252

Merged

naitoh force-pushed the fix_get_namespace_performance branch 3 times, most recently from ff07142 to acf7ecf Compare May 3, 2025 07:17

naitoh force-pushed the fix_get_namespace_performance branch from acf7ecf to 7279481 Compare May 7, 2025 20:05

kou reviewed May 8, 2025

View reviewed changes

naitoh force-pushed the fix_get_namespace_performance branch from 7279481 to 88ec36f Compare May 8, 2025 15:14

naitoh requested a review from kou May 8, 2025 15:27

tompng reviewed May 8, 2025

View reviewed changes

test/xpath/test_base.rb Outdated Show resolved Hide resolved

naitoh force-pushed the fix_get_namespace_performance branch from 88ec36f to 1f67fbd Compare May 8, 2025 21:31

naitoh requested a review from tompng May 8, 2025 21:37

kou reviewed May 9, 2025

View reviewed changes

test/test_core.rb Outdated Show resolved Hide resolved

kou reviewed May 9, 2025

View reviewed changes

lib/rexml/xpath_parser.rb Show resolved Hide resolved

naitoh force-pushed the fix_get_namespace_performance branch from 1f67fbd to b28407e Compare May 11, 2025 13:19

naitoh requested a review from kou May 11, 2025 13:42

kou reviewed May 11, 2025

View reviewed changes

test/xpath/test_base.rb Outdated Show resolved Hide resolved

test/xpath/test_base.rb Outdated Show resolved Hide resolved

test/xpath/test_base.rb Outdated Show resolved Hide resolved

naitoh and others added 3 commits May 12, 2025 06:12

Element#namespace test add

5669ad3

Optimization of Element#namespace and Element#namespaces with cache

6d2cf70

## Why? for fix get_namespace performance > # FIXME: This DOUBLES the time XPath searches take Co-authored-by: tomoya ishida <tomoyapenguin@gmail.com> Co-authored-by: Sutou Kouhei <kou@clear-code.com>

naitoh force-pushed the fix_get_namespace_performance branch from b28407e to 6c573e0 Compare May 11, 2025 21:17

naitoh requested a review from kou May 11, 2025 21:33

kou merged commit e80ffdd into ruby:master May 12, 2025
66 of 67 checks passed

naitoh deleted the fix_get_namespace_performance branch May 12, 2025 02:13

Improve using // in XPath performance #249

Improve using // in XPath performance #249

Uh oh!

Conversation

Uh oh!

Benchmark (Comparison with rexml 3.4.1)

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Improve using `//` in XPath performance #249

Improve using `//` in XPath performance #249