Optimize and improve parameter conversion in OCI8 8000 Statement #2495

morozov · 2016-09-02T17:22:52Z

Ocramius · 2016-09-07T20:39:06Z

lib/Doctrine/DBAL/Driver/OCI8/OCI8Statement.php

+
+        do {
+            if (!$quote) {
+                $result = $capture('/[?\'"]/', function ($token) use (


Please move the closure to a private method, and give it a specific name

morozov · 2016-09-12T23:47:49Z

Ocramius

Ocramius · 2017-01-14T22:30:39Z

lib/Doctrine/DBAL/Driver/OCI8/OCI8Statement.php

+     */
+    private static function findPlaceholderOrOpeningQuote(
+        $statement,
+        &$tokenOffset,


Having by-ref parameters is not something I'd want to maintain/debug later on. I see why it's made, but is there a way to put them together instead? Can we create an object that is passed down instead?

I think we can move all the parsing logic (not only the by-ref variables) into a different class. It might look like:

$foo = new Foo($sql); $newSql = $foo->getSql(); $paramMap = $foo->getParameterMap();

But I don't like that it only does get*() but doesn't do do*(). Maybe it could be a proxy (or sub-) class which would translate all messages with positional parameters (__construct(), bind*()) to the other object which only understands the named ones?

Ocramius · 2017-01-14T22:31:33Z

lib/Doctrine/DBAL/Driver/OCI8/OCI8Statement.php

+            return false;
+        }
+
+        if ($token == '?') {


strict comparison please

Ocramius · 2017-01-14T22:32:08Z

lib/Doctrine/DBAL/Driver/OCI8/OCI8Statement.php

+     * @param int $fragmentOffset The offset to build the next fragment from
+     * @param array $fragments Resulting query fragments
+     * @param string|null $currentLiteralDelimiter Current literal delimiter
+     * @param array $paramMap Parameter map


A bit unclear what kind of map this is. Consider using a syntax such as string[] for the type, and explaining what key and value would be

If I understand it correctly, this is an array<int, string>, where the keys are numeric (positional), and the strings are the parameter names

Yes, it's array<int, string>.

Ocramius · 2017-01-14T22:32:57Z

lib/Doctrine/DBAL/Driver/OCI8/OCI8Statement.php

+            $fragments[] = substr($statement, $fragmentOffset, $tokenOffset - $fragmentOffset);
+            $fragments[] = $param;
+            $paramMap[$position] = $param;
+            $fragmentOffset = ++$tokenOffset;


Split this into two statements: $tokenOffset += 1; $fragmentOffset = $tokenOffset;

Ocramius · 2017-01-14T22:34:27Z

lib/Doctrine/DBAL/Driver/OCI8/OCI8Statement.php

+        }
+
+        if ($token == '?') {
+            $position = count($paramMap) + 1;


Counting shouldn't be repeated: if the parameters of this method are refectored into a tiny object, this can be moved out.

Ocramius · 2017-01-14T22:36:11Z

lib/Doctrine/DBAL/Driver/OCI8/OCI8Statement.php

+        $currentLiteralDelimiter = null;
+
+        do {
+            if (!$currentLiteralDelimiter) {


If I understand the code correctly, you are currently looping over the structure, and adding to the $paramMap as you go, checking for the " starting and closing delimiters. Is that correct?

How does this logic currently work with multiline literals?

Yes. It works with multiline literals same as with others (I'll add tests for that). In regexes, we don't search for any character classes which do or do not match the newline characters. We only search for ", ' and ?.

Ocramius · 2017-01-14T22:37:00Z

lib/Doctrine/DBAL/Driver/OCI8/OCI8Statement.php

+     * @param string $tokenOffset The offset to start searching from
+     * @param int $fragmentOffset The offset to build the next fragment from
+     * @param array $fragments Resulting query fragments
+     * @param string|null $currentLiteralDelimiter Current literal delimiter


Description is a bit unclear/redundant

Ocramius · 2017-01-14T22:37:16Z

lib/Doctrine/DBAL/Driver/OCI8/OCI8Statement.php

+     * @param string $statement The SQL statement to parse
+     * @param string $tokenOffset The offset to start searching from
+     * @param int $fragmentOffset The offset to build the next fragment from
+     * @param array $fragments Resulting query fragments


Description is a bit unclear/redundant - what does determine a fragment?

A fragment is a part of the original statement not containing placeholders. When re-constructing the query, fragments are joined back together with new parameters between them. For example, for "SELECT * FROM users WHERE id = ? OR name = ?", the fragments are "SELECT * FROM users WHERE id = ", " OR name = " and "".

Probably, this much implementation details in a type description means there should be a dedicated type for that.

Ocramius · 2017-01-14T22:37:33Z

lib/Doctrine/DBAL/Driver/OCI8/OCI8Statement.php

+            $fragments[] = $param;
+            $paramMap[$position] = $param;
+            $fragmentOffset = ++$tokenOffset;
+        } else {


Return early instead of using else

Ocramius · 2017-01-14T22:38:09Z

lib/Doctrine/DBAL/Driver/OCI8/OCI8Statement.php

+        &$tokenOffset,
+        &$currentLiteralDelimiter
+    ) {
+        $token = self::findToken($statement, $tokenOffset, '/(?<!\\\\)' . $currentLiteralDelimiter . '/');


$currentLiteralDelimiter should be passed to preg_quote() here

Also, I don't really understand the ?<! sequence here: can you clarify it? (couldn't make it work on JS regex parsing tools)

Probably need to have a load of security-sensitive string escape sequences to test against here. As good as your intentions are, this is reeeeeally playing around with fire

It's a negative lookbehind: we're searching for the current literal delimiter which is not preceded by a backslash (i.e. not escaped). The backslash is escaped twice: once to avoid special meaning of the backslash a PHP string and once to avoid its special meaning in the regex.

Ocramius · 2017-05-18T13:42:56Z

morozov · 2017-05-18T18:16:06Z

morozov · 2017-05-25T02:25:54Z

Ocramius

Ocramius reviewed Sep 7, 2016
View reviewed changes

deeky666 requested a review from Ocramius January 14, 2017 21:08

deeky666 assigned Ocramius Jan 14, 2017

deeky666 added Drivers Improvement Oracle SQL Parser oci8 Prepared Statements labels Jan 14, 2017

Ocramius suggested changes Jan 14, 2017

View reviewed changes

Ocramius removed their assignment Jan 14, 2017

Ocramius added the Missing Tests label Jan 14, 2017

morozov added 5 commits May 16, 2017 15:29

Optimize parameter conversion for Oracle. Handle quotes inside literals.

724a662

Refactored parsing

4863bd2

Added some more code coverage, removed unnecessary condition

c1d2c6a

More tests

3b55e2b

Addressed some concerns

9c3afca

morozov force-pushed the optimize-oci8-convert branch from 396bc66 to 9c3afca Compare May 17, 2017 00:09

[DBAL-2495] Optimize and improve parameter conversion in OCI8Statement

8cbf648

1. Reworked the parser syntax to use proper Oracle escaping syntax as '' instead of \'. 2. Moved valid tests to the Functional section to ensure correct SQL syntax and logic.

Ocramius removed the Missing Tests label Jun 1, 2017

Ocramius self-assigned this Jun 1, 2017

Ocramius added this to the 2.6 milestone Jun 1, 2017

Ocramius approved these changes Jun 1, 2017

View reviewed changes

Ocramius merged commit ec3e510 into doctrine:master Jun 1, 2017

morozov deleted the optimize-oci8-convert branch June 1, 2017 18:36

morozov mentioned this pull request Jan 8, 2018

Remove hard dependency on PDO #2958

Merged

morozov mentioned this pull request Feb 7, 2020

The JSON storage is slow and unnecessarily CPU-consuming php-vcr/php-vcr#296

Closed

github-actions bot locked as resolved and limited conversation to collaborators Aug 18, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Optimize and improve parameter conversion in OCI8 8000 Statement #2495

Optimize and improve parameter conversion in OCI8Statement #2495

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Optimize and improve parameter conversion in OCI8 8000 Statement #2495

Optimize and improve parameter conversion in OCI8Statement #2495

Uh oh!

Conversation

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!