Match Pattern Special Characters Confusion

bienieck · ‎12-13-2024

The page Special Characters for Match Pattern, documenting the special characters for the Match Pattern function, states that square brackets [] are special characters used to define a character class. It also mentions that:

"The Match Pattern function interprets special characters inside square brackets literally, with the following exceptions: -, ~, and ^."

At the same time, the special character \ is described as:

"Cancels the interpretation of any special character in this list."

Now, my question is: Which takes precedence - cancelling with \ or defining a character class? My understanding is that if we have, for example, [~.\], this creates a character class that matches any character, including non-displayable characters, except . and \. In other words, [~.\] and [~\.] should produce the same result. However, the actual result differs.

It seems that canceling with \ has the highest priority, and \ inside [] is also treated as a special character. This would mean the list of exceptions (-, ~, and ^) is incomplete and should include \ as well.

Additionally, I believe the pattern used in Get File Extension.vi should not be \.[~\.]*$ but rather \.[~.]*$. It looks like the intention of the author was to escape .. However, the documentation explicitly states:

"Within square brackets, . is literal."

Michał Bieńkowski
CLA, CTA, CPI

Did someone devote their time to help solve your problem? Appreciate it and give kudos.
Problem solved? Accept as a solution so that others can find it faster in the future.
Contribute to the development of TestStand by voting on the TestStand Idea Exchange.

wiebe@CARYA · ‎12-16-2024

I think you're making it too difficult.

@bienieck wrote:
Now, my question is: Which takes precedence - cancelling with \ or defining a character class? My understanding is that if we have, for example, [~.\], this creates a character class that matches any character, including non-displayable characters, except . and \. In other words, [~.\] and [~\.] should produce the same result. However, the actual result differs.

Both the . and the \ are special characters.

[.\], that escapes the ']', so the reg ex isn't valid (the start '[' has no close ']'). For instance '[a\]]' tries to match 'a' and ']'.

[\.], this escapes the '.', so the reg. ex matches a '.', not 'any character'..

[.], , the '.' is a special character, but inside '[]' it doesn't need to be escaped. [.a] tries to match '.' and 'a'.

[~a], the '~' is a special character, but inside '[]' it doesn't need to be escaped. [~a] tries to match '~' and 'a'.

Search LabVIEW like a graph!

bienieck · ‎12-17-2024

If so, I'm right about unnecessary escaping in the mentioned VI and the incomplete documentation.

Michał Bieńkowski
CLA, CTA, CPI

Did someone devote their time to help solve your problem? Appreciate it and give kudos.
Problem solved? Accept as a solution so that others can find it faster in the future.
Contribute to the development of TestStand by voting on the TestStand Idea Exchange.

bienieck · ‎12-17-2024

wiebe@CARYA wrote:

[~a] tries to match '~' and 'a'.

I think you are not correct here.

Michał Bieńkowski
CLA, CTA, CPI

Did someone devote their time to help solve your problem? Appreciate it and give kudos.
Problem solved? Accept as a solution so that others can find it faster in the future.
Contribute to the development of TestStand by voting on the TestStand Idea Exchange.

wiebe@CARYA · ‎12-17-2024

@bienieck wrote:

wiebe@CARYA wrote:

[~a] tries to match '~' and 'a'.

I think you are not correct here.

I looked at Match Regular Expression. Match Pattern could indeed be different.

I hardly use Match Pattern anymore, as Match Regular Expression is more powerful, and much better documented (as PCRE isn't limited to NI).

Search LabVIEW like a graph!

bienieck · ‎12-17-2024

I switched to Match Pattern for simple patterns after watching some presentation from Darren N. Allegedly this function is more efficient.

Michał Bieńkowski
CLA, CTA, CPI

Did someone devote their time to help solve your problem? Appreciate it and give kudos.
Problem solved? Accept as a solution so that others can find it faster in the future.
Contribute to the development of TestStand by voting on the TestStand Idea Exchange.

wiebe@CARYA · ‎12-17-2024

@bienieck wrote:

I switched to Match Pattern for simple patterns after watching some presentation from Darren N. Allegedly this function is more efficient.

For the CPU: probably.

For the programmer: not always.

A huge performance difference is caused by the difference in resulting string: Match Pattern returns SubStrings, Match Regular Expression returns Strings... Seems to me Match Regular Expression (an XNode) could be changed so it returns SubStrings too...

Search LabVIEW like a graph!

Martin_Henz · ‎12-17-2024

A huge performance difference is caused by the difference in resulting string: Match Pattern returns SubStrings, Match Regular Expression returns Strings... Seems to me Match Regular Expression (an XNode) could be changed so it returns SubStrings too...

I'm with you - in most cases. Speed also depends on much less features of Match Pattern. Think of backward references, named substrings, greedy/ungreedy, atomic grouping, possessive quantifiers and much more.

I don't like Match Regular Expression because I cannot specify the substring I need. The underlaying call library node is known, so it is possible to implement that. But there is actually no way for named substrings.

Yamaeda · ‎12-18-2024

wiebe@CARYA wrote:

@bienieck wrote:

I switched to Match Pattern for simple patterns after watching some presentation from Darren N. Allegedly this function is more efficient.

For the CPU: probably.

For the programmer: not always.

A huge performance difference is caused by the difference in resulting string: Match Pattern returns SubStrings, Match Regular Expression returns Strings... Seems to me Match Regular Expression (an XNode) could be changed so it returns SubStrings too...

If i remember Darrens video correctly one of the arguments against Match Regular Expression was that it was an Xnode and could cause issues when compiling, apart from being noticably slower on simple tasks. Often the last part is irrelevant, but it can be good to know.

G# - Award winning reference based OOP for LV, for free! - Qestit VIPM GitHub

Qestit Systems

wiebe@CARYA · ‎12-18-2024

@Yamaeda wrote:

wiebe@CARYA wrote:

@bienieck wrote:

I switched to Match Pattern for simple patterns after watching some presentation from Darren N. Allegedly this function is more efficient.

For the CPU: probably.

For the programmer: not always.

A huge performance difference is caused by the difference in resulting string: Match Pattern returns SubStrings, Match Regular Expression returns Strings... Seems to me Match Regular Expression (an XNode) could be changed so it returns SubStrings too...

If i remember Darrens video correctly one of the arguments against Match Regular Expression was that it was an Xnode and could cause issues when compiling, apart from being noticably slower on simple tasks. Often the last part is irrelevant, but it can be good to know.

It doesn't need to be an XNode.

Sometimes the XNode isn't convenient at all: Add Match Regular Expression sub VI - NI Community

Search LabVIEW like a graph!

LabVIEW

Match Pattern Special Characters Confusion

Match Pattern Special Characters Confusion

Re: Match Pattern Special Characters Confusion

Re: Match Pattern Special Characters Confusion

Re: Match Pattern Special Characters Confusion

Re: Match Pattern Special Characters Confusion

Re: Match Pattern Special Characters Confusion

Re: Match Pattern Special Characters Confusion

Re: Match Pattern Special Characters Confusion

Re: Match Pattern Special Characters Confusion

Re: Match Pattern Special Characters Confusion