Regex brain failure...

Kaveh Bazargan kaveh at rivervalley.io
Sat Feb 3 19:04:18 EST 2024


For testing regex you might find it useful to use regex101. It's excellent
and you can save the page. I put your text here
<https://regex101.com/r/OwsGnl/1> just for testing. pls note your tabs are
corrupted in the email and I put an "a" to make it work just for test.

On Sat, 3 Feb 2024 at 21:13, Paul Dupuis via use-livecode <
use-livecode at lists.runrev.com> wrote:

> Never mind.
>
> The correct pattern is: ^\d+?\t.\tnontraditional
> field\tText\t2,319\tInterview 1\.txt$
>
> There is a column with a space in it between the number column (1st
> column) and the 3rd column (which I thought was the 2nd column) that has
> the code name in it (ie. nontradtional field). Now to figure out why
> that is!
>
> On 2/3/2024 1:36 PM, Paul Dupuis via use-livecode wrote:
> > I have a (reduced) example set of data in a variable "tCaseCodes" that
> > is tab delimited set of lines below:
> >
> > 1         I am making a high salary    Text    2,319    Interview 1.txt
> > 2         nontraditional field    Text    2,319    Interview 1.txt
> > 3         gets married and stays married    Text    453,561  Interview
> > 1.txt
> > 4         wants kids    Text    927,1009    Interview 1.txt
> > 5         leaves work when kids born doesn't return    Text
> >  1012,1609    Interview 1.txt
> > 6         takes major responsibility for family work    Text
> >  1012,1609    Interview 1.txt
> >
> > I have a Regex pattern in the variable "tCodeToMatch" shown below:
> >
> > ^\d+\tnontraditional field\tText\t2,319\tInterview 1.txt$
> >
> > I am executing the line of livecode script:
> >
> > filter lines of tCaseCodes with regex tCodeToMatch into tDuplicates
> >
> > The variable tDuplicates should then contain:
> >
> > 2         nontraditional field    Text    2,319    Interview 1.txt
> >
> > But is instead, empty.
> >
> > Clearly, I must have made a Regex pattern mistake, but I am not seeing
> > it. It is ^(start of line) \d+(any number of digits) \t(tab)
> > nontraditional field  \t(tab) Text  \t(tab) 2,319  \t(tab) Interview
> > 1.txt $(end of line)
> >
> > I thought that the period in the file name (Interview 1.txt) may have
> > been an issue as period is a reserved regex character to match a
> > single character. However, I get the same empty result if I escape the
> > period, so it must be something else. I believe \d+ gets me an integer
> > as the number in this column could be several digits long.
> >
> > A second set of regex eyes would be appreciated.
> >
> > _______________________________________________
> > use-livecode mailing list
> > use-livecode at lists.runrev.com
> > Please visit this url to subscribe, unsubscribe and manage your
> > subscription preferences:
> > http://lists.runrev.com/mailman/listinfo/use-livecode
>
>
> _______________________________________________
> use-livecode mailing list
> use-livecode at lists.runrev.com
> Please visit this url to subscribe, unsubscribe and manage your
> subscription preferences:
> http://lists.runrev.com/mailman/listinfo/use-livecode
>


-- 
Kaveh Bazargan PhD
Director
River Valley Technologies <http://rivervalley.io> ● Twitter
<https://twitter.com/rivervalley1000> ● LinkedIn
<https://www.linkedin.com/in/bazargankaveh/> ● ORCID
<https://orcid.org/0000-0002-1414-9098> ● @kaveh1000 at mastodon.social
<https://mastodon.social/@kaveh1000>
*Accelerating the Communication of Research*

*
<https://www.linkedin.com/posts/bazargankaveh_ismte-innovation-award-recipient-kaveh-bazargan-activity-7039348552526921728-XAEB/?utm_source=share&utm_medium=member_desktop>
 [image:
https://rivervalley.io/gigabyte-wins-the-alpsp-scholarly-publishing-innovation-award-using-river-valleys-publishing-technology/]
<https://rivervalley.io/gigabyte-wins-the-alpsp-scholarly-publishing-innovation-award-using-river-valleys-publishing-technology/>*


More information about the use-livecode mailing list