The Branding Store | Logo Design, Web Design and E-commerce specialists.| Pembroke Pines, Florida.

08 Feb

New JavaScript Features That Will Change How You Write Regex

by TBSCategories: News

New JavaScript Features That Will Change How You Write Regex

Faraz Kelhini

2019-02-08T13:00:32+01:00 2019-02-08T15:04:39+00:00

There’s a good reason the majority of programming languages support regular expressions: they are extremely powerful tools for manipulating text. Text processing tasks that require dozens of lines of code can often be accomplished with a single line of regular expression code. While the built-in functions in most languages are usually sufficient to perform search and replace operations on strings, more complex operations — such as validating text inputs — often require the use of regular expressions.

Regular expressions have been part of the JavaScript language since the third edition of the ECMAScript standard, which was introduced in 1999. ECMAScript 2018 (or ES2018 for short) is the ninth edition of the standard and further improves the text processing capability of JavaScript by introducing four new features:

Lookbehind assertions
Named capture groups
s (dotAll) Flag
Unicode property escapes

These new features are explained in detail in the subsections that follow.

Debugging JavaScript

console.log can tell you a lot about your app, but it can’t truly debug your code. For that, you need a full-fledged JavaScript debugger. Read more →

Lookbehind Assertions

The ability to match a sequence of characters based on what follows or precedes it enables you to discard potentially undesired matches. This is especially important when you need to process a large string and the chance of undesired matches is high. Fortunately, most regular expression flavors provide the lookbehind and lookahead assertions for this purpose.

Prior to ES2018, only lookahead assertions were available in JavaScript. A lookahead allows you to assert that a pattern is immediately followed by another pattern.

There are two versions of lookahead assertions: positive and negative. The syntax for a positive lookahead is (?=...). For example, the regex /Item(?= 10)/ matches Item only when it is followed, with an intervening space, by number 10:

const re = /Item(?= 10)/;  console.log(re.exec('Item')); // → null  console.log(re.exec('Item5')); // → null  console.log(re.exec('Item 5')); // → null  console.log(re.exec('Item 10')); // → ["Item", index: 0, input: "Item 10", groups: undefined]

This code uses the exec() method to search for a match in a string. If a match is found, exec() returns an array whose first element is the matched string. The index property of the array holds the index of the matched string, and the input property holds the entire string that the search performed on. Finally, if named capture groups are used in the regular expression, they are placed on the groups property. In this case, groups has a value of undefined because there is no named capture group.

The construct for a negative lookahead is (?!...). A negative lookahead asserts that a pattern is not followed by a specific pattern. For example, the pattern /Red(?!head)/ matches Red only if it not followed by head:

const re = /Red(?!head)/;  console.log(re.exec('Redhead')); // → null  console.log(re.exec('Redberry')); // → ["Red", index: 0, input: "Redberry", groups: undefined]  console.log(re.exec('Redjay')); // → ["Red", index: 0, input: "Redjay", groups: undefined]  console.log(re.exec('Red')); // → ["Red", index: 0, input: "Red", groups: undefined]

ES2018 complements lookahead assertions by bringing lookbehind assertions to JavaScript. Denoted by (?<=...), a lookbehind assertion allows you to match a pattern only if it is preceded by another pattern.

Let’s suppose you need to retrieve the price of a product in euro without capturing the euro symbol. With a lookbehind, this task becomes a lot simpler:

const re = /(?<=€)d+(.d*)?/;  console.log(re.exec('199')); // → null  console.log(re.exec('$ 199')); // → null  console.log(re.exec('€199')); // → ["199", undefined, index: 1, input: "€199", groups: undefined]

Note: Lookahead and lookbehind assertions are often referred to as “lookarounds”.

The negative version of lookbehind is denoted by (?<!...) and enables you to match a pattern that is not preceded by the pattern specified within the lookbehind. For example, the regular expression /(?<!d{3}) meters/ matches the word “meters” if three digits do not come before it:

const re = /(?<!d{3}) meters/;  console.log(re.exec('10 meters')); // → [" meters", index: 2, input: "10 meters", groups: undefined]  console.log(re.exec('100 meters'));     // → null

As with lookaheads, you can use several lookbehinds (negative or positive) in succession to create a more complex pattern. Here’s an example:

const re = /(?<=d{2})(?<!35) meters/;  console.log(re.exec('35 meters')); // → null  console.log(re.exec('meters')); // → null  console.log(re.exec('4 meters')); // → null  console.log(re.exec('14 meters')); // → ["meters", index: 2, input: "14 meters", groups: undefined]

This regex matches a string containing meters only if it is immediately preceded by any two digits other than 35. The positive lookbehind ensures that the pattern is preceded by two digits, and then the negative lookbehind ensures that the digits are not 35.

Named Capture Groups

You can group a part of a regular expression by encapsulating the characters in parentheses. This allows you to restrict alternation to a part of the pattern or apply a quantifier on the whole group. Furthermore, you can extract the matched value by parentheses for further processing.

The following code gives an example of how to find a file name with .jpg extension in a string and then extract the file name:

const re = /(w+).jpg/; const str = 'File name: cat.jpg'; const match = re.exec(str); const fileName = match[1];  // The second element in the resulting array holds the portion of the string that parentheses matched console.log(match); // → ["cat.jpg", "cat", index: 11, input: "File name: cat.jpg", groups: undefined]  console.log(fileName); // → cat

In more complex patterns, referencing a group using a number just makes the already cryptic regular expression syntax more confusing. For example, suppose you want to match a date. Since the position of day and month is swapped in some regions, it’s not clear which group refers to the month and which group refers to the day:

const re = /(d{4})-(d{2})-(d{2})/; const match = re.exec('2020-03-04');  console.log(match[0]);    // → 2020-03-04 console.log(match[1]);    // → 2020 console.log(match[2]);    // → 03 console.log(match[3]);    // → 04

ES2018’s solution to this problem is named capture groups, which use a more expressive syntax in the form of (?<name>...):

const re = /(?<year>d{4})-(?<month>d{2})-(?<day>d{2})/; const match = re.exec('2020-03-04');  console.log(match.groups);          // → {year: "2020", month: "03", day: "04"} console.log(match.groups.year);     // → 2020 console.log(match.groups.month);    // → 03 console.log(match.groups.day);      // → 04

Because the resulting object may contain a property with the same name as a named group, all named groups are defined under a separate object called groups.

A similar construct exists in many new and traditional programming languages. Python, for example, uses the (?P<name>) syntax for named groups. Not surprisingly, Perl supports named groups with syntax identical to JavaScript (JavaScript has imitated its regular expression syntax from Perl). Java also uses the same syntax as Perl.

In addition to being able to access a named group through the groups object, you can access a group using a numbered reference — similar to a regular capture group:

const re = /(?<year>d{4})-(?<month>d{2})-(?<day>d{2})/; const match = re.exec('2020-03-04');  console.log(match[0]);    // → 2020-03-04 console.log(match[1]);    // → 2020 console.log(match[2]);    // → 03 console.log(match[3]);    // → 04

The new syntax also works well with destructuring assignment:

const re = /(?<year>d{4})-(?<month>d{2})-(?<day>d{2})/; const [match, year, month, day] = re.exec('2020-03-04');  console.log(match);    // → 2020-03-04 console.log(year);     // → 2020 console.log(month);    // → 03 console.log(day);      // → 04

The groups object is always created, even if no named group exists in a regular expression:

const re = /d+/; const match = re.exec('123');  console.log('groups' in match);    // → true

If an optional named group does not participate in the match, the groups object will still have a property for that named group but the property will have a value of undefined:

const re = /d+(?<ordinal>st|nd|rd|th)?/;  let match = re.exec('2nd');  console.log('ordinal' in match.groups);    // → true console.log(match.groups.ordinal);         // → nd  match = re.exec('2');  console.log('ordinal' in match.groups);    // → true console.log(match.groups.ordinal);         // → undefined

You can refer to a regular captured group later in the pattern with a backreference in the form of . For example, the following code uses a capture group that matches two letters in a row, then recalls it later in the pattern:

console.log(/(ww)/.test('abab'));    // → true  // if the last two letters are not the same  // as the first two, the match will fail console.log(/(ww)/.test('abcd'));    // → false

To recall a named capture group later in the pattern, you can use the /k<name>/ syntax. Here is an example:

const re = /b(?<dup>w+)s+k<dup>b/;  const match = re.exec("I'm not lazy, I'm on on energy saving mode");          console.log(match.index);    // → 18 console.log(match[0]);       // → on on

This regular expression finds consecutive duplicate words in a sentence. If you prefer, you can also recall a named capture group using a numbered back reference:

const re = /b(?<dup>w+)s+b/;  const match = re.exec("I'm not lazy, I'm on on energy saving mode");          console.log(match.index);    // → 18 console.log(match[0]);       // → on on

It’s also possible to use a numbered back reference and a named backreference at the same time:

const re = /(?<digit>d)::k<digit>/;  const match = re.exec('5:5:5');          console.log(match[0]);    // → 5:5:5

Similar to numbered capture groups, named capture groups can be inserted into the replacement value of the replace() method. To do that, you will need to use the $ <name> construct. For example:

const str = 'War & Peace';  console.log(str.replace(/(War) & (Peace)/, '$ 2 & $ 1'));     // → Peace & War  console.log(str.replace(/(?<War>War) & (?<Peace>Peace)/, '$ <Peace> & $ <War>'));     // → Peace & War

If you want to use a function to perform the replacement, you can reference the named groups the same way you would reference numbered groups. The value of the first capture group will be available as the second argument to the function, and the value of the second capture group will be available as the third argument:

const str = 'War & Peace';  const result = str.replace(/(?<War>War) & (?<Peace>Peace)/, function(match, group1, group2, offset, string) {     return group2 + ' & ' + group1; });  console.log(result);    // → Peace & War

`s` (`dotAll`) Flag

By default, the dot (.) metacharacter in a regex pattern matches any character with the exception of line break characters, including line feed (n) and carriage return (r):

console.log(/./.test('n'));    // → false console.log(/./.test('r'));    // → false

Despite this shortcoming, JavaScript developers could still match all characters by using two opposite shorthand character classes like [wW], which instructs the regex engine to match a character that’s a word character (w) or a non-word character (W):

console.log(/[wW]/.test('n'));    // → true console.log(/[wW]/.test('r'));    // → true

ES2018 aims to fix this problem by introducing the s (dotAll) flag. When this flag is set, it changes the behavior of the dot (.) metacharacter to match line break characters as well:

console.log(/./s.test('n'));    // → true console.log(/./s.test('r'));    // → true

The s flag can be used on per-regex basis and thus does not break existing patterns that rely on the old behavior of the dot metacharacter. Besides JavaScript, the s flag is available in a number of other languages such as Perl and PHP.

Recommended reading: An Abridged Cartoon Introduction To WebAssembly

Unicode Property Escapes

Among the new features introduced in ES2015 was Unicode awareness. However, shorthand character classes were still unable to match Unicode characters, even if the u flag was set.

Consider the following example:

const str = '𝟠';  console.log(/d/.test(str));     // → false console.log(/d/u.test(str));    // → false

𝟠 is considered a digit, but d can only match ASCII [0-9], so the test() method returns false. Because changing the behavior of shorthand character classes would break existing regular expression patterns, it was decided to introduce a new type of escape sequence.

In ES2018, Unicode property escapes, denoted by p{...}, are available in regular expressions when the u flag is set. Now to match any Unicode number, you can simply use p{Number}, as shown below:

const str = '𝟠'; console.log(/p{Number}/u.test(str));     // → true

And to match any Unicode alphabetic character, you can use p{Alphabetic}:

const str = '漢';  console.log(/p{Alphabetic}/u.test(str));     // → true  // the w shorthand cannot match 漢 console.log(/w/u.test(str));    // → false

P{...} is the negated version of p{...} and matches any character that p{...} does not:

console.log(/P{Number}/u.test('𝟠'));    // → false console.log(/P{Number}/u.test('漢'));    // → true  console.log(/P{Alphabetic}/u.test('𝟠'));    // → true console.log(/P{Alphabetic}/u.test('漢'));    // → false

A full list of supported properties is available on the current specification proposal.

Note that using an unsupported property causes a SyntaxError:

console.log(/p{undefined}/u.test('漢'));    // → SyntaxError

Compatibility Table

Desktop Browsers

	Chrome	Firefox	Safari	Edge
Lookbehind Assertions	62	X	X	X
Named Capture Groups	64	X	11.1	X
`s` (`dotAll`) Flag	62	X	11.1	X
Unicode Property Escapes	64	X	11.1	X

Mobile Browsers

	ChromeFor Android	FirefoxFor Android	iOS Safari	Edge Mobile	Samsung Internet	Android Webview
Lookbehind Assertions	62	X	X	X	8.2	62
Named Capture Groups	64	X	11.3	X	X	64
`s` (`dotAll`) Flag	62	X	11.3	X	8.2	62
Unicode Property Escapes	64	X	11.3	X	X	64

Node.js

8.3.0 (requires --harmony runtime flag)
8.10.0 (support for s (dotAll) flag and lookbehind assertions)
10.0.0 (full support)

Wrapping Up

ES2018 continues the work of previous editions of ECMAScript by making regular expressions more useful. New features include lookbehind assertion, named capture groups, s (dotAll) flag, and Unicode property escapes. Lookbehind assertion allows you to match a pattern only if it is preceded by another pattern. Named capture groups use a more expressive syntax compared to regular capture groups. The s (dotAll) flag changes the behavior of the dot (.) metacharacter to match line break characters. Finally, Unicode property escapes provide a new type of escape sequence in regular expressions.

When building complicated patterns, it’s often helpful to use a regular-expressions tester. A good tester provides an interface to test a regular expression against a string and displays every step taken by the engine, which can be especially useful when trying to understand patterns written by others. It can also detect syntax errors that may occur within your regex pattern. Regex101 and RegexBuddy are two popular regex testers worth checking out.

Do you have some other tools to recommend? Share them in the comments!

(dm, il)

Articles on Smashing Magazine — For Web Designers And Developers

12 Jul

How to Write an Academic Article Critique

by TBSCategories: News

This is a sponsored post.

A critique of an academic article is an objective analysis of the topic discussed in the article, as to whether the author has supported his key points with applicable and reasonable arguments based on relevant facts or not. It is rather easy to get caught up while summarizing the points of the article, without actually challenging and analyzing it.

A good critique of an article will effectively demonstrate your impressions of the academic article, while providing relevant evidence to back up these impressions.

It is not an easy job to write a critique, which covers all the aspects of the article. Rather, it can be a tough job for individuals. Even if a person is able to write a critique of the mentioned article, it may not cover all points or may not be of sufficient quality.

Therefore, in order to write a quality academic article critique, it is important to follow some necessary tips.

Five of these essential tips are mentioned below, which can help you becoming a professional academic article critic:

1. Become an Active Reader

The first and foremost point in writing a critique is to become an attentive and active reader. If you lazily go through an article, you will not be able to grasp the main idea of the article. While reading the article for the first time, you should understand the meaning of each sentence thoroughly, without skipping anything.

The overall argument which the author is trying to make should be understood in the first reading. Further readings may help you in criticizing various aspects. However, if you do not understand the key argument you cannot expect to criticize it.

On the second and third reading, mark the important text as you go through the article. Ask yourself various questions like what is the main argument of the article? Who is the intended audience? Why was the research on this topic initially done? Does the author have enough evidence to support his argument? Are there any loopholes in the author’s argument?

2. Gathering Evidence

Question whether the evidence acquired by the author is logical or not. Even if the author has mentioned the respective names of individuals from whom he has taken reference from, test and analyze whether the author’s argument is practical for real world applications or not.

You must also search for any biased aspects where the author has taken a definitive side, without explaining a solid reason. Biases can be related to politics, races, gender, ethnicity or class. Bias can also include ignoring relevant evidence, and unfairly make conclusions.

Also examine whether the introduction and conclusion of the article are supporting the overall article or not, and whether they are well-written for their respective purpose or not. It is important to double check the references, which the author has referred to, and check if they are relevant or not.

3. Dig Deep

Use your educated opinions and knowledge to either disagree or support the author’s article. Perform research on related material, and dig deeper into the material to know the relevancy of the author’s article. The more you know about the author’s key main argument, the better will you be able to critique it.

Over-sourcing and too much of good evidence may become a problem, if your main arguments become repetitive. Make sure each of the source is providing something unique to your overall critique, and you are not repeating too much of the same thing. Moreover, do not allow your researched material to crown out your own arguments and opinions. While relevancy is important, people are here to listen to your own opinion and analysis of the article.

4. Format your Critique

It is very important to properly format your critique. The introduction of your critique should not be more than two paragraphs and should be able to lay out the initial framework of your critique. Start by noting where the article succeeds or fails most dramatically and reason out why. Do understand that introduction is not the proper place to provide evidence of your opinions. The evidence will be mentioned in body paragraphs.

It is also important to mention the name of the author, the article title, journal or publication the article has appeared in, publication date, and a statement about the key argument.

5. Concluding your Critique

Conclude the critique by summarizing your own argument and by suggesting potential solutions or implications. You should provide a recap of the key points, which you have gone through in the article. You must do your best to make a lasting impression on the reader’s mind in the conclusion by using assertive and straight-forward language in order to demonstrate the importance of your work.

So, do you want to write an excellent article-critique? If Yes, the above useful tips can be your benchmark in writing an article critique, no matter what type of article it is.

Onextrapixel – Web Design and Development Online Magazine

Tag: Write

New JavaScript Features That Will Change How You Write Regex

New JavaScript Features That Will Change How You Write Regex

Debugging JavaScript

Lookbehind Assertions

Named Capture Groups

`s` (`dotAll`) Flag

Unicode Property Escapes

Compatibility Table

Desktop Browsers

Mobile Browsers

Node.js

Wrapping Up

From Idea To Development: How To Write Mobile Application Requirements That Work

How to Write an Academic Article Critique

1. Become an Active Reader

2. Gathering Evidence

3. Dig Deep

4. Format your Critique

5. Concluding your Critique

Write Your Next Web App With Ember CLI

Tag: Write

New JavaScript Features That Will Change How You Write Regex

Debugging JavaScript

Lookbehind Assertions

Named Capture Groups

s (dotAll) Flag

Unicode Property Escapes

Compatibility Table

Desktop Browsers

Mobile Browsers

Node.js

Wrapping Up

From Idea To Development: How To Write Mobile Application Requirements That Work

How to Write an Academic Article Critique

1. Become an Active Reader

2. Gathering Evidence

3. Dig Deep

4. Format your Critique

5. Concluding your Critique

Write Your Next Web App With Ember CLI

`s` (`dotAll`) Flag