Channel: Hot Weekly Questions - Web Applications Stack Exchange

↧

Using UrlFetchApp.fetch(url) with regex to extract website data

February 20, 2021, 9:41 am

≫ Next: I cant find POST in my blog in Google Blogger

≪ Previous: How to disable SafeLinks for NOT Outlook.com accounts?

I'm trying to extract data from a list of >1000 URLs using a script that uses UrlFetchApp.fetch(url) and regex based on this article.

This is the code I'm using.

function importRegex(url, regex_string) {  var html, content = '';  var response = UrlFetchApp.fetch(url);  if (response) {    html = response.getContentText();    if (html.length && regex_string.length) {      var regex = new RegExp( regex_string, "i" );      content = html.match(regex)[1];    }  }  content = unescapeHTML(content);  Utilities.sleep(1000); // avoid call limit by adding a delay  return content;  }var htmlEntities = {  cent:  '¢',  pound: '£',  yen:   '¥',  euro:  '€',  copy:  '©',  reg:   '®',  lt:    '<',  gt:    '>',  mdash: '–',  quot:  '"',  amp:   '&',  apos:  '\''};function unescapeHTML(str) {    return str.replace(/\&([^;]+);/g, function (entity, entityCode) {        var match;        if (entityCode in htmlEntities) {            return htmlEntities[entityCode];        } else if (match = entityCode.match(/^#x([\da-fA-F]+)$/)) {            return String.fromCharCode(parseInt(match[1], 16));        } else if (match = entityCode.match(/^#(\d+)$/)) {            return String.fromCharCode(~~match[1]);        } else {            return entity;        }    });};

and the importregex function formula I'm using is

=importRegex(A4, "<h1 class=""ch-title"".*?>(.*)<\/h1>")

It gives the following error

TypeError: Cannot read property '1' of null (line 9).

I'm not sure how to fix it.

↧

Latest Images

7 clever tricks Primark does to keep you walking & buying more than you need...

7 clever tricks Primark does to keep you walking & buying more than you need...

July 20, 2025, 5:14 am

Art for Everyone! Autism advocacy, local stories, and indigenous pride in one...

Art for Everyone! Autism advocacy, local stories, and indigenous pride in one...

July 20, 2025, 5:06 am

Paintings of English Downs 2

Paintings of English Downs 2

July 20, 2025, 4:30 am

How Kerala Women Rescued a Dying Forest and Turned It Into a Safe Haven for...

How Kerala Women Rescued a Dying Forest and Turned It Into a Safe Haven for...

July 20, 2025, 3:30 am

Met Eireann warns of heavy rain & spot flooding for DAYS before big...

Met Eireann warns of heavy rain & spot flooding for DAYS before big...

July 20, 2025, 1:14 am

Who is Kevin Lerena’s wife Geraldine?

Who is Kevin Lerena’s wife Geraldine?

July 20, 2025, 12:57 am

Man stabs woman, baby to death inside Queens home, police say

Man stabs woman, baby to death inside Queens home, police say

July 19, 2025, 11:00 pm

Ang papel ni whistleblower Julie Patidongan sa kaso ng mga nawawalang sabungero

Ang papel ni whistleblower Julie Patidongan sa kaso ng mga nawawalang sabungero

July 19, 2025, 9:45 pm

Telangana Human Rights Commission (TGHRC) seeks report from revenue dept on...

Telangana Human Rights Commission (TGHRC) seeks report from revenue dept on...

July 19, 2025, 7:29 pm

Crisis-hit NHS fat cats raking in MASSIVE salaries as frontline services cry...

Crisis-hit NHS fat cats raking in MASSIVE salaries as frontline services cry...

July 19, 2025, 2:11 pm

Trending Articles

मतलबी दोस्त स्टेट्स | Matlabi Dost Status in Hindi – Selfish Friends Status

February 13, 2020, 3:12 am

CCS University Result 2017-18 BA B.Com B.Sc CCSU Meerut Result

October 11, 2017, 12:20 am

Addison Rae – Headphones On – Single [iTunes Plus M4A]

April 17, 2025, 6:08 am

NCERT Solutions for Class 9th Sanskrit Chapter 3 पाथेयम्

December 22, 2016, 3:50 am

Rachel Infiesta Arrested by Miami-Dade County Corrections on May 29, 2020

May 29, 2020, 12:00 am

Jail for Brockworth 26-year-old who gambled away thousands of pounds of...

December 13, 2014, 4:18 am

IMGPatcher 2.20 (Oct 15 2019)

January 7, 2020, 1:40 am

[RELEASE THREAD]--_A-Team_--Cricket_Dream_5G

September 25, 2022, 7:14 pm

Nalgonda District Police Office Mobile Numbers List in Telangana State

May 29, 2017, 8:30 pm

SANIDAPA LIVE IN GADAMBUWANA 2017

September 3, 2021, 5:56 pm

Practice Sheet of Right form of verbs for HSC Students

September 22, 2019, 11:40 pm

Students hit streets to save Agriculture College land in city

October 13, 2018, 2:20 am

Download: Juvenile ft T-Sean – Shake Yuh Body (Insansa Niweka)

June 23, 2017, 10:50 am

Blacktown Workers Club Limited v Blacktown Workers Basketball Association...

May 31, 2017, 4:55 am

Social Worker at States of Jersey

October 20, 2020, 5:00 am

Love (2015).H264.Italian.English.Ac3.5.1.multisub.iCV-MIRCrew Seed (62)/Leech...

September 14, 2017, 10:49 am

CalCen

June 4, 2020, 6:35 pm

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

August 20, 2016, 5:13 pm

It’s Kind of a Funny Story 2010 Dual Audio 720p BRRip [Hindi – English] ESubs

June 8, 2016, 6:15 am

Waves Complete v2019.02.14 Incl Emulator-R2R

February 16, 2019, 7:50 am

Latest Images

7 clever tricks Primark does to keep you walking & buying more than you need...

7 clever tricks Primark does to keep you walking & buying more than you need...

July 20, 2025, 5:14 am

Art for Everyone! Autism advocacy, local stories, and indigenous pride in one...

Art for Everyone! Autism advocacy, local stories, and indigenous pride in one...

July 20, 2025, 5:06 am

Paintings of English Downs 2

Paintings of English Downs 2

July 20, 2025, 4:30 am

How Kerala Women Rescued a Dying Forest and Turned It Into a Safe Haven for...

How Kerala Women Rescued a Dying Forest and Turned It Into a Safe Haven for...

July 20, 2025, 3:30 am

Met Eireann warns of heavy rain & spot flooding for DAYS before big...

Met Eireann warns of heavy rain & spot flooding for DAYS before big...

July 20, 2025, 1:14 am

Who is Kevin Lerena’s wife Geraldine?

Who is Kevin Lerena’s wife Geraldine?

July 20, 2025, 12:57 am

Man stabs woman, baby to death inside Queens home, police say

Man stabs woman, baby to death inside Queens home, police say

July 19, 2025, 11:00 pm

Ang papel ni whistleblower Julie Patidongan sa kaso ng mga nawawalang sabungero

Ang papel ni whistleblower Julie Patidongan sa kaso ng mga nawawalang sabungero

July 19, 2025, 9:45 pm

Telangana Human Rights Commission (TGHRC) seeks report from revenue dept on...

Telangana Human Rights Commission (TGHRC) seeks report from revenue dept on...

July 19, 2025, 7:29 pm

Crisis-hit NHS fat cats raking in MASSIVE salaries as frontline services cry...

Crisis-hit NHS fat cats raking in MASSIVE salaries as frontline services cry...

July 19, 2025, 2:11 pm

© 2025 //www.rssing.com