Perl: GET Web Page Content

By Xah Lee. Date: . Last updated: .

In Perl, the easiest way to get a webpage is to use the Perl program HEAD or GET usually installed at /usr/bin. For example, in shell, type:

GET google.com

It'll return the web page content. You can save it to a file by GET google.com > myfile.txt.

HEAD and GET are two calling methods of the HTTP protocol. The Perl script are named that way for this reason. [see HTTP Protocol Tutorial]

For more contorl, use LWP::Simple or LWP::UserAgent. Both of these you need to install.

# -*- coding: utf-8 -*-
# perl

# get web page content

use strict;
# use LWP::Simple;
use LWP::UserAgent;

my $ua = new LWP::UserAgent;
$ua->timeout(120);
my $url='http://example.com/';
my $request = new HTTP::Request('GET', $url);
my $response = $ua->request($request);
my $content = $response->content();
print $content;

In the above, the $ua -> timeout(120); is a Object Oriented syntax.

If you have a question, put $5 at patreon and message me.

Perl

  1. Perl Overview
  2. Version String
  3. Help System

Detail

  1. Quoting String
  2. Format String
  3. String Operations
  4. True, False
  5. if then else
  6. Loop
  7. List / Array
  8. Loop Thru List
  9. Map f to List
  10. List Comprehension
  11. Hash Table
  12. Function Optional Param
  13. regex

Text Processing

  1. Unicode 🐪
  2. Convert File Encoding
  3. Read Write File
  4. Traverse Dir
  5. Find Replace
  6. Validate Local Links
  7. Split Line by Regex

Advanced

  1. Sort List, Matrix, Object
  2. Sort Matrix
  3. Sort Unstable
  4. Sort Misc
  5. List Modules, Search Paths
  6. Write a Module
  7. Complex Numbers
  8. System Call
  9. gzip
  10. Get Env Var
  11. GET Web Content
  12. Email