php - Looping through a table with Simple HTML DOM

I'm using Simple HTML DOM to extract data from an HTML document, and I have a couple of issues that I need some help with.

On the line that begins with if ($td->find('a')) I want to extract the href and the content of the anchor node separately, and place them in separate variables. The code, however, doesn't work (see the output noted next to each echo in the code below).

What is the best way to do this? Note that my purpose is to create an XML document out of the information later on, so I need the information in the correct order.

The links lead to pages containing detailed information about the different cars (e.g. "Max speed", "Price", etc.) that I also want to extract and put into separate variables. How can I get hold of the data on these pages?

include 'simple_html_dom.php';

$html = new simple_html_dom();

$html = file_get_html('http://www.example.com/foo.html');

$items = array();

foreach ($html->find('table') as $table) {
    foreach ($table->find('tr') as $tr) {

        foreach ($tr->find('td') as $td) {

            if ($td->find('a')) {

                $link = $td->find('a.href');
                echo $link;  // empty

                $text = $td->find('a.text');
                echo $text; // Array
            }
            else {
                echo 'Name: ' . $td;
            }
        }

    }
}

The HTML document looks like this:


    









        Car 1
        Porsche
        Car 2
        Chrysler
        ... and so on...

 Answer
 

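Use $td->find('a', 0)->href to read the anchor's href attribute and $td->find('a', 0)->innertext to read its contents. A selector such as 'a.href' does not address the attribute; it matches a elements that have the class href. The second argument matters as well: without an index, find() returns an array of every matching element, while passing 0 returns only the first anchor, which is a safe choice if a cell might contain several.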
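As a rough sketch of how this could fit into the loop from the question, the example below collects one entry per table row so the values stay in document order for later XML output. The table markup, the file names in the href attributes, and the array keys name, link, and text are all assumptions made for illustration, since the real page's HTML is not shown. It uses str_get_html() so it runs without a network request, and plaintext instead of innertext so any nested tags are dropped.

```php
<?php
// Assumes simple_html_dom.php sits next to this script, as in the question.
include 'simple_html_dom.php';

// Hypothetical markup standing in for the real page (the original HTML is not shown).
$html = str_get_html(
    '<table>'
    . '<tr><td>Car 1</td><td><a href="porsche.html">Porsche</a></td></tr>'
    . '<tr><td>Car 2</td><td><a href="chrysler.html">Chrysler</a></td></tr>'
    . '</table>'
);

$items = array();

foreach ($html->find('tr') as $tr) {
    $item = array();
    foreach ($tr->find('td') as $td) {
        // With an index, find() returns a single element, or null if there is no match.
        $a = $td->find('a', 0);
        if ($a !== null) {
            $item['link'] = $a->href;      // attribute value
            $item['text'] = $a->plaintext; // anchor contents without tags
        } else {
            $item['name'] = $td->plaintext;
        }
    }
    $items[] = $item; // one entry per row, in document order
}

print_r($items);
```

The detail pages could presumably be fetched the same way, by passing each collected link to file_get_html(), provided the href is an absolute URL. The selectors needed there depend on markup that is not shown here.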

    






August 27, 2017














                    
