2017-07-23

laravel-goutte: How do I scrape values such as the URL, image src, and title with dweidner/laravel-goutte?

$crawler = Goutte::request('GET', 'https://examplesite.com/');

$crawler->filter('.blog')->each(function ($node) {
    $uri = $node->html(); // the value of $uri is shown below
    dump($uri);
});

Here is the value of $uri:

$uri = """<div class="blog" > 
    <a class="url" href="/blog/url"> 
     <div class="blog-screenshot"> 
      <img src="/blog/img/img.png" alt=""> 
     </div> 

     <span class="details">More Info</span> 
     <div class="author">By <span class="author">John Doe</span></div> 
     <h3 class="blog-title">BLOG TITLE</h3> 
    </a> 
    <div class="blog-actions"> 
     <a class="blog-preview" href="/blog/preview/url">Preview</a> 
    </div> 
</div>""" 

Now, how can I extract the URL href, the img src, the title, and the action (preview) link from $uri?

Answer


Try this:

$crawler->filter('.blog')->each(function ($node) {
    $blogUrl = $node->find('.url', 0)->attr('href');
    $screenshotSrc = $node->find('.blog-screenshot > img', 0)->attr('src');
    $title = $node->find('.blog-title', 0)->text();
    $previewUrl = $node->find('.blog-preview', 0)->attr('href');
});

Thank you! It worked, but with some changes: $crawler->filter('.blog')->each(function ($node) { $blogUrl = $node->filter('.url')->link()->getUri(); $screenshotSrc = $node->filter('img')->attr('src'); $title = $node->filter('.blog-title')->text(); $previewUrl = $node->filter('.blog-preview')->attr('href'); });
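
For reference, the crawler that Goutte returns is a Symfony DomCrawler `Crawler`, which is why `filter()` works here and `find()` does not. Below is a minimal sketch of the approach from the comment above, run against the sample markup from the question instead of a live request. It assumes `symfony/dom-crawler` and `symfony/css-selector` are installed through Composer (Goutte pulls both in). The `$posts` array and its keys are names made up for this illustration, not part of any library.

```php
<?php
// Assumption: Composer autoloading with symfony/dom-crawler and
// symfony/css-selector available (both are dependencies of Goutte).
require __DIR__ . '/vendor/autoload.php';

use Symfony\Component\DomCrawler\Crawler;

// Sample markup from the question, inlined so no HTTP request is needed.
$html = <<<'HTML'
<div class="blog">
    <a class="url" href="/blog/url">
        <div class="blog-screenshot">
            <img src="/blog/img/img.png" alt="">
        </div>
        <span class="details">More Info</span>
        <div class="author">By <span class="author">John Doe</span></div>
        <h3 class="blog-title">BLOG TITLE</h3>
    </a>
    <div class="blog-actions">
        <a class="blog-preview" href="/blog/preview/url">Preview</a>
    </div>
</div>
HTML;

// The second argument is the base URI; link()->getUri() needs it to turn
// a relative href into an absolute URL.
$crawler = new Crawler($html, 'https://examplesite.com/');

// each() returns an array of whatever the closure returns, so the values
// can be collected instead of being lost inside the callback.
// $posts and its keys are hypothetical names chosen for this sketch.
$posts = $crawler->filter('.blog')->each(function (Crawler $node) {
    return [
        'url'        => $node->filter('.url')->link()->getUri(),   // absolute URL
        'screenshot' => $node->filter('img')->attr('src'),         // raw src attribute
        'title'      => trim($node->filter('.blog-title')->text()),
        'preview'    => $node->filter('.blog-preview')->attr('href'), // raw href attribute
    ];
});

print_r($posts);
```

Note that `link()->getUri()` resolves the href against the base URI, while `attr('href')` and `attr('src')` return the attribute exactly as written in the markup. If a selector matches nothing, `attr()` and `text()` throw an `InvalidArgumentException`, so on real pages it may be worth checking `$node->filter(...)->count()` first.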