Author Topic: [Source] Crawler.py  (Read 918 times)

0 Members and 1 Guest are viewing this topic.

Offline daxda

  • Peasant
  • *
  • Posts: 114
  • Cookies: 112
  • Not the guy you're looking for
    • View Profile
    • Daxda on Github
[Source] Crawler.py
« on: November 17, 2013, 04:24:43 pm »
Created a crawler today, it extracts links from the passed URL and stores the gathered list in a text file in the same folder as the script, called harvested.txt (it gets overwritten so be careful).

Usage is straight forward:
python Crawler.py -u <target url> [-d<0-20> optional level of depth, default 0] [-s optional flag that the crawler must stay in the same domain]

As always, free code!
Use it or loose it.
Thanks for reading my source, feedback, critique, motivation(any kind) is welcome.

clone link: https://gist.github.com/7514550.git
[gist]Daxda/7514550[/gist]
« Last Edit: April 23, 2014, 08:17:47 pm by daxda »

Offline proxx

  • Avatarception
  • Global Moderator
  • Titan
  • *
  • Posts: 2803
  • Cookies: 256
  • ФФФ
    • View Profile
Re: [Source] Crawler.py
« Reply #1 on: November 17, 2013, 04:28:04 pm »
Am I stupid or where is the link ?
Wtf where you thinking with that signature? - Phage.
This was another little experiment *evillaughter - Proxx.
Evilception... - Phage