Author Topic: [Bash]Split and Curl  (Read 734 times)

0 Members and 1 Guest are viewing this topic.

Offline gray-fox

  • Knight
  • **
  • Posts: 208
  • Cookies: 52
    • View Profile
[Bash]Split and Curl
« on: January 18, 2015, 11:23:02 pm »
Wasn't quit sure if i should have posted this in here http://evilzone.org/random/downloading-very-large-files/ , where the idea of this started. But i thought i have ready enough script to post it in here as a new topic.
If you haven't read the earlier topic, idea of this program is to download file from a server(it uses curl for that) in splitted parts and then merge them back to that original file.


I made this quit quickly and i haven't done bash scripting in a while, so there might be some minor bugs and weird pieces of code left. Also if someone/everyone sees something strange in my mathematical solutions i can confess that i suck in math, end of the story.

Okay, here is some usage example:
Code: [Select]
./scurl.sh <url>
Just give url as an argument like you would when using curl and program checks the size of the file and asks which sizes parts you want to make. Last splitted file of course gets what is left at the end.

Code: [Select]
./scurl.sh -m <give_wanted_file.name>
So this is the action, which merge splitted files back to original one.(or atleast it is suppose to :D ) I have check few md5sums and they have matched.

Code: [Select]
./scurl.sh <url> -p
This action gives you chance to choose which part of the splitted file you wan't to download. It also asks whichs size of parts you want to make. It roughly counts in how many pieces the file is going to split with size you have wanted for the parts. Last part/file ofc. once again gets what remains at the end, depending on the size of other  splitted parts.


But here is actual code:
Code: (bash) [Select]
#!/bin/bash
split_dl()
{
clear
check_url=$( curl -sI $URL | grep -i Accept-ranges )
if [[ $check_url = *"none"* ]]
then
    while [ "$ans" != "y" ] && [ "$ans" != "n" ]
    do
        clear
        echo "Server doesn't seem to support byte ranging"
        echo "Continue anyway(y/n):"
        read ans
    done
    if [ $ans = "n" ]
    then
        exit
    fi
elif [[ $check_url != *"bytes"* ]]
then
    clear
    echo "'bytes' not found in 'Accept-ranges' header."
    echo "If error occurs try downloading from different server"
    sleep 2
fi
byte_sz=$( curl -sI $URL | grep -i Content-Length | awk '{print $2}' )
mod_byte_sz=$(tr -d '\r' <<< "$byte_sz")
if [[ $mod_byte_sz -eq 0 ]]
then
    clear
    echo "Couldn't fetch size of the file"
    echo "Give exact file size(MB):"
    read alter_sz
    mod_byte_sz=$(( $alter_sz * 1048576))
fi
mb_sz=$(( $mod_byte_sz / 1048576 ))
clear
echo "Size of the file is: $mb_sz MB"
echo "Give size(MB) of the first part you want to split:"
read part_sz
count=$(( $mb_sz / $part_sz - 1 ))
psz=$(( $part_sz * 1048576 ))
i=1
j=0
while [ $count -gt 0 ]
do
    curl --range $j-$(( $psz * $i )) -o file.part$i $URL
    count=$(( $count - 1 ))
    i=$(( $i + 1 ))
    j=$(( $j + $psz + 1 ))
done
curl --range $j- -o file.part$i $URL
}

part_dl()
{
clear
clear
check_url=$( curl -sI $URL | grep -i Accept-ranges )
if [[ $check_url = *"none"* ]]
then
    while [ "$ans" != "y" ] && [ "$ans" != "n" ]
    do
        clear
        echo "Server doesn't seem to support byte ranging"
        echo "Continue anyway(y/n):"
        read ans
    done
    if [ $ans = "n" ]
    then
        exit
    fi
elif [ [$check_url != *"bytes"* ]]
then
    clear
    echo "'bytes' not found in 'Accept-ranges' header."
    echo "If error occurs try downloading from different server"
    sleep 2
fi
byte_sz=$( curl -sI $URL | grep -i Content-Length | awk '{print $2}' )
mod_byte_sz=$(tr -d '\r' <<< "$byte_sz")
if [[ $mod_byte_sz -eq 0 ]]
then
    clear
    echo "Couldn't fetch size of the file"
    echo "Give exact file size(MB):"
    read alter_sz
    mod_byte_sz=$(( $alter_sz * 1048576))
fi
mb_sz=$(( $mod_byte_sz / 1048576 ))
clear
echo "Size of the file is: $mb_sz MB"
echo "Give size(MB) of the first part you want to split:"
read part_sz
count=$(( $mb_sz / $part_sz ))
clear
echo "File is splitted in $count parts"
echo "Select which part to download: "
read part
count=$(( $mb_sz / $part_sz - 1 ))
psz=$(( $part_sz * 1048576 ))
i=1
j=0
while [ $count -gt 0 ]
do
    if [ $i = $part ]
    then
        curl --range $j-$(( $psz*$i )) -o file.part$i $URL
    fi
    count=$(( $count - 1 ))
    i=$(( $i + 1 ))
    j=$(( $j + $psz + 1 ))
done
if [ $i = $part ]
then
    curl --range $j- -o file.part$i $URL
fi
}
help_menu()
{
echo "Usage1: scurl.sh <url> [optional -p]"
echo "Usage2: scurl.sh <-m> <output_file.name>"
echo "-m, merge downloaded parts"
echo "-p, choose one part to download"
}

merge()
{
echo "Merging $f_name"
cat file.part* &gt; $f_name
}
clear

if [ $# -eq 0 ]
then
    help_menu
fi


if [ $# -eq 1 ]
then
    URL=$1
    split_dl   
fi
if [ $# -gt 1 ]
then
    if [ $1 = "-m" ]
    then
        f_name=$2
        merge
    elif [ $2 = "-p" ]
    then
        URL=$1
        part_dl
    else
        help_menu
    fi
fi


Like i said, i made this really quickly and haven't glanced the code over and over again.(Like i usually do)
But IMO the concept of this program is quit interesting, so that's why i already share it, even if my solutions in code are far a way from flawless. ;D
« Last Edit: January 20, 2015, 08:58:50 pm by gray-fox »

Offline Syntax990

  • Peasant
  • *
  • Posts: 129
  • Cookies: 77
  • Bruce Willis
    • View Profile
    • Evilzone "Hack"
Re: [Bash]Split and Curl
« Reply #1 on: January 18, 2015, 11:49:07 pm »
Massive respect for this man! My input on that thread weren't exactly special, but you just blew us out the water!

+1

Offline HTH

  • Official EZ Slut
  • Administrator
  • Knight
  • *
  • Posts: 395
  • Cookies: 158
  • EZ Titan
    • View Profile
Re: [Bash]Split and Curl
« Reply #2 on: January 18, 2015, 11:56:46 pm »
+1 for ingenuity


I must note however for any potential readers that not all servers or clients are required to support byte ranges. I believe most do as of right now but it is worth mentioning.


Aside from that, and really thats not your fault anyway :p Good job! :)
<ande> HTH is love, HTH is life
<TurboBorland> hth is the only person on this server I can say would successfully spitefuck peoples women

Offline z3ro

  • Knight
  • **
  • Posts: 345
  • Cookies: 60
    • View Profile
Re: [Bash]Split and Curl
« Reply #3 on: January 19, 2015, 06:16:11 am »
Nice job man!!  ;D
+1
~ God is real. Unless declared as an integer.

Offline gray-fox

  • Knight
  • **
  • Posts: 208
  • Cookies: 52
    • View Profile
Re: [Bash]Split and Curl
« Reply #4 on: January 20, 2015, 04:49:32 pm »
I must note however for any potential readers that not all servers or clients are required to support byte ranges. I believe most do as of right now but it is worth mentioning.
Have to admit that i didn't know this. Thanks for the info! :)
I researched this a bit and tried to add some sort of error handling. Even though curl's own error message about this seems to be quite clear, i still prefer adding also something self written:
Code: [Select]
check_url=$( curl -sI $URL | grep -i Accept-ranges )
if [[ $check_url = *"none"* ]]
then
    while [ "$ans" != "y" ] && [ "$ans" != "n" ]
    do
        clear
        echo "Server doesn't seem to support byte ranging"
        echo "Continue anyway(y/n):"
        read ans
    done
    if [ $ans = "n" ]
    then
        exit
    fi
elif [[ $check_url != *"bytes"* ]]
then
    clear
    echo "'bytes' not found in 'Accept-Ranges' header. If error occurs try downloading from different server"
    sleep 2
fi

I also add option for user to input size of the file if there is problem with Content-Lenght header.
Code: [Select]
if [[ $mod_byte_sz -eq 0 ]]
then
    echo "Couldn't fetch size of the file"
    echo "Give exact size of the file(MB):"
    read alter_sz
    mod_byte_sz=$(( $alter_sz * 1048576))
fi

edit: Fixed some typos in code
« Last Edit: January 20, 2015, 08:59:25 pm by gray-fox »