This is a simple script that scrapes all the video URLs from certain BiliBili pages. Currently it can scrape all video links from search result pages, a user's 投稿 (submissions) pages, and a user's 频道 (channel) pages.

Rather than right-clicking each video to obtain its URL, this script lets users collect all the video URLs from each page and write them to a csv file. The csv file can then be used by annie or youtube-dl to download every video whose URL is listed in it (youtube-dl is not recommended, as it doesn't download all parts of the video).

This script requires the non-standard modules requests, BeautifulSoup, and Selenium. ChromeDriver is also required to navigate through BiliBili. A requirements text file is included, and the command `pip3 install -r requirements.txt` (or `pip`) can be used to install the required modules. If errors involving pip come up, make sure Python was added to the PATH during installation. It is recommended that pip is installed and up to date.

ChromeDriver is required to work with Selenium; make sure you download the version that matches your Chrome build. It is also recommended to place ChromeDriver in the same folder as this script, so that there is no need to specify the driver path later on. Note that by default the csv file is written to the same folder as this script.

The executable file in the repository is a basic demo version of the current link scraper script. It does not involve command line arguments, which makes it easier to use. The file is temporary and will probably be removed after a little more progress on the link scraper script.

The script accepts the following options:

- q, -quiet: Limit the information printed to the console as the script executes
- a, -append: Scrape links to an existing csv file
- p, -page: Select specific page(s) to scrape
- Prevent links from being extracted on same page as
- w, -wait: The number of seconds to wait for the browser to load
- p, -save: The user's chosen absolute save path for the csv file
- h, -help: Show this help message and exit
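The csv output and the append option described here can be illustrated with a minimal sketch. It is not the script's actual code: the function name `save_links`, its parameters, and the one-URL-per-row layout are assumptions made for illustration, and skipping URLs that are already in the file is a choice of this sketch rather than documented behaviour. It uses only the standard library.

```python
import csv
from pathlib import Path


def save_links(links, csv_path, append=False):
    """Write video URLs to a csv file, one URL per row (hypothetical helper).

    With append=True, existing rows are kept and URLs already present in the
    file are skipped. Returns the number of rows written by this call.
    """
    csv_path = Path(csv_path)

    # When appending, load the URLs that are already stored so they are not repeated.
    seen = set()
    if append and csv_path.exists():
        with csv_path.open(newline="", encoding="utf-8") as f:
            seen = {row[0] for row in csv.reader(f) if row}

    # Keep the original order while dropping duplicates.
    new_links = []
    for url in links:
        if url not in seen:
            seen.add(url)
            new_links.append(url)

    # "a" extends an existing file, "w" starts a fresh one.
    mode = "a" if append else "w"
    with csv_path.open(mode, newline="", encoding="utf-8") as f:
        csv.writer(f).writerows([url] for url in new_links)

    return len(new_links)
```

For example, `save_links(urls, Path(__file__).parent / "links.csv")` would write next to the script, which mirrors the default output location mentioned above; the file name `links.csv` is likewise only a placeholder.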