selenium downloads txt file with > and < instead of > and <

selenium downloads txt file with > and < instead of > and < - selenium

I'm downloading a text file using selenium (Firefox driver) and it has a mix of HTML and xbrl tags. When I look at the website in Chrome, it normally renders greater than signs and less than signs as > and < but when I download the txt file using selenium it has &gt an &lt in place of those elements.
part of actual website rendered in Chrome: txt in chrome
.txt downloaded via selenium:
txt downloaded via selenium using Firefox driver
opts = FirefoxOptions()
opts.add_argument("--headless")
browser = webdriver.Firefox(options=opts)
browser.get(txtURL)
with open(downloadedTxtFileName, "w") as f:
f.write(browser.page_source)

This is because the text file actually contains symbols like &gt and &lt.
When you open this file with browser the browser shows you symbols like < and > instead of the above actual textual symbols.
So, Selenium works correct.

Related

Python Selenium - open more Chrome apps

I'm using Selenium to automate certain stuff in Chrome and I know how to open multiple tabs, but is it possible to open Chrome itself multiple times?
Right now, when I want to open a new Chrome app the old one closes. I want it to stay open.

every time you want to open a new Chrome Browser you have to create a new instance of the webdriver.
from time import sleep
from selenium import webdriver
fist_driver = webdriver.Chrome(executable_path="/path/to/chromedriver")
fist_driver.get("https://google.com")
second_driver = webdriver.Chrome(executable_path="/path/to/chromedriver")
second_driver.get("https://ifconfig.me")
sleep(5)
# using for loop
for _ in range(2): # How much browser you want to open
driver = webdriver.Chrome(executable_path="/path/to/driver")
driver.get("https://google.com")
sleep(5)

Selenium File Upload Not Working in Azure Pipeline/ VM Machine

Hi I am Trying Achieve File upload Scenario My Test cases using selenium
Element I am Doing Actions On:-
<input _ngcontent-swn-c63="" type="file" id="undefined" multiple="" accept=".gif, .xlsm, .zip, .xls,.xlsx, .pdf, .ppt, .pptx, .doc, .docx, .jpeg, .jpg, .png, .csv, .txt">
Selenium Script I am Using :-
WebElement UploadElement=(new WebDriverWait(driver, BaseInitializer.defaultwaitTime)).until(ExpectedConditions.visibilityOfElementLocated(By.xpath("//input[#type='file' and #id='undefined']")));
((JavascriptExecutor)driver).executeScript("arguments[0].style.display = 'block';", UploadElement);
UploadElement.sendKeys(newname.getAbsolutePath());
While the above piece of code is working in My Local Machine(Chrome browser). Same script is not working while running the code in Azures Pipeline. The File is not getting reflected in Upload Element. What may be the issue? Or I am missing something?

How to use selenium in pandas to read a webpage?

I want to collect the information of a webpage using chromedriver. How do I install it and use it?

You have to install selenium first if you don't have it already. Then to use selenium:
from selenium.webdriver import Chrome
url="URL of the webpage you want to read"
setting up the driver
webdriver = "path of the chromedriver.exe file saved in your pc"
driver.get(url)
using css selector
y = driver.find_element_by_css_selector('css selector of the data you want to read from the webpage').text
print(y)

You don't install the chromedriver - you download the .exe (from here) and use the path to it in webdriver.Chrome(). This getting started page has a comprehensive guide:
from selenium import webdriver
driver = webdriver.Chrome('/path/to/chromedriver') # refers to the path where you saved the exe
driver.get('http://www.google.com/');
time.sleep(5) # Let the user actually see something!
search_box = driver.find_element_by_name('q')
search_box.send_keys('ChromeDriver')
search_box.submit()
time.sleep(5) # Let the user actually see something!
driver.quit()
Note: download the .exe that matches with your version of chrome!
(In Help > About Google Chrome)

As mentioned by #Patha_Mondal, you need to download the driver and select the elements you want to read. However, as your original question asks "How to use selenium in pandas to read a webpage?", I would say instead consider using Scrapy along with Selenium to create a ".csv" file from the Webpage Data.
Read the ".csv" data into pandas using pandas.read_csv() .
The data from the Webpage might not be clean or properly formatted. Using Scrapy to create a dataset out of it would be beneficial for reading it into pandas. Avoid using pandas directly in the same script as Selenium and Scrapy.
Hope it Helped.

Click on elements in Chrome Extension with selenium

I have been searching on the internet using Selenium (Java) interacting with Google Chrome Extension but have not been able to find an answer.
First Question
Is there a way to launch the chrome extension since Selenium only interact with WebView but not on the chrome extensions button in the browser ?
I try this method
"chrome-extension://id/index.html" but the extension did not launch as expected. I like find if there is another way to launch a chrome extension through selenium
Second Question
I am trying to click on the elements in a chrome extension with Selenium webdriver. How do I do it ? I tried the driver.CurrentWindowHandle , but it does not detect the chrome extension.
Thanks

Below is the solution with pyautogui (similar to autoit in java - so you can extend the same solution for java also).
Pre-Condition:
save the extension image in the project folder (I saved it under "autogui_ref_snaps" folder in my example with "capture_full_screenshot.png" name
Python:
Imports needed
from selenium import webdriver
from selenium.webdriver import ChromeOptions
from Common_Methods.GenericMethods import *
import pyautogui #<== need this to click on extension
Script:
options = ChromeOptions()
options.add_argument("--load-extension=" + r"C:\Users\supputuri\AppData\Local\Google\Chrome\User Data\Default\Extensions\fdpohaocaechififmbbbbbknoalclacl\5.1_0") #<== loading unpacked extension
driver = webdriver.Chrome(
executable_path=os.path.join(chrome_options=options)
url = "https://google.com/"
driver.get(url)
# get the extension box
extn = pyautogui.locateOnScreen(os.path.join(GenericMethods.get_full_path_to_folder('autogui_ref_snaps') + "/capture_full_screenshot.png"))
# click on extension
pyautogui.click(x=extn[0],y=extn[1],clicks=1,interval=0.0,button="left")
If you are loading an extension and it's not available in incognito mode then follow my answer in here to enable it.

Try to click on extension with this JsExecutor method:
driver.execute_script("window.postMessage('clicked_browser_action', '*')")

Failed- Path too long error while downloading file in Chrome using selenium

I want to download file in my current working directory using selenium automation. But I am getting 'Path too long' error. The code I have written so far is:
os.chdir(os.path.dirname(__file__))
current_directory = os.getcwd()
windows_cwd = current_directory.replace('\\','\\\\')+'\\\\'
chrome_options = webdriver.ChromeOptions()
prefs = {'download.default_directory': windows_cwd,
'download.directory_upgrade': True,
'safebrowsing.enabled': False,
'safebrowsing.disable_download_protection': True
}
chrome_options.add_experimental_option('prefs',prefs)
browser = webdriver.Chrome(options=chrome_options)
My current working directory is:
C:\Users\US177\PycharmProjects\Plugin
where the path is too long.
But it successfully downloads to
C:\Users\US177\Desktop
failed-long path

I'm not exactly sure what your question is based on the information provided, but I'm guessing it's along the lines of "Why is this happening?", so I will address that question.
The maximum length of a file name in Windows is 260 characters. The file is able to download to your desktop because the name of the file (when appended to your path) does not exceed this limit. When trying to download to PycharmProjects\Plugin\ folder, the path has become too long.

While setting your download path, try using double backslash (ie. path\\to\\directory).
See this Github issue about programatically downloading from chrome

We Keep Coding

sql objective-c vba vb.net react-native apache vue.js tensorflow api pandas

selenium downloads txt file with > and < instead of > and < - selenium

This is because the text file actually contains symbols like &gt and &lt. When you open this file with browser the browser shows you symbols like < and > instead of the above actual textual symbols. So, Selenium works correct.

Related

Python Selenium - open more Chrome apps

Selenium File Upload Not Working in Azure Pipeline/ VM Machine

How to use selenium in pandas to read a webpage?

Click on elements in Chrome Extension with selenium

Failed- Path too long error while downloading file in Chrome using selenium

Categories

Resources