# RPA-Python **Repository Path**: jianfeihua/RPA-Python ## Basic Information - **Project Name**: RPA-Python - **Description**: RPA-Python - **Primary Language**: Python - **License**: Apache-2.0 - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 2 - **Forks**: 1 - **Created**: 2021-11-08 - **Last Updated**: 2025-06-07 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # RPA for Python :snake: [**v1.45**](https://github.com/tebelorg/RPA-Python/releases) | [**Use Cases**](#use-cases) | [**API Reference**](#api-reference) | [**About & Credits**](#about--credits) | [**Run on Cloud**](https://colab.research.google.com/drive/13bQO6G_hzE1teX35a3NZ4T5K-ICFFdB5?usp=sharing) | [**PyCon Video**](https://www.youtube.com/watch?v=F2aQKWx_EAE) | [**Telegram Chat**](https://t.me/rpa_chat) >_This tool was previously known as TagUI for Python. [More details](https://github.com/tebelorg/RPA-Python/issues/100) on the name change, which is backward compatible so existing scripts written with `import tagui as t` and `t.function()` will still work._ ![RPA for Python demo in Jupyter notebook](https://raw.githubusercontent.com/tebelorg/Tump/master/tagui_python.gif) To install this Python package for RPA (robotic process automation) - ``` pip install rpa ``` To use it in Jupyter notebook, Python script or interactive shell - ```python import rpa as r ``` Notes on different operating systems and optional visual automation mode - - :rainbow_flag: **Windows -** if visual automation is cranky, try setting your display zoom level to recommended % or 100% - :apple: **macOS -** Catalina update introduces tighter app security, see solutions for [PhantomJS](https://github.com/tebelorg/RPA-Python/issues/79) and [Java popups](https://github.com/tebelorg/RPA-Python/issues/78) - :penguin: **Linux -** visual automation mode requires special setup on Linux, see how to [install OpenCV and Tesseract](https://sikulix-2014.readthedocs.io/en/latest/newslinux.html) # Use Cases RPA for Python's simple and powerful API makes robotic process automation fun! You can use it to quickly automate away repetitive time-consuming tasks on websites, desktop applications, or the command line. #### WEB AUTOMATION :spider_web: ```python r.init() r.url('https://www.google.com') r.type('//*[@name="q"]', 'decentralization[enter]') print(r.read('result-stats')) r.snap('page', 'results.png') r.close() ``` #### VISUAL AUTOMATION :see_no_evil: ```python r.init(visual_automation = True) r.dclick('outlook_icon.png') r.click('new_mail.png') ... r.type('message_box.png', 'message') r.click('send_button.png') r.close() ``` #### OCR AUTOMATION 🧿 ```python r.init(visual_automation = True, chrome_browser = False) print(r.read('pdf_window.png')) print(r.read('image_preview.png')) r.hover('anchor_element.png') print(r.read(r.mouse_x(), r.mouse_y(), r.mouse_x() + 400, r.mouse_y() + 200)) r.close() ``` #### KEYBOARD AUTOMATION :musical_keyboard: ```python r.init(visual_automation = True, chrome_browser = False) r.keyboard('[cmd][space]') r.keyboard('safari[enter]') r.keyboard('[cmd]t') r.keyboard('mortal kombat[enter]') r.wait(2.5) r.snap('page.png', 'results.png') r.close() ``` #### MOUSE AUTOMATION :mouse: ```python r.init(visual_automation = True) r.type(600, 300, 'open source') r.click(900, 300) r.snap('page.png', 'results.png') r.hover('button_to_drag.png') r.mouse('down') r.hover(r.mouse_x() + 300, r.mouse_y()) r.mouse('up') r.close() ``` #### TELEGRAM NOTIFICATION :phone: ```python r.telegram('1234567890', 'ID can be string or number, r.init() is not required') r.telegram(1234567890, 'Hello World. Olá Mundo. नमस्ते दुनिया. 안녕하세요 세계. 世界,你好。') r.telegram(1234567890, 'Use backslash n for new line\nThis is line 2 of the message') r.telegram(1234567890, 'Sent using my VPS server endpoint https://tebel.org/rpapybot') r.telegram(1234567890, 'Sent using your own hosted endpoint', 'https://your_endpoint') ``` # API Reference [**Notes**](#general-notes) | [**Element Identifiers**](#element-identifiers) | [**Core Functions**](#core-functions) | [**Basic Functions**](#basic-functions) | [**Pro Functions**](#pro-functions) | [**Helper Functions**](#helper-functions) --- #### GENERAL NOTES See [sample Python script](https://github.com/tebelorg/RPA-Python/blob/master/sample.py), the [RPA Challenge solution](https://github.com/tebelorg/RPA-Python/issues/120#issuecomment-610518196), and [RedMart groceries example](https://github.com/tebelorg/RPA-Python/issues/24). To send a Telegram app notification, [simply look up @rpapybot](https://github.com/tebelorg/RPA-Python/issues/281#issue-942803794) to allow receiving messages. To automate Chrome browser invisibly, use [headless mode](https://github.com/tebelorg/RPA-Python#core-functions). To run 10X faster instead of normal human speed, use [turbo mode](https://github.com/tebelorg/RPA-Python/issues/297) (read the caveats!). You can even run on your phone browser [using this Colab notebook](https://colab.research.google.com/drive/13bQO6G_hzE1teX35a3NZ4T5K-ICFFdB5?usp=sharing) (eg datascraping with up to 5 Colab sessions). By design this package has [enterprise security](https://github.com/kelaberetiv/TagUI/blob/master/README.md#enterprise-security-by-design) and you can install, update and use it [without the internet](https://github.com/tebelorg/RPA-Python#core-functions). You can fully [control error handling](https://github.com/tebelorg/RPA-Python/issues/269) if you wish to, eg snapshot of missing element or posting to custom API endpoint. For fine-grained control on web browser file download location, use [download_location()](https://github.com/tebelorg/RPA-Python/issues/279#issuecomment-877749880). For overriding parent folder location to install and invoke TagUI ([forked version](https://github.com/tebelorg/TagUI) optimised for this package), use [tagui_location()](https://github.com/tebelorg/RPA-Python/issues/257#issuecomment-846602776). #### ELEMENT IDENTIFIERS An element identifier helps to tell RPA for Python exactly which element on the user interface you want to interact with. For example, //\*[@id='email'] is an XPath pointing to the webpage element having the id attribute 'email'. - :globe_with_meridians: For web automation, the web element identifier can be [XPath selector](https://www.linkedin.com/posts/kensoh_xpath-rpa-tagui-activity-6829673864633704448-Iw-D), CSS selector, or the following attributes - id, name, class, title, aria-label, text(), href, in decreasing order of priority. Recommend writing XPath manually or simply using attributes. There is automatic waiting for an element to appear before timeout happens, and error is returned that the element cannot be found. To change the default timeout of 10 seconds, use timeout() function. PS - if you are going to use a Chrome extension for XPaths, use [SelectorsHub](https://chrome.google.com/webstore/detail/selectorshub/ndgimibanhlabgdgjcpbbndiehljcpfh?hl=en). - :camera_flash: An element identifier can also be a .png or .bmp image snapshot representing the UI element (can be on desktop applications, terminal window or web browser). If the image file specified does not exist, OCR will be used to search for that text on the screen to act on the UI element containing the text, eg r.click('Submit Form.png'). Transparency (0% opacity) is supported in .png images. x, y coordinates of elements on the screen can be used as well. Notes for visually [automating 2 monitors](https://github.com/tebelorg/RPA-Python/issues/252#issuecomment-844277454), macOS Retina display [issue and solution](https://github.com/tebelorg/RPA-Python/issues/170#issuecomment-843168745). - :page_facing_up: A further image identifier example is a png image of a window (PDF viewer, MS Word, textbox etc) with the center content of the image set as transparent. This allows using read() and snap() to perform OCR and save snapshots of application windows, containers, frames, textboxes with varying content. See this [image example](https://user-images.githubusercontent.com/10379601/124394598-b59cfd80-dd32-11eb-93bb-68504c91afb9.png) of a PDF frame with content removed to be transparent. For read() and snap(), x1, y1, x2, y2 coordinates pair can be used to define the region of interest on the screen to perform OCR or capture snapshot. #### CORE FUNCTIONS Function|Parameters|Purpose :-------|:---------|:------ init()|visual_automation = False, chrome_browser = True|start TagUI, auto-setup on first run close()||close TagUI, Chrome browser, SikuliX pack()||for deploying package without internet update()||for updating package without internet debug()|True or False|print & log debug info to rpa_python.log >_by default RPA for Python runs at normal human speed, to run 10X faster use init(turbo_mode = True)_ #### BASIC FUNCTIONS Function|Parameters|Purpose :-------|:---------|:------ url()|webpage_url (no parameter to return current URL)|go to web URL click()|element_identifier (or x, y using visual automation)| left-click on element rclick()|element_identifier (or x, y using visual automation)|right-click on element dclick()|element_identifier (or x, y using visual automation)|double-click on element hover()|element_identifier (or x, y using visual automation)|move mouse to element type()|element_identifier (or x, y), text_to_type ('[enter]', '[clear]')|enter text at element select()|element_identifier (or x, y), option_value / text (or x, y)|choose dropdown option read()|element_identifier (page = web page) (or x1, y1, x2, y2)|fetch & return element text snap()|element_identifier (page = web page), filename_to_save|save screenshot to file load()|filename_to_load|load & return file content dump()|text_to_dump, filename_to_save|save text to file write()|text_to_write, filename_to_save|append text to file ask()|text_to_prompt|ask & return user input >_to wait for an element to appear until timeout() value, use hover(). to drag-and-drop, [you can do this](https://github.com/tebelorg/RPA-Python/issues/58#issuecomment-570778431)_ #### PRO FUNCTIONS Function|Parameters|Purpose :-------|:---------|:------ telegram()|telegram_id, text_to_send (first look up [@rpapybot](https://github.com/tebelorg/RPA-Python/issues/281#issue-942803794))|send Telegram message keyboard()|keys_and_modifiers (using visual automation)|send keystrokes to screen mouse()|'down' or 'up' (using visual automation)|send mouse event to screen wait()|delay_in_seconds (default 5 seconds)|explicitly wait for some time table()|table number or XPath, filename_to_save|save webpage table to CSV upload()|element_identifier (CSS only), filename_to_upload|upload file to web element download()|download_url, filename_to_save(optional)|download from URL to file unzip()|file_to_unzip, unzip_location (optional)|unzip zip file to specified location frame()|main_frame id or name, sub_frame (optional)|set web frame, frame() to reset popup()|string_in_url (no parameter to reset to main page)|set context to web popup tab run()|command_to_run (use ; between commands)|run OS command & return output dom()|statement_to_run (JS code to run in browser)|run code in DOM & return output vision()|command_to_run (Python code for SikuliX)|run custom SikuliX commands timeout()|timeout_in_seconds (blank returns current timeout)|change wait timeout (default 10s) keyboard() modifiers and special keys - >_[shift] [ctrl] [alt] [win] [cmd] [clear] [space] [enter] [backspace] [tab] [esc] [up] [down] [left] [right] [pageup] [pagedown] [delete] [home] [end] [insert] [f1] .. [f15] [printscreen] [scrolllock] [pause] [capslock] [numlock]_ #### HELPER FUNCTIONS Function|Parameters|Purpose :-------|:---------|:------ exist()|element_identifier|return True or False if element exists before timeout present()|element_identifier|return True or False if element is present now count()|element_identifier|return number of web elements as integer clipboard()|text_to_put or no parameter|put text or return clipboard text as string get_text()|source_text, left, right, count = 1|return text between left & right markers del_chars()|source_text, characters|return text after deleting given characters mouse_xy()||return '(x,y)' coordinates of mouse as string mouse_x()||return x coordinate of mouse as integer mouse_y()||return y coordinate of mouse as integer title()||return page title of current web page as string text()||return text content of current web page as string timer()||return time elapsed in sec between calls as float >_to type a large amount of text quickly, use clipboard() and keyboard() to paste instead of type()_ # About & Credits TagUI is a leading open-source RPA software :robot: with tens of thousands of users. It was created in 2016-2017 when I left DBS Bank as a test automation engineer, to embark on a one-year sabbatical to Eastern Europe. Most of its code base was written in Novi Sad Serbia. My wife and I also spent a couple of months in Budapest Hungary, as well as Chiang Mai Thailand for visa runs. In 2018, I joined AI Singapore to continue development of TagUI. Over a few months in 2019, I took on a daddy role full-time, taking care of my newborn baby girl and wife :cowboy_hat_face:🤱. In between nannying, I used my time pockets to create this Python package built on TagUI. I hope that [RPA for Python and ML frameworks](https://www.linkedin.com/posts/nived-n-776470139_rpa-tagui-automation-activity-6805844546950438912-Wiq4) would be good friends, and `pip install rpa` would make life easier for Python users. I am maintaining the package in my personal time and I'm happy that tens of thousands of people use it now. At only ~1k lines of code, it would make my day to see developers of other languages port this project over to their favourite programming language. See ample comments in this [single-file package](https://github.com/tebelorg/RPA-Python/blob/master/tagui.py), and its intuitive architecture. Work is now under way by other open-source folks to create TagUI packages for [C# .NET](https://www.nuget.org/packages/tagui) and [Go](https://www.linkedin.com/posts/kensoh_hi-fans-of-go-programming-language-would-activity-6804658389772324864-_OgH) languages. ![RPA for Python architecture](https://raw.githubusercontent.com/tebelorg/Tump/master/TagUI-Python/architecture.png) I would like to credit and express my appreciation below :heart:, and you are invited to [connect on LinkedIn](https://www.linkedin.com/in/kensoh) - - [TagUI](https://github.com/kelaberetiv/TagUI) - AI Singapore from Singapore / [@aisingapore](https://www.aisingapore.org) - [SikuliX](https://github.com/RaiMan/SikuliX1) - Raimund Hocke from Germany / [@RaiMan](https://github.com/RaiMan) - [CasperJS](https://github.com/casperjs/casperjs) - Nicolas Perriault from France / [@n1k0](https://github.com/n1k0) - [PhantomJS](https://github.com/ariya/phantomjs) - Ariya Hidayat from Indonesia / [@ariya](https://github.com/ariya) - [SlimerJS](https://github.com/laurentj/slimerjs) - Laurent Jouanneau from France / [@laurentj](https://github.com/laurentj) - [Philip Vollet](https://www.linkedin.com/in/philipvollet) from Germany, for spreading the word. Philip is a veteran in NLP and open-source. His sharing of RPA for Python helps spread the word to the broader open-source community that there's [pip install rpa](https://www.linkedin.com/posts/philipvollet_github-tebelorgrpa-python-python-package-activity-6823555313950699520-JjEk). ![Philip's LinkedIn Post](https://raw.githubusercontent.com/tebelorg/Tump/master/philip_vollet.png) # License RPA for Python is open-source software released under Apache 2.0 license