How to use Gemini to bypass image captcha when web scraping

In this project, I tackled the challenge of automating text summarization for my wife’s judicial studies by scraping Brazil’s Supreme Federal Court decisions. Along the way, I hit a snag with image captchas and devised a clever workaround using an LLM to solve them, feeding it screenshots and simulating clicks on canvas elements. This notebook showcases that process, offering a practical example of integrating LLMs into web scraping—perfect for anyone facing similar hurdles.

Which Economic Tasks are Performed with AI? Evidence from Millions of Claude Conversations

This paper, “Which Economic Tasks are Performed with AI? Evidence from Millions of Claude Conversations,” presents a novel approach to understanding the real-world integration of Artificial Intelligence (AI) into the economy. By analyzing over four million conversations from the Claude.ai platform, the authors provide empirical evidence of how AI is currently being used across various […]

How to extract images and drawings from PDF with Python

Extracting images and drawings from PDF files can be a challenging task, but with the right tools and techniques, it’s entirely achievable. This blog post explores how to use the PyMuPDF library in Python to extract both images and drawings from PDF documents. We’ll dive into the nuances of handling transparency layers in images and clustering drawings to preserve embedded text. Whether you’re building a PDF summarizer or simply need to extract visual content from PDFs, these methods provide a robust solution to automate the process.

ARIMA and Online Learning in Financial Forecasting

I discuss the development of an online learning system using the Jane Street Real-Time Market Data Forecasting challenge as a practice ground for time-series forecasting. The project involves predicting the responder_6 variable using an ARIMA model, with a focus on adapting to new data by re-training the model whenever a new date_id is encountered. This approach leverages multiprocessing to meet strict time constraints

Walk Forward Validation on Jane Street Real-Time Market Data Forecast

Walk Forward Validation (WFV) involves a training window that moves forward in time, training the model on historical data and then validating it on future, unseen data points. Unlike traditional cross-validation where data is randomly split, WFV respects the sequence of time, making it ideal for datasets with time-dependent features like stock prices, weather patterns, or sales figures.

How to create and save charts with CrewAI agents and AWS S3

In the GDSC7 challenge, we’ve upgraded our agent system to create and display charts in response to user queries, using AWS S3 for image storage. The new chart.py tool leverages Pandas, Matplotlib, and Seaborn to generate various chart types, enhancing our system’s capability to present data visually. This integration allows us to effectively showcase complex information, such as the correlation between GDP and reading skills from the PIRLS 2021 study, improving user engagement and interaction.