<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Pandas on xzerad&#39;s blog</title>
    <link>https://xzerad.github.io/tags/pandas/</link>
    <description>Recent content in Pandas on xzerad&#39;s blog</description>
    <generator>Hugo</generator>
    <language>en-us</language>
    <lastBuildDate>Thu, 05 Sep 2024 08:52:07 +0100</lastBuildDate>
    <atom:link href="https://xzerad.github.io/tags/pandas/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Looping in Pandas DataFrames: A Common Mistake</title>
      <link>https://xzerad.github.io/posts/looping-pandas-dataframes-common-mistake/</link>
      <pubDate>Thu, 05 Sep 2024 08:52:07 +0100</pubDate>
      <guid>https://xzerad.github.io/posts/looping-pandas-dataframes-common-mistake/</guid>
      <description>Introduction If you’ve just started in the data engineering field, you’ve probably used the pandas library—a powerful tool that lets you read structured data from various sources and formats, perform calculations, and export it to different formats. What more could you wish for? But when working on small datasets, we often don’t pay attention to performance. We write a script that does what we need, test it on a small dataset, and everything works perfectly.</description>
    </item>
  </channel>
</rss>
