So, a ~35MB XML file containing eight different types of records, something in the region of 50,000 of them in total (that's just a guesstimate, as I'm only parsing half of the types at this point), takes about four minutes to parse on my MBA. That's in Python with lxml, and I'm stuffing them into a PostgreSQL DB via Django's ORM. Not that bad, really, all things considered

Comment