10.43 Importing XML into MySQL
10.43.1 Problem
You want to import an XML document into a MySQL table.

10.43.2 Solution
Set up an XML parser to read the document. Then use the records in the document to construct and execute INSERT statements.

10.43.3 Discussion
Importing an XML document depends on being able to parse the document and extract record contents from it. The way you do this will depend on how the document is written. For example, one format might represent column names and values as attributes of <column> elements:

    <?xml version="1.0" encoding="UTF-8"?>
    <rowset>
      <row>
        <column name="subject" value="Jane" />
        <column name="test" value="A" />
        <column name="score" value="47" />
      </row>
      <row>
        <column name="subject" value="Jane" />
        <column name="test" value="B" />
        <column name="score" value="50" />
      </row>
      ...
    </rowset>

Another format is to use column names as element names and column values as the contents of those elements:

    <?xml version="1.0" encoding="UTF-8"?>
    <rowset>
      <row>
        <subject>Jane</subject>
        <test>A</test>
        <score>47</score>
      </row>
      <row>
        <subject>Jane</subject>
        <test>B</test>
        <score>50</score>
      </row>
      ...
    </rowset>

Due to the various structuring possibilities, it's necessary to make some assumptions about the format you expect the XML document to have. For the example here, I'll assume the second format just shown. One way to process this kind of document is to use the XML::XPath module, which allows you to refer to elements within the document using path expressions. For example, the path //row selects all the <row> elements under the document root, and the path * selects all children of a given element. We can use these paths with XML::XPath to obtain first a list of all the <row> elements, and then for each row a list of all its columns.

The following script, xml_to_mysql.pl, takes three arguments:

    xml_to_mysql.pl db_name tbl_name xml_file

The filename argument indicates which document to import, and the database and table name arguments indicate which table to import it into. xml_to_mysql.pl processes the command-line arguments and connects to MySQL (not shown), then processes the document:

    #! /usr/bin/perl -w
    # xml_to_mysql.pl - read XML file into MySQL

    use strict;
    use DBI;
    use XML::XPath;

    # ... process command-line options (not shown) ...

    # ... connect to database (not shown) ...

    # Open file for reading
    my $xp = XML::XPath->new (filename => $file_name);
    my $row_list = $xp->find ("//row");             # find set of <row> elements
    print "Number of records: " . $row_list->size () . "\n";
    foreach my $row ($row_list->get_nodelist ())    # loop through rows
    {
        my @name;   # array for column names
        my @val;    # array for column values
        my $col_list = $row->find ("*");            # children (columns) of row
        foreach my $col ($col_list->get_nodelist ())    # loop through columns
        {
            # save column name and value
            push (@name, $col->getName ());
            push (@val, $col->string_value ());
        }
        # construct INSERT statement, then execute it
        my $stmt = "INSERT INTO $tbl_name ("
                    . join (",", @name)
                    . ") VALUES ("
                    . join (",", ("?") x scalar (@val))
                    . ")";
        $dbh->do ($stmt, undef, @val);
    }

    $dbh->disconnect ();

    exit (0);

The script creates an XML::XPath object, which opens and parses the document. Then the object is queried for the set of <row> elements, using the path //row. The size of this set indicates how many records the document contains.

To process each row, the script uses the path * to ask for all the children of the row object. Each child corresponds to a column within the row; using * as the path for find() this way is convenient because we need not know in advance which columns to expect. xml_to_mysql.pl obtains the name and value from each column and saves them in the @name and @val arrays. After all the columns have been processed, the arrays are used to construct an INSERT statement that names those columns that were found to be present in the row and that includes a placeholder for each data value. (Recipe 2.7 discusses placeholder list construction.) Then the script issues the statement, passing the column values to do() to bind them to the placeholders.

In the previous section, we used mysql_to_xml.pl to export the contents of the expt table as an XML document. xml_to_mysql.pl can be used to perform the converse operation of importing the document back into MySQL:

    xml_to_mysql.pl cookbook expt expt.xml

As it processes the document, the script generates and executes the following set of statements:

    INSERT INTO expt (subject,test,score) VALUES ('Jane','A','47')
    INSERT INTO expt (subject,test,score) VALUES ('Jane','B','50')
    INSERT INTO expt (subject,test) VALUES ('Jane','C')
    INSERT INTO expt (subject,test) VALUES ('Jane','D')
    INSERT INTO expt (subject,test,score) VALUES ('Marvin','A','52')
    INSERT INTO expt (subject,test,score) VALUES ('Marvin','B','45')
    INSERT INTO expt (subject,test,score) VALUES ('Marvin','C','53')
    INSERT INTO expt (subject,test) VALUES ('Marvin','D')

Note that these statements do not all insert the same number of columns. Statements with missing columns correspond to rows with NULL values.
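The statement-construction step can be tried on its own, without a database or an XML parser. The following is a minimal sketch under stated assumptions: make_insert_stmt() is a hypothetical helper that is not part of xml_to_mysql.pl, and the hardcoded @rows data stands in for the name/value pairs that the script would collect from each row of the document. It only illustrates why a row with fewer columns yields a shorter column list and fewer placeholders.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not part of xml_to_mysql.pl): build an INSERT
# statement that names the given columns and contains one "?"
# placeholder per column.
sub make_insert_stmt
{
    my ($tbl_name, @name) = @_;
    return "INSERT INTO $tbl_name ("
           . join (",", @name)
           . ") VALUES ("
           . join (",", ("?") x scalar (@name))
           . ")";
}

# Assumed sample data: each row is a list of [name, value] pairs,
# standing in for the child elements found within one row of the document.
my @rows = (
    [ [ "subject", "Jane" ], [ "test", "A" ], [ "score", "47" ] ],
    [ [ "subject", "Jane" ], [ "test", "C" ] ],    # no score element
);

foreach my $row (@rows)
{
    my @name = map { $_->[0] } @$row;    # column names, in document order
    my @val  = map { $_->[1] } @$row;    # matching column values
    my $stmt = make_insert_stmt ("expt", @name);
    # With a live DBI handle, the values would be bound like this:
    #   $dbh->do ($stmt, undef, @val);
    print "$stmt   [", join (",", @val), "]\n";
}
```

Run as is, the sketch prints one statement per sample row: three placeholders for the first row and two for the second, with the values that would be bound shown in brackets. Keeping the values out of the statement text and passing them separately is what lets DBI handle the quoting.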