xsd2pgschema is a Java application suite, which converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without defects on information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search utilizing the relational data model. File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema.
Obtained PostgreSQL database can be optimized at user's discretion. Moreover, differential update, and XPath query evaluation over PostgreSQL are possible.
PgSchema server, serialized relational data model server, can be used to speed up the analysis of complex XML Schema.
Large XML file can be split through xmlsplitter, a flexible XML splitter based on XPath and StAX.

Features

  • Conform to W3C XML Schema Definition Lauguage 1.1
  • Generete complete PostgreSQL DDL from XML Schema 1.1 (relationally extended)
  • Generate minimum PostgreSQL DDL from XML Schema 1.1 (inlining simple content)
  • Optionally, reduce database size by mapping integers and decimals to primitive types
  • Generate JSON Schema from XML Schema 1.1 (mapping strategy is selectable)
  • Migrate XML data into relational database (PostgreSQL)
  • Assist PostgreSQL index creation based on XML Schema and data size
  • File conversion from XML to CSV, TSV, or JSON
  • Indexing XML contents using Apache Lucene or Sphinx Search (xmlpipe2)
  • Fast XML Schema analysis with help of PgSchema server
  • Parallel data processing of multiple XML documents
  • Parallel XML Schema validtion of multiple XML documents
  • Validate XML Schema itself using W3C's schema for schema
  • Differential update based on modification date and checksum
  • Splitting large XML file for pre-processing
  • XPath 1.0 parser being aware of XML Schema
  • Query translation from XPath 1.0 to SQL (PostgreSQL)
  • XPath 1.0 query evaluation to XML over PostgreSQL
  • XPath 1.0 query evaluation to JSON over PostgreSQL
  • Preserving document order as is
  • No defects on information content including wild card

Project Activity

See All Activity >

Categories

XML, Database, JSON

License

Apache License V2.0

Follow xsd2pgschema

xsd2pgschema Web Site

Other Useful Business Software
Open source. Easy to use. Proven. Complete. Icon
Open source. Easy to use. Proven. Complete.

End to end big data that enables you to spend less time formatting data and more time analyzing it.

Discover HPCC Systems - the truly open source big data solution that allows you to quickly process, analyze and understand large data sets, even data stored in massive, mixed-schema data lakes. Designed by data scientists, HPCC systems is a complete integrated solution from data ingestion and data processing to data delivery. The free online introductory courses and a robust developer community allow you to get started quickly.
Rate This Project
Login To Rate This Project

User Ratings

★★★★★
★★★★
★★★
★★
1
0
0
0
0
ease 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
features 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5
design 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 4 / 5
support 1 of 5 2 of 5 3 of 5 4 of 5 5 of 5 5 / 5

User Reviews

  • Extremely well documented and purpose driven tool.
Read more reviews >

Additional Project Details

Operating Systems

Linux, FreeBSD, Mac, Windows

Languages

English

Intended Audience

Information Technology, Science/Research

User Interface

Console/Terminal

Programming Language

Unix Shell, Java

Database Environment

PostgreSQL (pgsql)

Related Categories

Unix Shell XML Software, Unix Shell Database Software, Unix Shell JSON Software, Java XML Software, Java Database Software, Java JSON Software

Registered

2014-11-28