Wednesday, January 6, 2016

Data Modeling with Neo4j: “School Immunization in California” CSV to Graph

https://www.graphgrid.com/data-modeling-neo4j-school-immunization-california-csv-graph/

1 state, over 9 million children, and 42,981 rows of CSV immunization data. After many rough drafts, I was finally able to land on an efficient and aesthetically pleasing way to map out the immunization data of children in California (found and downloaded online from the California Department of Education*).In this post our goal is to walk through the data modeling process to show how this CSV data can be connected meaningfully with Neo4j. What makes this data so interesting is its varying degrees of location, three distinct grade levels, and a dense record of immunization numbers and percentages-all spanning over two separate school years.
After successfully mapping the data, I could then easily explore it, answering questions such as: Where in California has the lowest amount of children vaccinated?, Are less parents vaccinating their children in 2015 compared to 2014?, and Which age group is more up to date on its vaccinations?. Furthermore, I was able to clearly visualize the data in small and large quantities using the neo4j graph.
Tweet: Some guidelines on how to use @neo4j #graphdb #Cypher MERGE operations consistently and efficiently. http://ctt.ec/T5rcz+

No comments:

Post a Comment