Can Semantics Tech Eliminate the Need for Data Integration?

Loraine Lawson

Much of my adult life, I walk around in denial about what a complete nerd I am. I make excuses-I write about technology, I don't technically work in it. Yeah, I'm on Twitter, Facebook, LinkedIn, and LaLa, but I don't have a Kindle, Treo or Blackberry. And sure, the Origins gaming con was the best time I've had in a long time, and I kicked it old school with D&D -- but I did not dress up for live action role playing (LARP).


Ah, the splitting of semantic hairs. The truth is, I'm four levels in on the Geek Hierarchy, just above Trekkies who speak Klingon and LARPers.


I'm a lost cause, it seems, so I might as well fess up to the fact semantic technology fascinates me, particularly when you move beyond the over-hyped "Web semantics" stuff and look at real applications-like, for instance, this recent article on Bio-IT World.


It's an interview with Ted Slater, who heads a small group of informatics scientists at Pfizer's Indications and Pathways Center of Emphasis (IPCoE). Basically, this group supports Pfizer's pharmaceutical research. Using semantic technologies, the team developed a new system called "Pfizer Environment for Knowledge Engineering," or PEKE.


While we may be a decade or more out from a semantic Web, Gartner believes semantic technology will emerge as one of the 10 most disruptive technologies in the next four years. PEKE makes me think they could be right.


Interestingly, what drove the team to create the PEKE architecture was the simple realization that data integration is like trying to drink from a fire hose-there's too much data, silos are created too quickly, and integration work can't keep up. Or, as Slater explained it:

"We constantly hear that the Holy Grail is complete data integration ... I have bad news - it will never happen! Users are able to set up and start building new, independent repositories of data faster than we can integrate existing data. You will never be able to get it all in one place where it is integrated and usable. The goal instead should be data that are interoperable, even if they are not integrated.

PEKE addresses this by letting Pfizer create new knowledge bases simply and quickly, according to Slater.


The article offers an bird's eye view of how PEKE works. Slater adapted the semantic RDF format to represent the data as a mathematical graph. The team also used open source ontology development tools and Cytoscape, which is also open source, to view the data graphically. They also employed Oracle's RDF data model.


I admit I don't really understand it. I also still don't completely understand how a computer translates 0s and 1s to generate, say, this blog post or an Excel spreadsheet. But I can still appreciate how cool both are, in a completely geeky way.

Subscribe to our Newsletters

Sign up now and get the best business technology insights direct to your inbox.


Add Comment      Leave a comment on this blog post
Apr 27, 2009 10:57 AM John O'Gorman John O'Gorman  says:

Hey Lorraine;

I must admit that I too have had to reconcile my work with the label most of my friends and family attach to me and my work.  Thanks for helping me accept the nakid truth.

I'm working on a framework that may accelerate Gartner's timeline. It is called QQ - nicknamed Quantum Semantics - and like chess (and all languages) it has a small number of moving parts, simple rules and the ability to generate an infinite number of combinations and permutations.

I will keep you aprised of my progress if you like.

John O'

Apr 28, 2009 5:43 PM Loraine Lawson Loraine Lawson  says:

That would be great!

Sep 3, 2009 10:08 AM Felix Van de Maele Felix Van de Maele  says: in response to Loraine Lawson

Hi Loraine,

Great to see your interested in semantic technology and semantic data integration in particular.

I fully agree with you that we should move beyond the web semantics hype. While it has great promises, we are far from there yet.

Instead, I'd like to point you to a presentation of SCA Packaging's use of semantic data integration in their application integration architecture.

Basically, we allow SCA Packaging to define the business semantics (business context if you will) of the different systems, and let every system "commit" itself onto these business semantics. The actual integration is taken care through these shared semantics.

You can find a brief presentation on it here: http://tinyurl.com/d6b27u

I'd love to walk you through it and provide some background information.

Felix Van de Maele




Post a comment





(Maximum characters: 1200). You have 1200 characters left.




Subscribe Daily Edge Newsletters

Sign up now and get the best business technology insights direct to your inbox.

Subscribe Daily Edge Newsletters

Sign up now and get the best business technology insights direct to your inbox.