<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:clearspace="http://www.jivesoftware.com/xmlns/clearspace/rss" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:opensearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>Thinking Inside the Box</title>
    <link>http://www.enzeecommunity.com/blogs/nzblog</link>
    <description>Welcome to "Thoughts from Inside the Box" - Netezza's officially endorsed blog.</description>
    <pubDate>Thu, 22 Jul 2010 11:06:27 GMT</pubDate>
    <generator>Clearspace 2.5.3 (http://jivesoftware.com/products/clearspace/)</generator>
    <dc:date>2010-07-22T11:06:27Z</dc:date>
    <item>
      <title>Hadoop &amp; Netezza: Synergy in Data Analytics – PART 2</title>
      <link>http://www.enzeecommunity.com/blogs/nzblog/2010/07/22/hadoop-netezza-synergy-in-data-analytics-part-2</link>
      <description>&lt;!-- [DocumentBodyStart:6cec7cb9-1c0d-45b4-8197-746774b95b52] --&gt;&lt;div class='jive-rendered-content'&gt;&lt;p&gt;&lt;!--StartFragment--&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: left;"&gt;&lt;span style="font-family: arial, helvetica, sans-serif;"&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: left;"&gt;&lt;span&gt;I mentioned in my previous post that Netezza is excited about our partnership with Cloudera and Hadoop because we’ve already seen some of our customers benefit from the synergy of Hadoop and Netezza TwinFin™ technologies working together.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;As I noted, these types of strategies play to the strengths of both technologies and roughly break down into two categories: 1)&lt;/span&gt; the &lt;a class="jive-link-external-small" href="http://bit.ly/bt4esN"&gt;use of a Hadoop Cluster for data ingestion&lt;/a&gt;&lt;span&gt;, and 2) using a Hadoop Cluster for long-term data retention, which I’m addressing today.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;strong&gt;Netezza TwinFin with a Hadoop Cluster Used for Queryable Archive Analytics&lt;/strong&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;The second pattern we have seen customers deploy is one in which the Hadoop Cluster is used for long-term data retention, or as a “queryable archive”. Here one could think of Hadoop as a complementary analytic extension of the Netezza TwinFin when there is far less premium placed on low-latency or high-performance. In addition to the weblog and unstructured data analysis discussed in Pattern 1, the queryable archive could also retain long-term copies of structured data that had previously been loaded into the high-performance TwinFin appliance.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: center;"&gt;&lt;a href="http://www.enzeecommunity.com/servlet/JiveServlet/showImage/38-1159-1600/Hadoop-NZ+3.jpg"&gt;&lt;img alt="Hadoop-NZ 3.jpg" class="jive-image" src="http://www.enzeecommunity.com/servlet/JiveServlet/downloadImage/38-1159-1600/Hadoop-NZ+3.jpg"/&gt;&lt;/a&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: center;"&gt;&lt;strong&gt;Hadoop Cluster Used for Queryable Archive&lt;/strong&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;!--StartFragment--&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;With a mix of structured, semi-structured and unstructured data loaded across the two complementary systems, customers can alter the level of granularity and data retention periods across each and typically use TwinFin for processing “hot” data and the Hadoop Cluster for processing “cool” or “cold” data, perhaps with specialized analytics. A deployment of this pattern could look like the following diagram:&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;a href="http://www.enzeecommunity.com/servlet/JiveServlet/showImage/38-1159-1601/Hadoop-NZ+Arch+3.jpg"&gt;&lt;img alt="Hadoop-NZ Arch 3.jpg" class="jive-image" src="http://www.enzeecommunity.com/servlet/JiveServlet/downloadImage/38-1159-1601/Hadoop-NZ+Arch+3.jpg"/&gt;&lt;/a&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;!--StartFragment--&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;Readers should view this pair of posts as a “point-in-time” look at the market. Our customers continue to innovate and make use of the complementary strengths of TwinFin and Hadoop. And Netezza will continue to innovate both inside the appliance – adding performance, scale, workload management capabilities and especially with the advanced analytics of i-Class, through partnerships like the one&lt;/span&gt; &lt;span&gt;announced&lt;/span&gt; &lt;a class="jive-link-external-small" href="http://www.netezza.com/releases/2010/release071510.htm"&gt;with Cloudera&lt;/a&gt; &lt;span&gt;a week ago, and through expansion of our platform, software and virtualization capabilities beyond the TwinFin and Skimmer™ appliances. Those innovations should help alter and/or enhance some of the deployment directions discussed here.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;strong&gt;Now, as I said at the outset of these two posts, I’d like to hear from you on your Netezza &amp;amp; Hadoop co-existence deployment and/or compatibility wish-list ideas. What would you like to see?&lt;/strong&gt;&lt;/p&gt;&lt;/div&gt;&lt;!-- [DocumentBodyEnd:6cec7cb9-1c0d-45b4-8197-746774b95b52] --&gt;</description>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">twinfin</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">cloudera</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">data_warehouse</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">i-class</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">performance</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">analytics</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">bi</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">hadoop</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">sql</category>
      <pubDate>Thu, 22 Jul 2010 11:06:27 GMT</pubDate>
      <author>pfrancisco</author>
      <guid>http://www.enzeecommunity.com/blogs/nzblog/2010/07/22/hadoop-netezza-synergy-in-data-analytics-part-2</guid>
      <dc:date>2010-07-22T11:06:27Z</dc:date>
      <clearspace:dateToText>1 month, 1 week ago</clearspace:dateToText>
      <wfw:comment>http://www.enzeecommunity.com/blogs/nzblog/comment/hadoop-netezza-synergy-in-data-analytics-part-2</wfw:comment>
      <wfw:commentRss>http://www.enzeecommunity.com/blogs/nzblog/feeds/comments?blogPost=1159</wfw:commentRss>
    </item>
    <item>
      <title>Hadoop &amp; Netezza: Synergy in Data Analytics Results in New Customer Deployment Trends – PART 1</title>
      <link>http://www.enzeecommunity.com/blogs/nzblog/2010/07/20/hadoop-netezza-synergy-in-data-analytics-results-in-new-customer-deployment-trends-part-1</link>
      <description>&lt;!-- [DocumentBodyStart:3a30bca8-7d8d-4b71-bb45-7609f818dd09] --&gt;&lt;div class='jive-rendered-content'&gt;&lt;p&gt;&lt;!--StartFragment--&gt;&lt;/p&gt;&lt;div class="jive-quote"&gt;&lt;p class="MsoNormal"&gt;&lt;em&gt;Two things before I begin:&lt;/em&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;em&gt;&lt;span&gt;I’ll begin this posting with a call for inputs. Below I will list a few of the most common Hadoop/Netezza co-existence deployment patterns we have seen to date. But I would like to hear from others. As you see the continuing deployment of Hadoop in the enterprise and as the&lt;/span&gt; &lt;a class="jive-link-external-small" href="http://www.netezza.com/releases/2010/release062110_4.htm"&gt;Second Wave of TwinFin™&lt;/a&gt; &lt;span&gt;comes on with the advanced analytics capabilities of i-Class, how do you see the evolving deployment patterns happening in your environment?&lt;/span&gt;&lt;/em&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;&lt;em&gt;&lt;span&gt;&lt;/span&gt;&lt;/em&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;em&gt;&lt;span&gt;A special hat-tip to Krishnan Parasuraman, Netezza’s Chief Architect for our Digital Media group, for his excellent help in aiding and abetting this post! I have used his guidance gratefully and (with his permission) stolen freely from some of his inputs.&lt;/span&gt;&lt;/em&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;You may have noticed a partnership announcement made by&lt;/span&gt; &lt;a class="jive-link-external-small" href="http://www.netezza.com/releases/2010/release071510.htm"&gt;Cloudera and Netezza&lt;/a&gt; &lt;span&gt;late last week. Together with Cloudera, Netezza will open up data movement and transformation between Cloudera’s Distribution for Hadoop and the Netezza family of appliances applications and data flows for integration of the two systems. We expect that our partnership with Cloudera, together with the Hadoop support in Netezza’s i-Class™ set of advanced analytics capabilities that are included as part of the upcoming release 6.0 software release, will lead to some very innovative and expansive applications for our customers and for both companies.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;Even today, Netezza customers are doing some very interesting things with deployment of Hadoop and our TwinFin data warehouse appliance. Far from being the “Hadoop v. SQL” battle that some people might like to make the current market out to be, we have instead noticed a growing number of “co-existence” deployment strategies and design patterns already at work with our customers – particularly among customers in the&lt;/span&gt; &lt;a class="jive-link-external-small" href="http://www.netezza.com/data-warehouse-appliance-industries/digital-media.aspx"&gt;“Digital Media”&lt;/a&gt; &lt;span&gt;vertical market.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;These types of strategies can play to the strengths of both technologies and roughly break down into two categories: 1) the use of a Hadoop Cluster for data ingestion, which I’ll write about in further detail today; and 2) using a Hadoop Cluster for long-term data retention, or as a “queryable archive,” for which I’ll go into further detail in a post later this week.&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;strong&gt;Using a Hadoop Cluster for Raw Data Ingestion&lt;/strong&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;The use of a Hadoop Cluster as the engine for data ingestion is the most common “co-existence” pattern we see in our customers’ mutual deployments of Hadoop and Netezza. The deployment pattern typically arises when the customer has hit specific performance and processing throughput scalability limitations with their existing Data Integration or ETL implementation.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;Raw weblog data is the primary data source for most Digital Media analytics and reporting requirements. Weblogs are data rich (e.g., page views, impressions, click-throughs and demographics collected from applications servers). They are typically semi-structured and collected and stored in flat files.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;There are some critical facts about weblogs that present real performance challenges in processing them:&lt;/span&gt;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;em&gt;&lt;span&gt;sheer volume&lt;/span&gt;&lt;/em&gt;&lt;span&gt;: millions of rows of weblog data collected throughout the day and loaded daily into the data warehouse;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;em&gt;&lt;span&gt;complex query processing&lt;/span&gt;&lt;/em&gt;&lt;span&gt;: parsing and decoding encoded character strings requires text processing, pattern matching, tokenizing type capabilities within the ETL process&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;em&gt;&lt;span&gt;non-conformed dimensions&lt;/span&gt;&lt;/em&gt;&lt;span&gt;: collecting page views or impression data defined and represented differently by various systems makes fitting them into conformed dimensions is another very common data ingestion &amp;amp; processing challenge.&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;There are two common variants of this pattern – dealing with semi-structured (e.g., weblogs) and unstructured (e.g., text) data and often customers will have versions of both variants in operation simultaneously.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: center;"&gt;&lt;a href="http://www.enzeecommunity.com/servlet/JiveServlet/showImage/38-1158-1599/Hadoop-NZ+2.png"&gt;&lt;img alt="Hadoop-NZ 2.png" class="jive-image-thumbnail jive-image" src="http://www.enzeecommunity.com/servlet/JiveServlet/downloadImage/38-1158-1599/Hadoop-NZ+2.png" width="620"/&gt;&lt;/a&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: center;"&gt;&lt;strong&gt;Semi-structured data ingest via Hadoop&lt;/strong&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;Semi-structured data is parsed (and possibly aggregated as well) in the Hadoop Cluster and then loaded into a TwinFin where the performance and workload scaling of the appliance is important for deeper analysis, higher throughput and faster reporting.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: center;"&gt;&lt;span&gt;&lt;a href="http://www.enzeecommunity.com/servlet/JiveServlet/showImage/38-1158-1598/Hadoop-NZ+1.jpg"&gt;&lt;img alt="Hadoop-NZ 1.jpg" class="jive-image" src="http://www.enzeecommunity.com/servlet/JiveServlet/downloadImage/38-1158-1598/Hadoop-NZ+1.jpg"/&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="text-align: center;"&gt;&lt;strong&gt;Unstructured data ingest via Hadoop&lt;/strong&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;Unstructured data in this pattern is contextualized (classified, mined, keyworded and indexed) in Hadoop and then moved into a Netezza TwinFin appliance for the low-latency, high-performance analytics used to drive business decisions.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;A Hadoop Cluster&lt;/span&gt; &lt;span&gt;provides a scalable ingestion mechanism that is well suited for addressing the challenges described above. The Cluster can be incrementally scaled to handle ingesting the massive volumes of weblog data and it can support text processing and complex data processing through programming languages such as Java or Python.&lt;/span&gt; &lt;em&gt;[Note that with the coming i-Class set of analytics functionality, the programmability and some of the complex data processing may also be possible on the TwinFin, depending on a customer’s applications needs or preference.]&lt;/em&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;Following the data ingest steps, processed weblog information is brought into TwinFin as atomic event information or as summarized tables, depending on the size of the appliance and analytic maturity &amp;amp; scale of the organization where it is deployed. A typical deployment might look like the following diagram:&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;a href="http://www.enzeecommunity.com/servlet/JiveServlet/showImage/38-1158-1596/Hadoop-NZ+Arch+1.jpg"&gt;&lt;img alt="Hadoop-NZ Arch 1.jpg" class="jive-image" src="http://www.enzeecommunity.com/servlet/JiveServlet/downloadImage/38-1158-1596/Hadoop-NZ+Arch+1.jpg"/&gt;&lt;/a&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;!--StartFragment--&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;An alternate, far less common, deployment design of the above co-existence pattern is used by some of our customers. That is the use of an external elastic MapReduce cloud (such as the Amazon Cloud) for the data ingestion purposes.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;In cases where the customer may have its application servers in the Amazon’s EC2 cluster, they may also choose to use Amazon’s S3 web services for retaining weblog data. In that case, Amazon would provide the elastic MapReduce infrastructure for the data ingest process into the TwinFin appliance. This alternative deployment scenario would look something like the following:&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;a href="http://www.enzeecommunity.com/servlet/JiveServlet/showImage/38-1158-1595/Hadoop-NZ+Arch+2.jpg"&gt;&lt;img alt="Hadoop-NZ Arch 2.jpg" class="jive-image" src="http://www.enzeecommunity.com/servlet/JiveServlet/downloadImage/38-1158-1595/Hadoop-NZ+Arch+2.jpg"/&gt;&lt;/a&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;!--StartFragment--&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;span&gt;The bottom line is that the different strengths of TwinFin and Hadoop lend themselves to complementary deployments – and some of our customers have already discovered innovative ways to leverage them together to maximize the value of both their investments.&lt;/span&gt;&lt;/p&gt;&lt;p class="MsoNormal" style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;strong&gt;In my next post, I’ll discuss the second pattern we’re noticing: one in which Netezza customers are using the Hadoop Cluster for long-term data retention.&lt;/strong&gt;&lt;span&gt;&lt;/span&gt;&lt;/p&gt;&lt;!--EndFragment--&gt;&lt;!--EndFragment--&gt;&lt;!--EndFragment--&gt;&lt;!--EndFragment--&gt;&lt;!--EndFragment--&gt;&lt;/div&gt;&lt;!-- [DocumentBodyEnd:3a30bca8-7d8d-4b71-bb45-7609f818dd09] --&gt;</description>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">i-class</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">twinfin_i-class</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">twinfin</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">cloudera</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">netezza</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">mapreduce</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">performance</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">data_warehouse</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">advanced_analytics</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">analytics</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">etl</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">hadoop</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">java</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">sql</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">business_intelligence</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">data_integration</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">python</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">price-performance</category>
      <pubDate>Tue, 20 Jul 2010 11:06:27 GMT</pubDate>
      <author>pfrancisco</author>
      <guid>http://www.enzeecommunity.com/blogs/nzblog/2010/07/20/hadoop-netezza-synergy-in-data-analytics-results-in-new-customer-deployment-trends-part-1</guid>
      <dc:date>2010-07-20T11:06:27Z</dc:date>
      <clearspace:dateToText>1 month, 2 weeks ago</clearspace:dateToText>
      <wfw:comment>http://www.enzeecommunity.com/blogs/nzblog/comment/hadoop-netezza-synergy-in-data-analytics-results-in-new-customer-deployment-trends-part-1</wfw:comment>
      <wfw:commentRss>http://www.enzeecommunity.com/blogs/nzblog/feeds/comments?blogPost=1158</wfw:commentRss>
    </item>
    <item>
      <title>"Netezza Underground" - NPS at your bookstore</title>
      <link>http://www.enzeecommunity.com/blogs/nzblog/2008/11/26/netezza-underground-nps-at-your-bookstore</link>
      <description>&lt;!-- [DocumentBodyStart:9ec4ea3b-4294-45dc-9c88-e81c395a5889] --&gt;&lt;div class='jive-rendered-content'&gt;&lt;p&gt;&lt;a href="http://ecx.images-amazon.com/images/I/510z4tpvIML._SL500_AA240_.jpg"&gt;&lt;img src="http://ecx.images-amazon.com/images/I/510z4tpvIML._SL500_AA240_.jpg"/&gt;&lt;/a&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;We had quite a surprise the other day when it came to our attention that Netezza and the NPS data warehouse appliance are now the subjects of a new book: &lt;a class="jive-link-external-small" href="http://www.amazon.com/gp/product/1439207437"&gt;Netezza Underground: The unauthorized tales of derring-do and adventures in resilient data warehousing solutions&lt;/a&gt;, by David Birmingham (ISBN: 1-4392-0743-7 and now available in paperback version for $31.54 at &lt;a class="jive-link-external-small" href="http://www.amazon.com/gp/product/1439207437"&gt;Amazon.com&lt;/a&gt;). &lt;/p&gt;&lt;p&gt; &lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;This is not the first instance of the NPS system being the subject of a book sold by Amazon (e.g., &lt;a class="jive-link-external-small" href="http://www.amazon.com/ACCESS-Supplement-Netezza-Relational-Databases/dp/1599942771/"&gt;SAS/ACCESS(R) 9.1.3 Supplement for Netezza&lt;/a&gt;), but this particular publication certainly brought feelings of both fun and reaching into the mainstream with it, starting right from it's very clever cover art (above) to David's clever turns of phrase and real-life examples. &lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;As the title suggests, it was not written or coordinated with any Netezza authorization. So of course we bought a copy and read/skimmed through it as quickly as we could. I will say this, David's self-publication skills are great - he keeps what could easily have been a boring, heavy technical tome both engaging and fun to read while still imparting lots of great information about the NPS system, its performance and its ease of operation. And the book's publication is incredibly current - with references to Netezza Developer Network and "BI Appliance" announcements made only as recently as the Enzee Universe user conference in September. &lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;While I certainly could quibble with a point made here or there about the system, in general I thought it was an excellent book and even put up the following recommendation for it on the Amazon site: &lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;blockquote class="jive-quote"&gt;&lt;p&gt;&lt;em&gt;I commend David Birmingham on a book that is at once as lightly entertaining and interesting to read as it is chock full of details about just the kind of performance and operational simplicity that is possible with the Netezza Performance Server (NPS) system. Straightaway from the opening pages, Birmingham's effusive, engaging style and excitement about Netezza's system is apparent, "It inhales, crunches and publishes Libraries-of-Congress-at-a-time - and fast."&lt;/em&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;blockquote class="jive-quote"&gt;&lt;p&gt;&lt;em&gt;He also captures the essence of the NPS appliance in an ultra-succinct two-sentence paragraph explaining just why his "Administration Stuff" chapter is so short, "It's an appliance. Put it in the corner and let it work." I couldn't have said it better myself!&lt;/em&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;blockquote class="jive-quote"&gt;&lt;p&gt;&lt;em&gt;This book is comprehensive and current - even reflecting some of the more recent announcements from Netezza regarding OnStream programmability, the Netezza Developer Network and analytic appliances.&lt;/em&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;blockquote class="jive-quote"&gt;&lt;p&gt;&lt;em&gt;As the guy who is responsible for projecting the Netezza products and our technology direction forward, I want to recommend David Birmingham's book to current and prospective customers and partners alike, or as David himself says on the book's Dedication page, "to Enzees everywhere".&lt;/em&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;blockquote class="jive-quote"&gt;&lt;p&gt;&lt;em&gt;--Phil Francisco, VP Product Management &amp;amp; Marketing, Netezza Corporation&lt;/em&gt;&lt;/p&gt;&lt;/blockquote&gt;&lt;p&gt;So "to &lt;strong&gt;&lt;em&gt;Enzees&lt;/em&gt;&lt;/strong&gt; everywhere", have a read of David's book and welcome to the &lt;a class="jive-link-external-small" href="http://www.amazon.com/gp/product/1439207437"&gt;"Netezza Underground"&lt;/a&gt;.&lt;/p&gt;&lt;/div&gt;&lt;!-- [DocumentBodyEnd:9ec4ea3b-4294-45dc-9c88-e81c395a5889] --&gt;</description>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">netezza</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">nps</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">enzee</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">birmingham</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">david_birmingham</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">sql</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">bi</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">business_intelligence</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">data_warehouse</category>
      <category domain="http://www.enzeecommunity.com/blogs/nzblog/tags">etl</category>
      <pubDate>Wed, 26 Nov 2008 20:37:04 GMT</pubDate>
      <author>pfrancisco</author>
      <guid>http://www.enzeecommunity.com/blogs/nzblog/2008/11/26/netezza-underground-nps-at-your-bookstore</guid>
      <dc:date>2008-11-26T20:37:04Z</dc:date>
      <clearspace:dateToText>1 year, 9 months ago</clearspace:dateToText>
      <clearspace:replyCount>2</clearspace:replyCount>
      <wfw:comment>http://www.enzeecommunity.com/blogs/nzblog/comment/netezza-underground-nps-at-your-bookstore</wfw:comment>
      <wfw:commentRss>http://www.enzeecommunity.com/blogs/nzblog/feeds/comments?blogPost=1055</wfw:commentRss>
    </item>
  </channel>
</rss>

