<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/'><id>tag:blogger.com,1999:blog-8898949683610477251.post7570203001448264185..comments</id><updated>2010-04-06T23:00:34.785+01:00</updated><title type='text'>Comments on Tom White: Elastic Hadoop Clusters with Amazon's Elastic Bloc...</title><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://www.lexemetech.com/feeds/7570203001448264185/comments/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html'/><author><name>Tom White</name><uri>http://www.blogger.com/profile/02418758537880869494</uri><email>noreply@blogger.com</email></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>9</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-3381972836419486633</id><published>2010-04-06T23:00:34.785+01:00</published><updated>2010-04-06T23:00:34.785+01:00</updated><title type='text'>Any more insight on this front (Hadoop on EC2 with...</title><content type='html'>Any more insight on this front (Hadoop on EC2 with EBS or not)?&lt;br /&gt;  &lt;br /&gt;&lt;br /&gt;I just finished reading your book and about to finish up my Cloudera training in NY (tomorrow) and looking to setup Hadoop in our production network as well as an instance on EC2 (which I need to have persistant).  &lt;br /&gt;&lt;br /&gt;I saw a beta [maybe ok for our AWS deploy] with EBS on the Cloudera website for persistance and was not sure that was the latest and greatest?</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/3381972836419486633'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/3381972836419486633'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html?showComment=1270591234785#c3381972836419486633' title=''/><author><name>Joe Stein</name><uri>http://www.blogger.com/profile/16489566240096957255</uri><email>noreply@blogger.com</email></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7570203001448264185' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7570203001448264185' type='text/html'/></entry><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-8370539441615999059</id><published>2009-09-23T15:51:15.256+01:00</published><updated>2009-09-23T15:51:15.256+01:00</updated><title type='text'>Tom,

I am looking heavily into EBS volumes. Quest...</title><content type='html'>Tom,&lt;br /&gt;&lt;br /&gt;I am looking heavily into EBS volumes. Question.  An EBS volume is in itself RAIDED. Typically hadoop datanodes and taskstrackers are suggested with no RAID and individual disks. What is the EBS suggestion? If you need 800GB of storage should you construct 8 10GB volumes or 1 800GB or 2 400TB?&lt;br /&gt;Is there a way of knowing where the EBS volume physically lives? &lt;br /&gt;&lt;br /&gt;Thanks Edward</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/8370539441615999059'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/8370539441615999059'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html?showComment=1253717475256#c8370539441615999059' title=''/><author><name>Edward</name><uri>http://www.blogger.com/profile/08276231429883104403</uri><email>noreply@blogger.com</email></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7570203001448264185' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7570203001448264185' type='text/html'/></entry><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-326118397225648185</id><published>2009-01-30T21:02:00.000Z</published><updated>2009-01-30T21:02:00.000Z</updated><title type='text'>Jerry,I was referring to the arbitrary user data y...</title><content type='html'>Jerry,&lt;BR/&gt;&lt;BR/&gt;I was referring to the arbitrary &lt;A HREF="http://docs.amazonwebservices.com/AWSEC2/2007-03-01/DeveloperGuide/AESDG-chapter-instancedata.html" REL="nofollow"&gt;user data&lt;/A&gt; you can pass to an EC2 instance on boot, and whether that would work as a way of telling the instance which volume to attach. Another way would be to attach the volume to the instance from the client.&lt;BR/&gt;&lt;BR/&gt;Tom</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/326118397225648185'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/326118397225648185'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html?showComment=1233349320000#c326118397225648185' title=''/><author><name>Tom White</name><uri>http://www.blogger.com/profile/02418758537880869494</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='13156079507590169990'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7570203001448264185' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7570203001448264185' type='text/html'/></entry><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-3167648433720655247</id><published>2009-01-29T05:18:00.000Z</published><updated>2009-01-29T05:18:00.000Z</updated><title type='text'>Thank you, Tom for this post. It is very helpful. ...</title><content type='html'>Thank you, Tom for this post. It is very helpful. &lt;BR/&gt;&lt;BR/&gt;The only thing I am confused about is how to tell an instance what EBS volume it should use. You mentioned "user data" yet I couldn't find any documentation on how to specify the EBS volume. Can you kindly shed some light on that?&lt;BR/&gt;&lt;BR/&gt;Thanks,&lt;BR/&gt;Jerry Lewis</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/3167648433720655247'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/3167648433720655247'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html?showComment=1233206280000#c3167648433720655247' title=''/><author><name>Anonymous</name><email>noreply@blogger.com</email></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7570203001448264185' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7570203001448264185' type='text/html'/></entry><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-6165548849579145330</id><published>2008-09-30T23:05:00.000+01:00</published><updated>2008-09-30T23:05:00.000+01:00</updated><title type='text'>Perhaps a new section for the book?I just bought t...</title><content type='html'>Perhaps a new section for the book?&lt;BR/&gt;&lt;BR/&gt;I just bought the 'rough drafts' on Safari -- keep the revisions coming.</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/6165548849579145330'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/6165548849579145330'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html?showComment=1222812300000#c6165548849579145330' title=''/><author><name>yourabi</name><uri>http://www.blogger.com/profile/17026881510091057284</uri><email>noreply@blogger.com</email></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7570203001448264185' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7570203001448264185' type='text/html'/></entry><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-4500283007201505173</id><published>2008-09-12T20:49:00.000+01:00</published><updated>2008-09-12T20:49:00.000+01:00</updated><title type='text'>Yousef,Hmm, you're right. I think someone needs to...</title><content type='html'>Yousef,&lt;BR/&gt;&lt;BR/&gt;Hmm, you're right. I think someone needs to implement Hadoop on EBS and run a benchmark to compare it to EC2 local disks.</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/4500283007201505173'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/4500283007201505173'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html?showComment=1221248940000#c4500283007201505173' title=''/><author><name>Tom White</name><uri>http://www.blogger.com/profile/02418758537880869494</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='13156079507590169990'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7570203001448264185' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7570203001448264185' type='text/html'/></entry><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-331320855480870021</id><published>2008-09-07T02:21:00.000+01:00</published><updated>2008-09-07T02:21:00.000+01:00</updated><title type='text'>You do realize that these two statements are contr...</title><content type='html'>You do realize that these two statements are contradictory, right?&lt;BR/&gt;&lt;BR/&gt;&lt;BR/&gt;EBS does not have the bandwidth of local disks&lt;BR/&gt;&lt;BR/&gt;...&lt;BR/&gt;&lt;BR/&gt;    The bottom line though is that performance exceeds what we’ve seen for filesystems striped across the four local drives of x-large instances.</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/331320855480870021'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/331320855480870021'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html?showComment=1220750460000#c331320855480870021' title=''/><author><name>Yousef Ourabi</name><uri>http://yousefourabi.com</uri><email>noreply@blogger.com</email></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7570203001448264185' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7570203001448264185' type='text/html'/></entry><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-4199698789512654950</id><published>2008-08-26T03:14:00.000+01:00</published><updated>2008-08-26T03:14:00.000+01:00</updated><title type='text'>Thanks Tom, this is very timely and helpful.</title><content type='html'>Thanks Tom, this is very timely and helpful.</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/4199698789512654950'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/4199698789512654950'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html?showComment=1219716840000#c4199698789512654950' title=''/><author><name>Allen Day</name><uri>http://spicylogic.com/allenday/blog/</uri><email>noreply@blogger.com</email></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7570203001448264185' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7570203001448264185' type='text/html'/></entry><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-8329573934305748797</id><published>2008-08-24T10:53:00.000+01:00</published><updated>2008-08-24T10:53:00.000+01:00</updated><title type='text'>EBS is replicated, so HDFS replication is redundan...</title><content type='html'>EBS is replicated, so HDFS replication is redundant.  One could simply set the replication level to one, to inhibit HDFS replication, but then if an EC2 instance were to crash files would be incomplete.  One could instead set the replication level to two, to reduce redundant replicaton, or perhaps leave replication at one, closely monitoring datanode instances and re-mount volumes elsewhere when one dies.</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/8329573934305748797'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7570203001448264185/comments/default/8329573934305748797'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html?showComment=1219571580000#c8329573934305748797' title=''/><author><name>Doug Cutting</name><uri>http://www.blogger.com/profile/17556364956409186372</uri><email>noreply@blogger.com</email></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7570203001448264185' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7570203001448264185' type='text/html'/></entry></feed>