<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/'><id>tag:blogger.com,1999:blog-8898949683610477251.post7098839651234570081..comments</id><updated>2011-12-07T20:31:03.960Z</updated><category term='Mobile'/><category term='Query Languages'/><category term='Broadband'/><category term='Lucene'/><category term='Visualization'/><category term='MapReduce'/><category term='Cloud Computing'/><category term='Family'/><category term='Regular Expressions'/><category term='Web Services'/><category term='Music'/><category term='Hashing'/><category term='RPC'/><category term='Thrift'/><category term='Java'/><category term='Cloudera'/><category term='Open Source'/><category term='Testing'/><category term='Amazon Web Services'/><category term='Distributed Systems'/><category term='Amazon EC2'/><category term='Amazon S3'/><category term='Conferences'/><category term='Quantum Mechanics'/><category term='Data'/><category term='Whirr'/><category term='Hadoop'/><category term='HBase'/><category term='Hardware'/><category term='Apache'/><category term='Easter'/><category term='Book'/><category term='Serialization'/><title type='text'>Comments on Tom White: Hadoop and Log File Analysis</title><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://www.lexemetech.com/feeds/7098839651234570081/comments/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7098839651234570081/comments/default'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/01/hadoop-and-log-file-analysis.html'/><author><name>Tom White</name><uri>http://www.blogger.com/profile/02418758537880869494</uri><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://farm2.static.flickr.com/1358/822201572_051b33f802_s.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>4</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-2306529600657558230</id><published>2011-06-01T17:58:10.046+01:00</published><updated>2011-06-01T17:58:10.046+01:00</updated><title type='text'>Hey Tony, here&amp;#39;s a fairly easy doc with how to...</title><content type='html'>Hey Tony, here&amp;#39;s a fairly easy doc with how to load your system and app logs into Hadoop (&lt;a href="http://hive.apache.org/" rel="nofollow"&gt;Apache Hive&lt;/a&gt;) for SQL-style analysis:&lt;br /&gt;&lt;a href="http://help.papertrailapp.com/kb/analytics/log-analytics-with-hadoop-and-hive" rel="nofollow"&gt;Log analytics with Hadoop and Hive&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The doc is written for customers of our hosted &amp;quot;log aggregator in the cloud&amp;quot; service, &lt;a href="http://papertrailapp.com/" rel="nofollow"&gt;Papertrail&lt;/a&gt;, but it covers how the data gets there, the formatting (TSV), and Hive&amp;#39;s LOAD DATA INFILE process.&lt;br /&gt;&lt;br /&gt;We&amp;#39;ve been spoiled by Amazon Elastic MapReduce, but Cloudera&amp;#39;s &lt;a href="https://ccp.cloudera.com/display/CDHDOC/Hive+Installation" rel="nofollow"&gt;distro&lt;/a&gt; is very powerful and would work fine.&lt;br /&gt;&lt;br /&gt;Whether or not you&amp;#39;re logging to Papertrail, something like that would be my recommendation for reporting.</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7098839651234570081/comments/default/2306529600657558230'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7098839651234570081/comments/default/2306529600657558230'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/01/hadoop-and-log-file-analysis.html?showComment=1306947490046#c2306529600657558230' title=''/><author><name>Troy</name><uri>http://papertrailapp.com/</uri><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img1.blogblog.com/img/blank.gif'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/01/hadoop-and-log-file-analysis.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7098839651234570081' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7098839651234570081' type='text/html'/><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='blogger.itemClass' value='pid-662608984'/></entry><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-3805256354296746088</id><published>2010-04-03T20:23:23.834+01:00</published><updated>2010-04-03T20:23:23.834+01:00</updated><title type='text'>At &lt;a href="http://loggly.com" rel="nofollow"&gt;Logg...</title><content type='html'>At &lt;a href="http://loggly.com" rel="nofollow"&gt;Loggly&lt;/a&gt; we are building a cloud-based log management an analysis platform. You will not only be able to run large-scale processes (through map-reduce), but you can do much more based on our real-time indexing and data processing capabilities. In order to enable you to build your own use-cases around your log data, we provide an API that you can use to access all the data that you have Loggly manage for you. Would be curious to get your feedback!</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7098839651234570081/comments/default/3805256354296746088'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7098839651234570081/comments/default/3805256354296746088'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/01/hadoop-and-log-file-analysis.html?showComment=1270322603834#c3805256354296746088' title=''/><author><name>Raffy</name><uri>http://loggly.com</uri><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img1.blogblog.com/img/blank.gif'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/01/hadoop-and-log-file-analysis.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7098839651234570081' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7098839651234570081' type='text/html'/><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='blogger.itemClass' value='pid-371574985'/></entry><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-1310968834646413361</id><published>2010-03-09T16:26:51.381Z</published><updated>2010-03-09T16:26:51.381Z</updated><title type='text'>Tom, what would be needed for an enterprise log ma...</title><content type='html'>Tom, what would be needed for an enterprise log management system using Hadoop? Can it source any log type including custom? Do I need to build the interfaces? Does Hadoop include a query reporting system? Otherwise, what are your recommendations for query/reporting?</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7098839651234570081/comments/default/1310968834646413361'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7098839651234570081/comments/default/1310968834646413361'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/01/hadoop-and-log-file-analysis.html?showComment=1268152011381#c1310968834646413361' title=''/><author><name>Tony Czarnik</name><uri>http://www.savidtech.com</uri><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img1.blogblog.com/img/blank.gif'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/01/hadoop-and-log-file-analysis.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7098839651234570081' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7098839651234570081' type='text/html'/><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='blogger.itemClass' value='pid-592315239'/></entry><entry><id>tag:blogger.com,1999:blog-8898949683610477251.post-3305438287361821514</id><published>2008-02-01T09:23:00.000Z</published><updated>2008-02-01T09:23:00.000Z</updated><title type='text'>Shortly after I wrote this, &lt;a href="http://highsc...</title><content type='html'>Shortly after I wrote this, &lt;A HREF="http://highscalability.com/" REL="nofollow"&gt;High Scalability&lt;/A&gt; covered the story with &lt;A HREF="http://highscalability.com/how-rackspace-now-uses-mapreduce-and-hadoop-query-terabytes-data" REL="nofollow"&gt;extra details&lt;/A&gt; from Mailtrust CTO &lt;A HREF="http://billboebel.typepad.com/blog/" REL="nofollow"&gt;Bill Boebel&lt;/A&gt;. Well worth a read.</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7098839651234570081/comments/default/3305438287361821514'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8898949683610477251/7098839651234570081/comments/default/3305438287361821514'/><link rel='alternate' type='text/html' href='http://www.lexemetech.com/2008/01/hadoop-and-log-file-analysis.html?showComment=1201857780000#c3305438287361821514' title=''/><author><name>Tom White</name><uri>http://www.blogger.com/profile/02418758537880869494</uri><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='http://farm2.static.flickr.com/1358/822201572_051b33f802_s.jpg'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://www.lexemetech.com/2008/01/hadoop-and-log-file-analysis.html' ref='tag:blogger.com,1999:blog-8898949683610477251.post-7098839651234570081' source='http://www.blogger.com/feeds/8898949683610477251/posts/default/7098839651234570081' type='text/html'/><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='blogger.itemClass' value='pid-2080114506'/></entry></feed>
