<?xml version="1.0" encoding="utf-8"?>
<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title>Data Wrangling Blog - Latest Comments</title><link>http://datawranglingblog.disqus.com/</link><description></description><atom:link href="https://datawranglingblog.disqus.com/comments.rss" rel="self"></atom:link><language>en</language><lastBuildDate>Mon, 18 Nov 2013 10:36:52 -0000</lastBuildDate><item><title>Re: Some Datasets Available on the Web</title><link>http://www.datawrangling.com/some-datasets-available-on-the-web#comment-1128157470</link><description>&lt;p&gt;one more link: &lt;a href="http://endb-consolidated.aihit.com/datasets.htm" rel="nofollow noopener" target="_blank" title="http://endb-consolidated.aihit.com/datasets.htm"&gt;http://endb-consolidated.ai...&lt;/a&gt; random 10,000 worldwide companies sampled from HitCompanies (all data in this DB extracted and updated automatically from WWW using AI and Machine Learning): company name and aliases, company description, industry tags, industry codes, registration numbers, addresses, phone numbers, VAT numbers, website, number of about/contact/management/product pages, incorporation date, team size, number of clients and partners, number of emails, number of key changes (client/partner changes, contact changes, people changes), and many more.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Yuri Burger</dc:creator><pubDate>Mon, 18 Nov 2013 10:36:52 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-316963998</link><description>&lt;p&gt;Did anyone solve this problem?  
I had the same problem, and I downloaded and unpacked the libraries, but perhaps the Python library needs to be _placed_ somewhere?&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Lukearron</dc:creator><pubDate>Wed, 21 Sep 2011 18:32:52 -0000</pubDate></item><item><title>Re: Some Datasets Available on the Web</title><link>http://www.datawrangling.com/some-datasets-available-on-the-web#comment-281137723</link><description>&lt;p&gt;Where can I find example datasets on the inner workings of an insurance company or bank?&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Modarres Zadeh</dc:creator><pubDate>Tue, 09 Aug 2011 07:38:02 -0000</pubDate></item><item><title>Re: Some Datasets Available on the Web</title><link>http://www.datawrangling.com/some-datasets-available-on-the-web#comment-281108468</link><description>&lt;p&gt;Can anyone help me get a dataset that describes effort estimation for web projects, including the following attributes?&lt;br&gt;TypeProj (Categorical): Type of project (new or enhancement).&lt;br&gt;nLang (Ratio): Number of different development languages used.&lt;br&gt;DocProc (Categorical): If project followed defined and documented process.&lt;br&gt;ProImpr (Categorical): If project team involved in a process improvement programme.&lt;br&gt;Metrics (Categorical): If project team part of a software metrics programme.&lt;br&gt;DevTeam (Ratio): Size of a project’s development team.&lt;br&gt;TeamExp (Ratio): Average team experience with the development language(s) employed.&lt;br&gt;TotEffort (Ratio): Actual total effort in person hours used to develop an application.&lt;br&gt;EstEffort (Ratio): Estimated total effort in person hours to develop an application.&lt;br&gt;Accuracy (Categorical): Procedure used to record effort data.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Sanjay Kushwaha</dc:creator><pubDate>Tue, 09 Aug 2011 06:12:39 -0000</pubDate></item><item><title>Re: Hidden Video Courses in Math, Science, and 
Engineering</title><link>http://www.datawrangling.com/hidden-video-courses-in-math-science-and-engineering#comment-269610105</link><description>&lt;p&gt;I was aware of many of these, but the UNM Quantum Field Theory course is an awesome find. Thanks. I now have a new series of videos to work my way through.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Alberto</dc:creator><pubDate>Sat, 30 Jul 2011 21:44:18 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-250815538</link><description>&lt;p&gt;Dear Sir,&lt;/p&gt;&lt;p&gt;I am trying to use MPI to run data-parallel applications. I read this post and, on following up, I have a few queries. It would be great if I could get answers to them.&lt;/p&gt;&lt;p&gt;a) As of now, I am choosing the AMI using Amazon's browser interface. You have uploaded two AMIs, one for the master (ami-e813f681) and another for the slave (ami-eb13f682). When I launch an instance, I can only choose one. Which one should I choose? (Or how should I choose both, in case I need to?) Do these AMIs include an Open MPI implementation?&lt;/p&gt;&lt;p&gt;b) Secondly, if I launch multiple Large CPU instances (say 3 instances), I would get many nodes (say m nodes per instance, hence m*3 nodes in total). Can I communicate between nodes of different instances just as we normally do in MPI?&lt;/p&gt;&lt;p&gt;c) Are you aware of any MPI image with Ubuntu?&lt;/p&gt;&lt;p&gt;d) I am using the browser, since I have to connect through a proxy. 
How and where do I set proxy connections if I need to start the cluster using your Python scripts?&lt;/p&gt;&lt;p&gt;Easwar&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Easwar</dc:creator><pubDate>Wed, 13 Jul 2011 10:56:33 -0000</pubDate></item><item><title>Re: Hidden Video Courses in Math, Science, and Engineering</title><link>http://www.datawrangling.com/hidden-video-courses-in-math-science-and-engineering#comment-194037170</link><description>&lt;p&gt;Yes, you got the point exactly! There are thousands of free video lectures out there, but they are mostly at a very elementary level. Hope more advanced stuff will be available in the future.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Marco</dc:creator><pubDate>Fri, 29 Apr 2011 06:42:57 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-70898888</link><description>&lt;p&gt;Hi,&lt;/p&gt;&lt;p&gt;Does anyone have this running under Cygwin on Windows?&lt;/p&gt;&lt;p&gt;If so, can you please post your code for &lt;a href="http://ec2-mpi-config.py?" rel="nofollow noopener" target="_blank" title="ec2-mpi-config.py?"&gt;ec2-mpi-config.py?&lt;/a&gt; I tried using the current file but get lots of errors.&lt;/p&gt;&lt;p&gt;Thanks,&lt;br&gt;John&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Johnliu</dc:creator><pubDate>Mon, 23 Aug 2010 23:19:59 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-70014559</link><description>&lt;p&gt;OK, I figured it out. 
The connection was being made through my default security group, where port 22 was not open.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">sumu789</dc:creator><pubDate>Thu, 19 Aug 2010 12:00:08 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-69997433</link><description>&lt;p&gt;I have successfully connected to 5 nodes but am having trouble with the &lt;a href="http://ec2-mpi-config.py" rel="nofollow noopener" target="_blank" title="ec2-mpi-config.py"&gt;ec2-mpi-config.py&lt;/a&gt; script. When I run it, I repeatedly get the following:&lt;/p&gt;&lt;p&gt;---- MPI Cluster Details ----&lt;br&gt;Numer of nodes = 5&lt;br&gt;Instance= i-8d2949e7 external_name = &lt;a href="http://ec2-184-72-183-113.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="ec2-184-72-183-113.compute-1.amazonaws.com"&gt;ec2-184-72-183-113.compute-...&lt;/a&gt; hostname= ip-10-212-239-33.ec2.internal state= &lt;br&gt;Instance= i-832949e9 external_name = &lt;a href="http://ec2-174-129-138-43.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="ec2-174-129-138-43.compute-1.amazonaws.com"&gt;ec2-174-129-138-43.compute-...&lt;/a&gt; hostname= domU-12-31-39-10-6C-13.compute-1.internal state= &lt;br&gt;Instance= i-812949eb external_name = &lt;a href="http://ec2-184-72-141-241.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="ec2-184-72-141-241.compute-1.amazonaws.com"&gt;ec2-184-72-141-241.compute-...&lt;/a&gt; hostname= domU-12-31-39-0B-00-F8.compute-1.internal state= &lt;br&gt;Instance= i-872949ed external_name = &lt;a href="http://ec2-174-129-131-171.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="ec2-174-129-131-171.compute-1.amazonaws.com"&gt;ec2-174-129-131-171.compute...&lt;/a&gt; hostname= domU-12-31-39-09-C4-24.compute-1.internal state= 
&lt;br&gt;Instance= i-852949ef external_name = &lt;a href="http://ec2-174-129-61-151.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="ec2-174-129-61-151.compute-1.amazonaws.com"&gt;ec2-174-129-61-151.compute-...&lt;/a&gt; hostname= domU-12-31-39-0C-D8-57.compute-1.internal state= &lt;br&gt;5&lt;/p&gt;&lt;p&gt;The master node is &lt;a href="http://ec2-184-72-183-113.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="ec2-184-72-183-113.compute-1.amazonaws.com"&gt;ec2-184-72-183-113.compute-...&lt;/a&gt;&lt;/p&gt;&lt;p&gt;Writing out mpd.hosts file&lt;/p&gt;&lt;p&gt;scp -i id_rsa-gsg-keypair -o "StrictHostKeyChecking no" id_rsa-gsg-keypair root@ec2&lt;a href="http://-184-72-183-113.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="-184-72-183-113.compute-1.amazonaws.com"&gt;-184-72-183-113.compute-1.a...&lt;/a&gt;:~/.ssh/id_rsa-gsg-keypair&lt;/p&gt;&lt;p&gt;ssh: connect to host &lt;a href="http://ec2-184-72-183-113.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="ec2-184-72-183-113.compute-1.amazonaws.com"&gt;ec2-184-72-183-113.compute-...&lt;/a&gt; port 22: Connection timed out&lt;br&gt;lost connection&lt;/p&gt;&lt;p&gt;ssh -o "StrictHostKeyChecking no" root@ec2&lt;a href="http://-184-72-183-113.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="-184-72-183-113.compute-1.amazonaws.com"&gt;-184-72-183-113.compute-1.a...&lt;/a&gt; "touch .ssh/authorized_keys"&lt;/p&gt;&lt;p&gt;ssh: connect to host &lt;a href="http://ec2-184-72-183-113.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="ec2-184-72-183-113.compute-1.amazonaws.com"&gt;ec2-184-72-183-113.compute-...&lt;/a&gt; port 22: Connection timed out&lt;/p&gt;&lt;p&gt;ssh -o "StrictHostKeyChecking no" root@ec2&lt;a href="http://-184-72-183-113.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" 
title="-184-72-183-113.compute-1.amazonaws.com"&gt;-184-72-183-113.compute-1.a...&lt;/a&gt; "cp -r .ssh /home/lamuser/"&lt;/p&gt;&lt;p&gt;It seems I'm having a connection problem. Does anyone know what I can do about this?&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Supriyamunshaw</dc:creator><pubDate>Thu, 19 Aug 2010 10:43:44 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-69464830</link><description>&lt;p&gt;Can someone please help? &lt;a href="http://mc2-mpi-config.py" rel="nofollow noopener" target="_blank" title="mc2-mpi-config.py"&gt;mc2-mpi-config.py&lt;/a&gt; is giving the following error:&lt;/p&gt;&lt;p&gt;---- MPI Cluster Details ----&lt;br&gt;Numer of nodes = 2&lt;br&gt;Instance= i-bf90f5d5 external_name = &lt;a href="http://ec2-184-73-36-216.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="ec2-184-73-36-216.compute-1.amazonaws.com"&gt;ec2-184-73-36-216.compute-1...&lt;/a&gt; hostname= ip-10-242-118-239.ec2.internal state= running&lt;br&gt;Instance= i-bd90f5d7 external_name = &lt;a href="http://ec2-174-129-77-216.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="ec2-174-129-77-216.compute-1.amazonaws.com"&gt;ec2-174-129-77-216.compute-...&lt;/a&gt; hostname= ip-10-242-117-139.ec2.internal state= running&lt;/p&gt;&lt;p&gt;The master node is &lt;a href="http://ec2-184-73-36-216.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="ec2-184-73-36-216.compute-1.amazonaws.com"&gt;ec2-184-73-36-216.compute-1...&lt;/a&gt;&lt;/p&gt;&lt;p&gt;Writing out mpd.hosts file&lt;br&gt;Traceback (most recent call last):&lt;br&gt;  File "&lt;a href="http://ec2-mpi-config.py" rel="nofollow noopener" target="_blank" title="ec2-mpi-config.py"&gt;ec2-mpi-config.py&lt;/a&gt;", line 210, in &amp;lt;module&amp;gt;&lt;br&gt;    
sys.exit(main())&lt;br&gt;  File "&lt;a href="http://ec2-mpi-config.py" rel="nofollow noopener" target="_blank" title="ec2-mpi-config.py"&gt;ec2-mpi-config.py&lt;/a&gt;", line 65, in main&lt;br&gt;    configure()&lt;br&gt;  File "&lt;a href="http://ec2-mpi-config.py" rel="nofollow noopener" target="_blank" title="ec2-mpi-config.py"&gt;ec2-mpi-config.py&lt;/a&gt;", line 151, in configure&lt;br&gt;    rsakeys = open(homedir + "/.ssh/id_rsa", 'r').read()&lt;br&gt;IOError: [Errno 2] No such file or directory: 'C:\\Documents and Settings\\Yunzhi Ma/.ssh/id_rsa'&lt;/p&gt;&lt;p&gt;Could this possibly be due to anything about chunk and parsed_response? I printed out parsedresponse:&lt;/p&gt;&lt;p&gt; [['RESERVATION', 'r-30733b5b', '219669225938', 'default'], ['INSTANCE', 'i-bf90f5d5', 'ami-e813f681'&lt;br&gt;, '&lt;a href="http://ec2-184-73-36-216.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="ec2-184-73-36-216.compute-1.amazonaws.com"&gt;ec2-184-73-36-216.compute-1...&lt;/a&gt;', 'ip-10-242-118-239.ec2.internal', 'running'], ['RESER&lt;br&gt;VATION', 'r-36733b5d', '219669225938', 'default'], ['INSTANCE', 'i-bd90f5d7', 'ami-eb13f682', 'ec2-1&lt;br&gt;&lt;a href="http://74-129-77-216.compute-1.amazonaws.com" rel="nofollow noopener" target="_blank" title="74-129-77-216.compute-1.amazonaws.com"&gt;74-129-77-216.compute-1.ama...&lt;/a&gt;', 'ip-10-242-117-139.ec2.internal', 'running']]&lt;/p&gt;&lt;p&gt;Thanks so much,&lt;br&gt;Henry&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Henrywang41</dc:creator><pubDate>Tue, 17 Aug 2010 21:51:57 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-69256901</link><description>&lt;p&gt;Hi,&lt;/p&gt;&lt;p&gt;Amazon just recently (last month) released a cloud computing instance (&lt;a 
href="http://developer.amazonwebservices.com/connect/ann.jspa?annID=718)" rel="nofollow noopener" target="_blank" title="http://developer.amazonwebservices.com/connect/ann.jspa?annID=718)"&gt;http://developer.amazonwebs...&lt;/a&gt;&lt;/p&gt;&lt;p&gt;Does the code and what you describe here work for this new released instance for HPC? (It's CentOS HVM AMI, ami-7ea24a17 under U.S. East)&lt;/p&gt;&lt;p&gt;Thanks,&lt;/p&gt;&lt;p&gt;Henry&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Henrywang41</dc:creator><pubDate>Mon, 16 Aug 2010 23:56:20 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-69254750</link><description>&lt;p&gt;Hi,&lt;/p&gt;&lt;p&gt;Is Elasticwulf and MPI essentially the same thing? I'm trying to run some high performance computing using an Amazon EC2 cluster.&lt;/p&gt;&lt;p&gt;Also, is boto necessary to set up a cluster on amazon EC2? What's the difference between boto and Elasticwulf?&lt;/p&gt;&lt;p&gt;Thanks,&lt;/p&gt;&lt;p&gt;Henry&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Henrywang41</dc:creator><pubDate>Mon, 16 Aug 2010 23:44:05 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-69243905</link><description>&lt;p&gt;Hi,&lt;/p&gt;&lt;p&gt;What exactly is the difference between Elasticwulf and MPI? Are they the same thing? 
I'm trying to launch a cluster for HPC, which one is more suitable?&lt;/p&gt;&lt;p&gt;Also, is boto necessary too for launching a cluster?&lt;/p&gt;&lt;p&gt;Thanks,&lt;br&gt;Henry&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Henrywang41</dc:creator><pubDate>Mon, 16 Aug 2010 22:30:14 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-68258537</link><description>&lt;p&gt;I think you need to get python library :)&lt;/p&gt;&lt;p&gt;Adeel Jan.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Adeel Jan</dc:creator><pubDate>Thu, 12 Aug 2010 17:25:53 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-67194196</link><description>&lt;p&gt;Hi,&lt;/p&gt;&lt;p&gt;I was trying to launch a cluster using the given scripts. 
I got the following error :&lt;/p&gt;&lt;p&gt;# ./&lt;a href="http://ec2-start-cluster.py" rel="nofollow noopener" target="_blank" title="ec2-start-cluster.py"&gt;ec2-start-cluster.py&lt;/a&gt; &lt;br&gt;Traceback (most recent call last):&lt;br&gt;  File "./&lt;a href="http://ec2-start-cluster.py" rel="nofollow noopener" target="_blank" title="ec2-start-cluster.py"&gt;ec2-start-cluster.py&lt;/a&gt;", line 23, in &amp;lt;module&amp;gt;&lt;br&gt;    import EC2&lt;br&gt;ImportError: No module named EC2&lt;/p&gt;&lt;p&gt;Can anyone help me out ?&lt;br&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Aarthi 8288</dc:creator><pubDate>Sun, 08 Aug 2010 11:32:07 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-65945167</link><description>&lt;p&gt;Little late on the thread here but would still like some feedback.&lt;/p&gt;&lt;p&gt;My issues:&lt;/p&gt;&lt;p&gt;1. I was also being prompted for my password. I ended up using the solution that Raghave suggested.&lt;/p&gt;&lt;p&gt;2. I am not seeing /usr/local/src/pyMPI-2.4b2/. That directory doesn't appear to be present. I tried to get around this by copying in &lt;a href="http://fractal.py" rel="nofollow noopener" target="_blank" title="fractal.py"&gt;fractal.py&lt;/a&gt; from my local machine. 
I end up with the following:&lt;/p&gt;&lt;p&gt;mpirun -np 2 pyMPI /home/lamuser/&lt;a href="http://fractal.py" rel="nofollow noopener" target="_blank" title="fractal.py"&gt;fractal.py&lt;/a&gt; &lt;br&gt;pyMPI: can't open file '/home/lamuser/&lt;a href="http://fractal.py" rel="nofollow noopener" target="_blank" title="fractal.py"&gt;fractal.py&lt;/a&gt;': [Errno 2] No such file or directory&lt;/p&gt;&lt;p&gt;&lt;a href="http://fractal.py" rel="nofollow noopener" target="_blank" title="fractal.py"&gt;fractal.py&lt;/a&gt; is in that directory.&lt;/p&gt;&lt;p&gt;Advice?&lt;br&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">bearrito</dc:creator><pubDate>Tue, 03 Aug 2010 20:50:29 -0000</pubDate></item><item><title>Re: Wikipedia Page Traffic Statistics Dataset</title><link>http://www.datawrangling.com/wikipedia-page-traffic-statistics-dataset#comment-46894856</link><description>&lt;p&gt;Where can i get the number of views for each article. I want to get the views for many articles.  Is there any such data set available? 
&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Vinod</dc:creator><pubDate>Tue, 27 Apr 2010 05:26:16 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-45550297</link><description>&lt;p&gt;I have a Python script that calls a program installed on the master node and slave nodes.&lt;br&gt;Can I run it from the master node and get the results with mpirun?&lt;/p&gt;&lt;p&gt;Thanks a lot.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">miccloud</dc:creator><pubDate>Mon, 19 Apr 2010 17:57:03 -0000</pubDate></item><item><title>Re: MPI Cluster with Python and Amazon EC2 (part 2 of 3)</title><link>http://www.datawrangling.com/mpi-cluster-with-python-and-amazon-ec2-part-2-of-3#comment-45371085</link><description>&lt;p&gt;Can I execute a bash script on a node with a job?&lt;/p&gt;&lt;p&gt;Thanks.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">miccloud</dc:creator><pubDate>Sun, 18 Apr 2010 11:27:07 -0000</pubDate></item><item><title>Re: Wikipedia Page Traffic Statistics Dataset</title><link>http://www.datawrangling.com/wikipedia-page-traffic-statistics-dataset#comment-38711580</link><description>&lt;p&gt;Good job!&lt;/p&gt;&lt;p&gt;But how did you get the Wikipedia stats?&lt;/p&gt;&lt;p&gt;greetz, tobe&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">tobe</dc:creator><pubDate>Tue, 09 Mar 2010 13:55:25 -0000</pubDate></item><item><title>Re: Some Datasets Available on the Web</title><link>http://www.datawrangling.com/some-datasets-available-on-the-web#comment-32777719</link><description>&lt;p&gt;Hi,&lt;br&gt;Can anyone tell me how I can obtain data sets of movie ratings to import into WEKA?&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Vinoth</dc:creator><pubDate>Fri, 05 Feb 2010 
15:50:47 -0000</pubDate></item><item><title>Re: Hidden Video Courses in Math, Science, and Engineering</title><link>http://www.datawrangling.com/hidden-video-courses-in-math-science-and-engineering#comment-27728536</link><description>&lt;p&gt;Thank you very much for your great effort. I really appreciate it!&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">kittipat kampa</dc:creator><pubDate>Fri, 01 Jan 2010 04:53:23 -0000</pubDate></item><item><title>Re: Some Datasets Available on the Web</title><link>http://www.datawrangling.com/some-datasets-available-on-the-web#comment-27722017</link><description>&lt;p&gt;I need a Boolean dataset for association mining for my MTech project. Please point me to where I can find one. It would be a great help.&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">manoj</dc:creator><pubDate>Fri, 01 Jan 2010 01:03:12 -0000</pubDate></item><item><title>Re: Wikipedia Page Traffic Statistics Dataset</title><link>http://www.datawrangling.com/wikipedia-page-traffic-statistics-dataset#comment-27101607</link><description>&lt;p&gt;The easiest way would be to access the data from a Linux instance like Ubuntu... With some legwork, you should be able to use Samba somehow to access the volume from Windows. I try to stay away from Windows these days (too many headaches): &lt;a href="http://polishlinux.org/linux/ext3-reiserfs-xfs-in-windows-thanks-to-colinux/" rel="nofollow noopener" target="_blank" title="http://polishlinux.org/linux/ext3-reiserfs-xfs-in-windows-thanks-to-colinux/"&gt;http://polishlinux.org/linu...&lt;/a&gt;&lt;/p&gt;</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Pete Skomoroch</dc:creator><pubDate>Wed, 23 Dec 2009 14:36:03 -0000</pubDate></item></channel></rss>