In the top left of TOS, just under the Job name, you should see a section marked Metadata. Click the little triangle next to Metadata and then right click on File Delimited:

Click on "Create file delimited" to open the Metadata wizard. Enter a name for your source weather data (this can be anything you like; but no spaces). The other information is optional and once done click Next:

Step 2 identifies the file containing the weather data. Click Browse to locate your CSV file in the file system. The Format should be set to WINDOWS (unless you're using some other operating system). Once the file is located the File Viewer area should look something like the image below:

Click Next to move onto the next step. Step 3 defines how the individual fields in the data file are separated. The most common separator is a comma (hence Comma Separated Values or CSV). However TOS might default to a semi-colon field separator in which case you'll need to change to to a comma:

Here you can also specify whether the file has header or footer rows that should be ignored (Virtual Weather Station data files don't have a header row; but yours might). Once you're happy click the Refresh Preview button and you should see your data neatly formatted into columns as above. Click Next.
The final step in the wizard allows you to specify names for each of the columns of data. You could leave them as Column0, Column1 etc; but if you do you'll find it much harder later when it comes to map the source data onto the database table. This is what Step 4 looks like initially:

Click on each of the Column names and replace them with something meaningful. Here you'll need to know the order of each field in your data file and your weather station software documentation should be able to provide this for you. I know this will take a while; but believe me it will be worth it later. Here I've started to add some column names:

Once you've added all your column names click Finish and your navigation tree should look something like this:

That's the source for our weather data defined. In the next part we'll use that to do something useful.
]]>First off is the data file format. My weather software (Virtual Weather Station) stores its data in a file with each value separated by a comma. This type of file is referred to as a Comma Separated Values, or CSV, file. For this project to work you'll need to
The key field in this case is probably a field containing the date and time. For example in the VWS data file the key field is the first field and contains values in the format YYMMDDHHMI, for example 201003171442. The reason this is the key field is that it uniquely identifies each row of data - you can't have two rows with the same date and time. It's possible your data file might have the date and time split into two fields, in which case your "key" is a combination of the two. In database terms this key is called the Primary Key.
OK, that's enough chit chat, let's get down to some work. The first thing to do is to open Talend Open Studio (TOS). When you first open it you'll see something like this:

Before we can do anything we have to set up a repository by clicking the ... button indicated. This is simply where we'll be storing our project, which in this case will be locally on our PC:

Select a Local repository, enter your email address and choose where you want your workspace. I've put mine under my Documents folder; but you can leave yours in the default location if you wish. Click OK to set up the repository:

Now click the drop down and select "Create a new local project", then click Go!

Enter a name for your project, the description is optional. Then select the Generation language, here we're using Perl; but you can use Java if you want. However they'll be a step later on that is Perl specific so unless you know Java then stick to Perl. Don't be concerned that we'll be writing huge screeds of code, it's only a very small step that needs to be written in Perl. Finally click Finish to create the project.

Click the Open button to open the project in TOS:

Once you have the workspace open this is how it should look. The main areas we'll be working with are:
Lastly let's create a Job. Right click on Job Designs (Repository top left) and select "Create job":

Enter the job details. You have to give the job a name, the rest is optional.

When you click Finish you'll be returned back to TOS, your Palette will be populated with all the many tools you can use and the Job Design area will become an active grid.
That's all for now. In the next installment we'll be creating some data objects and taking our first look at them in TOS.
]]>So what's the job? The requirements were these:
I've a weather station running at home (see my site TulligWeather). The software that logs the weather data writes the information to a "comma separated values" (CSV) data file - one line of data every 5 minutes. My requirement was to find a reliable way to insert this into a database so that I can manipulate it further (e.g. draw charts; but that's another tutorial). Once the data is in a database it's easy to query it to find maximum or minimum values, averages, trends and the like.
If you want to follow this tutorial then you'll need these ingredients:
Now you should have a MySQL database, TOS and Active Perl (including DBD::MySQL) installed on your PC. The next step will be to fire up TOS and create your project, more on this later.
Whilst I'm not a support service for MySQL, TOS or Active Perl installation let me know if you run into problems and I'll see if I can point you in the right direction.
]]>
For a start Sun is a hardware manufacturer with a range of UNIX servers that target corporate IT infrastructure. Oracle, on the other hand, has no experience with box production and the corporate merger landscape is littered with companies that merged into areas they were not familiar with and came down to earth with a bump as a result. I would expect Oracle to hive off the hardware arm of Sun as a separate (and loss making?) business after which IBM can pick over the bones.
The rest of Sun Microsystems is a different prospect for Oracle. Sun own Java, the programming language and runtime that’s become pretty ubiquitous over the last ten years and can now be found on a huge range of every day devices from mobile phones, cars, washing machines; just about anything that needs a control system. Just how Oracle views Java could have far reaching consequences in a huge variety of market places.
Then there’s Sun’s other Open Source product set including the competitor for Microsoft Office; Open Office.org. Open Office has got to the point where it’s a serious alternative to the (very expensive) Office suite from Microsoft, and it won’t cost you a dime. Sun has been providing supporting services (if not hard cash) to the development of Open Office.org. Now I don’t know how much of a sink hole this has been for Sun; but it would be a great pity if Oracle abandon these efforts rather than continuing to promote and support development.
The MySQL database is another big question mark for Oracle. Purchased by Sun a few years ago MySQL could be seen as a direct competitor to Oracle’s database products, again at a fraction of the price. Removing MySQL from the market place could have been a big incentive for Oracle to purchase Sun. However the enterprise version of MySQL is only one part of the equation; the other part is what happens to the community version. For years now this has been the bedrock of many of the innovative open source projects that have made cloud computing what it is today. If Oracle should stop supporting the community edition of MySQL it will leave a huge hole in the open source development effort.
Down the line I predict this take over could have far reaching consequences for the shape of the IT industry. Let us know what your views are in the comments below..
]]>
Imagine this scene. You come into work on a Monday morning to find that there was a fire that started in a neighbouring building over the weekend and your office was badly damaged by heat, smoke and water from the fire brigade. You're stood in the car park wondering what you're going to do to rescue your business.