Lib-gdal » History » Version 3
Herve Caumont, 2013-11-05 16:46
1 | 1 | Herve Caumont | h1. GDAL Simple job |
---|---|---|---|
2 | |||
3 | h2. Introduction |
||
4 | |||
5 | This simple job will demonstrate a few things about a typical sandbox usage: |
||
6 | * installing software packages |
||
7 | * configuration of a simple application taking geotiff files available on a remote FTP site and convert them to a chosen format (e.g. PNG) |
||
8 | |||
9 | h2. Pre-requisites |
||
10 | |||
11 | To follow this simple tutorial you need: |
||
12 | * a running sandbox |
||
13 | * access to the sandbox |
||
14 | less than 30 minutes of time |
||
15 | |||
16 | h2. Step 1: install gdal on the sandbox |
||
17 | |||
18 | The installation of gdal is done via yum(Yellowdog Updater Modified) |
||
19 | |||
20 | Run the command below on your sandbox: |
||
21 | |||
22 | <pre> |
||
23 | [user@sb ~]$ sudo yum -y install gdal |
||
24 | </pre> |
||
25 | |||
26 | h2. Step 2: create the simplejob |
||
27 | |||
28 | !src/logo.png! |
||
29 | |||
30 | The simple job will require a folder under /application: |
||
31 | |||
32 | <pre> |
||
33 | [user@sb ~] cd /application |
||
34 | [user@sb application] mkdir simplejob |
||
35 | [user@sb application] cd simplejob |
||
36 | </pre> |
||
37 | |||
38 | h3. Create the wrapper file which handles the input parameters |
||
39 | |||
40 | <pre> |
||
41 | [user@sb simplejob] vi run |
||
42 | </pre> |
||
43 | |||
44 | Note: You can use any editor you like |
||
45 | |||
46 | h3. Paste the code below in the run file: |
||
47 | |||
48 | <pre> |
||
49 | #!/bin/bash |
||
50 | |||
51 | # Project: ${project.name} |
||
52 | # Author: $Author: stripodi $ (Terradue Srl) |
||
53 | # Last update: $Date: 2011-09-08 12:01:58 +0200 (Thu, 08 Sep 2011) $ |
||
54 | # Element: ${project.name} |
||
55 | # Context: services/${project.artifactId} |
||
56 | # Version: ${project.version} (${implementation.build}) |
||
57 | # Description: ${project.description} |
||
58 | # |
||
59 | # This document is the property of Terradue and contains information directly |
||
60 | # resulting from knowledge and experience of Terradue. |
||
61 | # Any changes to this code is forbidden without written consent from Terradue Srl |
||
62 | # |
||
63 | # Contact: info@terradue.com |
||
64 | # 2012-02-10 - NEST in jobConfig upgraded to version 4B-1.1 |
||
65 | |||
66 | # source the ciop functions (e.g. ciop-log) |
||
67 | source ${ciop_job_include} |
||
68 | |||
69 | # define the exit codes |
||
70 | SUCCESS=0 |
||
71 | ERR_NOINPUT=1 |
||
72 | ERR_NOPARAMS=2 |
||
73 | ERR_GDAL=4 |
||
74 | |||
75 | # add a trap to exit gracefully |
||
76 | function cleanExit () |
||
77 | { |
||
78 | local retval=$? |
||
79 | local msg="" |
||
80 | case "$retval" in |
||
81 | $SUCCESS) msg="Processing successfully concluded";; |
||
82 | $ERR_NOPARAMS) msg="Outout format not defined";; |
||
83 | $ERR_GDAL) msg="Graph processing of job ${JOBNAME} failed (exit code $res)";; |
||
84 | *) msg="Unknown error";; |
||
85 | esac |
||
86 | [ "$retval" != "0" ] && ciop-log "ERROR" "Error $retval - $msg, processing aborted" || ciop-log "INFO" "$msg" |
||
87 | exit $retval |
||
88 | } |
||
89 | trap cleanExit EXIT |
||
90 | |||
91 | # retrieve the parameters value from workflow or job default value |
||
92 | format=`ciop-getparam format` |
||
93 | |||
94 | # run a check on the format value |
||
95 | [ -z "$format" ] && exit $ERR_NOPARAMS |
||
96 | |||
97 | # loop through all geotiff URLs passed as stdin |
||
98 | while read inputfile |
||
99 | do |
||
100 | # report activity in log |
||
101 | ciop-log "INFO" "Retrieving $inputfile from storage" |
||
102 | |||
103 | # retrieve the remote geotiff product to the local temporary folder |
||
104 | retrieved=`ciop-copy -o $TMPDIR $inputfile` |
||
105 | |||
106 | # check if the file was retrieved |
||
107 | [ "$?" == "0" -a -e "$retrieved" ] || exit $ERR_NOINPUT |
||
108 | |||
109 | # report activity |
||
110 | ciop-log "INFO" "Retrieved $retrieved" |
||
111 | |||
112 | # invoke gdal to convert the geotiff into selected format |
||
113 | gdal_translate -of $format $retrieved $OUTPUTDIR/`basename $retrieved` |
||
114 | |||
115 | # check error code |
||
116 | [ "$?" != "0" ] && exit $ERR_GDAL || ciop-log "INFO" "Processed $inputfile" |
||
117 | done |
||
118 | |||
119 | exit 0 |
||
120 | </pre> |
||
121 | |||
122 | h3. Create the application descriptor |
||
123 | |||
124 | Go up one level to /application |
||
125 | |||
126 | <pre> |
||
127 | [user@sb ~] cd /application |
||
128 | [user@sb application] vi application.xml |
||
129 | </pre> |
||
130 | |||
131 | Paste the XML content: |
||
132 | |||
133 | <pre> |
||
134 | <?xml version="1.0" encoding="UTF-8"?> |
||
135 | <application id="example"> <!-- you can type any id you want --> |
||
136 | <jobTemplates> |
||
137 | <jobTemplate id="gdalformatconv"> <!-- this is the job name --> |
||
138 | <streamingExecutable>/application/simplejob/run</streamingExecutable> <!-- this is the wrapper script --> |
||
139 | <defaultParameters> |
||
140 | <parameter id="format">PNG</parameter> <!-- this sets the default value for parameter format --> |
||
141 | </defaultParameters> |
||
142 | </jobTemplate> |
||
143 | </jobTemplates> |
||
144 | <workflow id="workflow"> <!-- Sample workflow --> |
||
145 | <workflowVersion>1.0</workflowVersion> |
||
146 | <workflowDescription>My simple workflow</workflowDescription> <!-- provide a description to the workflow --> |
||
147 | <node id="gdal"> <!-- workflow node unique id --> |
||
148 | <job id="gdalformatconv"></job> <!-- job defined above --> |
||
149 | <sources> |
||
150 | <source refid="file:urls" >/home/fbrito/geotiff.urls</source> <!-- the geotiff URLs are provided on an ASCII file, set your username value --> |
||
151 | </sources> |
||
152 | <parameters></parameters> |
||
153 | </node> |
||
154 | </workflow> |
||
155 | </application> |
||
156 | </pre> |
||
157 | |||
158 | Create the URLs files in your home directory |
||
159 | |||
160 | <pre> |
||
161 | [user@sb application] cd |
||
162 | [user@sb ~] vi geotiff.urls |
||
163 | </pre> |
||
164 | |||
165 | Add a few URLs: |
||
166 | |||
167 | <pre> |
||
168 | ftp://ftp.remotesensing.org/pub/geotiff/samples/spot/chicago/SP27GTIF.TIF |
||
169 | ftp://ftp.remotesensing.org/pub/geotiff/samples/spot/chicago/UTM2GTIF.TIF |
||
170 | </pre> |
||
171 | |||
172 | h2. Step 3: execute the job |
||
173 | |||
174 | h3. Execute the simple job |
||
175 | |||
176 | Optionally list the nodes you can execute: |
||
177 | |||
178 | <pre> |
||
179 | [user@sb ~] ciop-simjob -n |
||
180 | </pre> |
||
181 | |||
182 | This will return: |
||
183 | <pre> |
||
184 | gdal |
||
185 | </pre> |
||
186 | |||
187 | Process it! |
||
188 | |||
189 | <pre> |
||
190 | [user@sb ~] ciop-simjob gdal |
||
191 | </pre> |
||
192 | |||
193 | The job is executed and a tracking URL is provided to follow the progress and access the execution logs. Open the URL on a browser |
||
194 | |||
195 | TBC |
||
196 | |||
197 | h3. Check the gdal node results |
||
198 | |||
199 | The generated files are published on a local filesystem: |
||
200 | |||
201 | <pre> |
||
202 | [user@sb ~] cd /share/tmp/TBC |
||
203 | </pre> |
||
204 | |||
205 | There should be two image files in the file format defined by default: PNG |
||
206 | |||
207 | h3. Define another file format to test the node |
||
208 | |||
209 | Edit the application.xml file and add the line: |
||
210 | |||
211 | <pre> |
||
212 | <parameter id="format">JPEG</parameter> |
||
213 | </pre> |
||
214 | |||
215 | To obtain: |
||
216 | |||
217 | <pre> |
||
218 | <?xml version="1.0" encoding="UTF-8"?> |
||
219 | <application id="example"> |
||
220 | <jobTemplates> |
||
221 | <jobTemplate id="gdalformatconv"> |
||
222 | <streamingExecutable>/application/simplejob/run</streamingExecutable> |
||
223 | <defaultParameters> |
||
224 | <parameter id="format">PNG</parameter> |
||
225 | </defaultParameters> |
||
226 | </jobTemplate> |
||
227 | </jobTemplates> |
||
228 | <workflow id="workflow"> <!-- Sample workflow --> |
||
229 | <workflowVersion>1.0</workflowVersion> |
||
230 | <workflowDescription>My simple workflow</workflowDescription> |
||
231 | <node id="gdal"> <!-- workflow node unique id --> |
||
232 | <job id="gdalformatconv"></job> <!-- job defined above --> |
||
233 | <sources> |
||
234 | <source refid="file:urls" >/home/fbrito/geotiff.urls</source> |
||
235 | </sources> |
||
236 | <parameters> |
||
237 | <parameter id="format">JPEG</parameter> |
||
238 | </parameters> |
||
239 | </node> |
||
240 | </workflow> |
||
241 | </application> |
||
242 | </pre> |
||
243 | |||
244 | Run the job again this time using the ciop-simjob flag to delete the previous run results: |
||
245 | |||
246 | <pre> |
||
247 | [user@sb ~] ciop-simjob -f gdal |
||
248 | </pre> |
||
249 | |||
250 | The generated files are published on a local filesystem: |
||
251 | |||
252 | <pre> |
||
253 | [user@sb ~] cd /share/tmp/TBC |
||
254 | </pre> |
||
255 | |||
256 | There should be two image files in the file format defined at workflow level: JPEG |
||
257 | |||
258 | h3. Run the application as a workflow |
||
259 | |||
260 | Our workflow has a single job but it's still a workflow! |
||
261 | |||
262 | You can trigger the workflow with: |
||
263 | |||
264 | <pre> |
||
265 | [user@sb ~] ciop-simwf |
||
266 | </pre> |
||
267 | |||
268 | You can track the workflow execution on the shell. Wait for the workflow conclusion. |
||
269 | |||
270 | Use the command below to get the latest workflow run: |
||
271 | <pre> |
||
272 | [user@sb ~] ciop-simwf -l |
||
273 | 0000000-130405042430716-oozie-oozi-W |
||
274 | </pre> |
||
275 | |||
276 | Use the value returned above to check the workflow results: |
||
277 | |||
278 | <pre> |
||
279 | [user@sb ~] ll tmp/sandbox/run/0000000-130405042430716-oozie-oozi-W/gdal/output |
||
280 | </pre> |
||
281 | |||
282 | You can optionally delete the generated results: |
||
283 | |||
284 | |||
285 | |||
286 | h2. Conclusion |
||
287 | |||
288 | With this simple job you have learned: |
||
289 | * how to install packages to run your application using yum |
||
290 | * how to create a job template in your sanbox |
||
291 | * how to write the job wrapper script |
||
292 | * how to define the application descriptor |
||
293 | * how to execute the job with the default parameter |
||
294 | * how to execute the job with the parameter value definition |
||
295 | * how to execute the workflow |
||
296 | * how to check the generated results |