direkt zum Inhalt springen

direkt zum Hauptnavigationsmenü

Sie sind hier

TU Berlin

Page Content

Reference Number: IV-367/20

Research Assist­ant - FONDA S1: DAW Repositories

The Col­lab­or­at­ive Research Cen­ter Found­a­tions of Work­flows for Large-Scale Sci­entific Data Ana­lysis (FONDA) will develop new meth­ods to sup­port sci­ent­ists, who use cluster infra­struc­tures to ana­lyze very large data­sets.

Working Field

Large-scale data ana­lysis sys­tems routinely require sci­ent­ists to work with a mul­ti­tude of hard­ware plat­forms, soft­ware sys­tems, and repos­it­or­ies to be able to ana­lyze given data­sets effi­ciently. For this reason, FONDA is build­ing a joint infra­struc­ture for all pro­ject mem­bers that enables the exe­cu­tion of data ana­lytic work­flows (DAWs), col­lects pro­fil­ing data from these work­flows and any other glob­ally avail­able exe­cu­tions, and makes this data avail­able for research. The wide range of tasks for the envi­sioned sci­entific coordin­ator of this infra­struc­ture is going to be highly inter­dis­cip­lin­ary and will include sup­port­ing CRC mem­bers in access­ing the TUB infra­struc­tures, the install­a­tion and exe­cu­tion of DAWs dur­ing devel­op­ment of DAWs and spe­cific exper­i­ments, and cre­at­ing the new pro­fil­ing data shar­ing sys­tem. The repos­it­or­ies of this sys­tem must also be set up and pop­u­lated with DAW traces. The rel­ev­ant know­ledge has to be provided to all FONDA mem­bers through lec­tures and prac­tical work­shops. Project website: here.


    • Suc­cess­fully com­pleted sci­entific uni­versity degree (Mas­ter, Dip­lom or equi­val­ent) in Com­puter Sci­ence
    • PhD degree wel­come
    • Good under­stand­ing of cluster infra­struc­tures, scal­able data pro­cessing/ana­lysis, sci­entific work­flows, and soft­ware devel­op­ment envir­on­ments
    • Good know­ledge of cluster resource man­age­ment with sys­tems like YARN, Mesos, and Kuber­netes as well as con­tain­ers/Docker
    • Exper­i­ence with dis­trib­uted data pro­cessing sys­tems like Spark, Flink, and Tensor­Flow
    • Excel­lent com­mand of Ger­man and Eng­lish
    • Desir­able: exper­i­ence in research; sci­entific present­a­tions; agile pro­ject man­age­ment and iter­at­ive soft­ware devel­op­ment meth­ods; server admin­is­tra­tion

    TU Berlin job ticket here.

      How to apply

      Please send your applic­a­tion with the ref­er­ence num­ber and the usual doc­u­ments (in par­tic­u­lar CV, list of grades, lan­guage cer­ti­fic­ates) prefer­ably by email to Prof. Odej Kao: odej.kao[at]tu-berlin.de.

      Zusatzinformationen / Extras

      Quick Access:

      Schnellnavigation zur Seite über Nummerneingabe

      Auxiliary Functions


      Odej Kao
      +49 30 314-25154 (Sekr.)
      Room TEL 1206/7