| | 290 | |
| | 291 | == Other Developments == |
| | 292 | |
| | 293 | === Quattor Asset Database - C. Tryantafyllydys === |
| | 294 | |
| | 295 | AUTH needed a tool to ease everyday tasks for new sysadmins and allow to feed other tools with Quattor data |
| | 296 | * Eg. configure a monitoring system from Quattor |
| | 297 | |
| | 298 | QAD main characteristics: |
| | 299 | * Written in Ruby |
| | 300 | * A few schema extensions: /monitiroing, network "parent" (switch it depends on), rack |
| | 301 | * Need to be merged with CERN similar extensions |
| | 302 | * /monitoring is an alternative to the LEMON /monitoring |
| | 303 | * Not yet decided: access to SVN? read or write? |
| | 304 | * Currently using a post-commit hook to synchronize QAD db with SCDB deploys: impact on performance |
| | 305 | * Would like to remove this in the future and rely on XML but some high level information is currently missing: need to extend the schema |
| | 306 | * Allow to change some node characteristics: OS, IP... and redeploy |
| | 307 | * From the config db, can list nodes attached to a specific switch: rely on configured information |
| | 308 | * Can list all the VMs attached to a host (using the enclosure definition) |
| | 309 | |
| | 310 | Need to make progress on using CDB2SQL back-end for producing/maintaining db and use it in all tools that need a build a db from the XML |
| | 311 | * Rename XML2SQL |
| | 312 | * CERN will provide an update on the status and the best version to start from |
| | 313 | * May look at document-oriented dbs, liek CacheDB |
| | 314 | |
| | 315 | === Remote Configuration with Quattor - N. Williams === |
| | 316 | |
| | 317 | ''Disclaimer: doesn't currently work with SCDB (because of dependency over CDB notification).'' |
| | 318 | |
| | 319 | Goal: apply Quattor benefits to boxes that cannot run Quattor client by using delegation |
| | 320 | * A Linux box will acts as a delegate where a configuration module will execute appropriate configuration commands |
| | 321 | * Combination of AII and and CCM/NCM/NCD |
| | 322 | |
| | 323 | Current implementation: |
| | 324 | * quattor-remote-dispatcher (QRD): a tool running on Linux box and receiving CDB notification messages. It acts as a replacement for listend, cdispd, ccm-fetch, ncm-ncd |
| | 325 | * Configuration allows to define which part of the configuration are listened and what is the command to run |
| | 326 | * Can use different commands on different sets of nodes |
| | 327 | * Can define constraints on some part of the configuration to do different things based on some configuration state (for example state=build or production) |
| | 328 | * quattor-remote-configure: an AII equivalent allowing to produce a new configuration for the managed box and notify the remote dispatcher |
| | 329 | * Configuration of the managed device is under /software/components |
| | 330 | * The actual component to use is defined by /system/components/namespace: default is NCM::NCD:: but can be defined to something specific to a device, eg. ESX, Netapp... |
| | 331 | |
| | 332 | Used to manage virtualization in Aquilon (for a WMware Hypervisor) |
| | 333 | * VMs are not associated with actual hosts (handled for example by VMware) but with clusters |
| | 334 | * A cluster is a group of hosts running an hypervisor. There is an object template for each cluster. |
| | 335 | * Each virtual host (machine running an hypervisor) has also an object profile that allows its configuration out of the box |
| | 336 | * VMs are managed as normal machines |
| | 337 | |
| | 338 | Plan to release this as soon as it is polished but will not be able to release the specific components as they are very MS specifics and sometimes rely on non public APIs. |
| | 339 | |
| | 340 | === Virtualisation - D. O'Callaghan === |
| | 341 | |
| | 342 | Currently used Hypervisors configured with Quattor: |
| | 343 | * Xen: TCD, NIKHEF, CERN, AUTH |
| | 344 | * OpenVZ: UAM |
| | 345 | * VMware: MS |
| | 346 | |
| | 347 | Also in use but not managed by Quattor: |
| | 348 | * KVM: CNAF |
| | 349 | * Hyper-V: CERN |
| | 350 | |
| | 351 | VM cluster managers of interest: Platform/VMO, OpenNebula |
| | 352 | |
| | 353 | Issues and requests: |
| | 354 | * Use libvirt, common configuration module for Linux-base hypervisors |
| | 355 | * Could look at what is done by Puppet or other config tools |
| | 356 | * Creation of images |
| | 357 | |
| | 358 | QWG templates for basic virtualisation: mainly used at TCD (also in Senegal!) |
| | 359 | * Rely on ncm-xen, would be great if we had support KVM, probably using libvirt |
| | 360 | |
| | 361 | === Integration with Monitoring Systems - C. Tryantafyllydys === |
| | 362 | |
| | 363 | Porblem: current Nagios configuration in QWG relies on everything being described (in particular probles) in the configuration. But EGEE and OAT developped NCG to do it more dynamically on a specific node. How to take advantage of this? |
| | 364 | |
| | 365 | NCD: generic tool to define Nagios configuration based on context |
| | 366 | * 2 main basic modules/entry points: NCG::SiteSet and NCG::SiteInfo:: |
| | 367 | * Also several internal modules to define probes to use, ... that would benefit to receive information from Quattor |
| | 368 | * Exemple: configure all probes for a CE if the node is configured as a CE |
| | 369 | |
| | 370 | Integration between NCG and XML doesn't scale as it is far too long. Need to pre-process data and this is done with QAD. |
| | 371 | * Currently more a proof of concept than a ready-to-use tool |
| | 372 | |
| | 373 | Discussion: |
| | 374 | * Need to figure out how to use NCG to define services without defining hosts and rely on Quattor for host definitions of hosts managed by Quattor |
| | 375 | * Create a small working group with a specific mailing list: quattor-monitoring |
| | 376 | * TCD will commit their change to the generic templates: no specific change |
| | 377 | * NIKHEF will document the change they had to make to generic templates to identify additional customizations needed |
| | 378 | * CNAF: currently the storage group has its own way of configuring Nagios with filecopy, difficult to change even if convinced |
| | 379 | * RAL interested but probably not in the short term |
| | 380 | * ULB/BEGrid started to look at it |
| | 381 | * Guillaume will fix the Nagios example in QWG repository |
| | 382 | * Potential scalability pb if nagios server profile need to depend on all the profiles for all the machines it monitors |
| | 383 | |
| | 384 | |
| | 385 | == QUEST Proposal Status - M. Jouvin == |
| | 386 | |
| | 387 | See [http://indico.cern.ch/getFile.py/access?contribId=3&sessionId=2&resId=1&materialId=slides&confId=67632 slides]. |
| 307 | | |
| 308 | | == QUEST Proposal Status - M. Jouvin == |
| 309 | | |
| 310 | | See [http://indico.cern.ch/getFile.py/access?contribId=3&sessionId=2&resId=1&materialId=slides&confId=67632 slides]. |
| 311 | | |
| 312 | | == Other Developments == |
| 313 | | |
| 314 | | === Quattor Asset Database - C. Tryantafyllydys === |
| 315 | | |
| 316 | | AUTH needed a tool to ease everyday tasks for new sysadmins and allow to feed other tools with Quattor data |
| 317 | | * Eg. configure a monitoring system from Quattor |
| 318 | | |
| 319 | | QAD main characteristics: |
| 320 | | * Written in Ruby |
| 321 | | * A few schema extensions: /monitiroing, network "parent" (switch it depends on), rack |
| 322 | | * Need to be merged with CERN similar extensions |
| 323 | | * /monitoring is an alternative to the LEMON /monitoring |
| 324 | | * Not yet decided: access to SVN? read or write? |
| 325 | | * Currently using a post-commit hook to synchronize QAD db with SCDB deploys: impact on performance |
| 326 | | * Would like to remove this in the future and rely on XML but some high level information is currently missing: need to extend the schema |
| 327 | | * Allow to change some node characteristics: OS, IP... and redeploy |
| 328 | | * From the config db, can list nodes attached to a specific switch: rely on configured information |
| 329 | | * Can list all the VMs attached to a host (using the enclosure definition) |
| 330 | | |
| 331 | | Need to make progress on using CDB2SQL back-end for producing/maintaining db and use it in all tools that need a build a db from the XML |
| 332 | | * Rename XML2SQL |
| 333 | | * CERN will provide an update on the status and the best version to start from |
| 334 | | * May look at document-oriented dbs, liek CacheDB |
| 335 | | |
| 336 | | === Remote Configuration with Quattor - N. Williams === |
| 337 | | |
| 338 | | ''Disclaimer: doesn't currently work with SCDB (because of dependency over CDB notification).'' |
| 339 | | |
| 340 | | Goal: apply Quattor benefits to boxes that cannot run Quattor client by using delegation |
| 341 | | * A Linux box will acts as a delegate where a configuration module will execute appropriate configuration commands |
| 342 | | * Combination of AII and and CCM/NCM/NCD |
| 343 | | |
| 344 | | Current implementation: |
| 345 | | * quattor-remote-dispatcher (QRD): a tool running on Linux box and receiving CDB notification messages. It acts as a replacement for listend, cdispd, ccm-fetch, ncm-ncd |
| 346 | | * Configuration allows to define which part of the configuration are listened and what is the command to run |
| 347 | | * Can use different commands on different sets of nodes |
| 348 | | * Can define constraints on some part of the configuration to do different things based on some configuration state (for example state=build or production) |
| 349 | | * quattor-remote-configure: an AII equivalent allowing to produce a new configuration for the managed box and notify the remote dispatcher |
| 350 | | * Configuration of the managed device is under /software/components |
| 351 | | * The actual component to use is defined by /system/components/namespace: default is NCM::NCD:: but can be defined to something specific to a device, eg. ESX, Netapp... |
| 352 | | |
| 353 | | Used to manage virtualization in Aquilon (for a WMware Hypervisor) |
| 354 | | * VMs are not associated with actual hosts (handled for example by VMware) but with clusters |
| 355 | | * A cluster is a group of hosts running an hypervisor. There is an object template for each cluster. |
| 356 | | * Each virtual host (machine running an hypervisor) has also an object profile that allows its configuration out of the box |
| 357 | | * VMs are managed as normal machines |
| 358 | | |
| 359 | | Plan to release this as soon as it is polished but will not be able to release the specific components as they are very MS specifics and sometimes rely on non public APIs. |
| 360 | | |
| 361 | | === Virtualisation - D. O'Callaghan === |
| 362 | | |
| 363 | | Currently used Hypervisors configured with Quattor: |
| 364 | | * Xen: TCD, NIKHEF, CERN, AUTH |
| 365 | | * OpenVZ: UAM |
| 366 | | * VMware: MS |
| 367 | | |
| 368 | | Also in use but not managed by Quattor: |
| 369 | | * KVM: CNAF |
| 370 | | * Hyper-V: CERN |
| 371 | | |
| 372 | | VM cluster managers of interest: Platform/VMO, OpenNebula |
| 373 | | |
| 374 | | Issues and requests: |
| 375 | | * Use libvirt, common configuration module for Linux-base hypervisors |
| 376 | | * Could look at what is done by Puppet or other config tools |
| 377 | | * Creation of images |
| 378 | | |
| 379 | | QWG templates for basic virtualisation: mainly used at TCD (also in Senegal!) |
| 380 | | * Rely on ncm-xen, would be great if we had support KVM, probably using libvirt |
| 381 | | |
| 382 | | === Integration with Monitoring Systems - C. Tryantafyllydys === |
| 383 | | |
| 384 | | Porblem: current Nagios configuration in QWG relies on everything being described (in particular probles) in the configuration. But EGEE and OAT developped NCG to do it more dynamically on a specific node. How to take advantage of this? |
| 385 | | |
| 386 | | NCD: generic tool to define Nagios configuration based on context |
| 387 | | * 2 main basic modules/entry points: NCG::SiteSet and NCG::SiteInfo:: |
| 388 | | * Also several internal modules to define probes to use, ... that would benefit to receive information from Quattor |
| 389 | | * Exemple: configure all probes for a CE if the node is configured as a CE |
| 390 | | |
| 391 | | Integration between NCG and XML doesn't scale as it is far too long. Need to pre-process data and this is done with QAD. |
| 392 | | * Currently more a proof of concept than a ready-to-use tool |
| 393 | | |
| 394 | | Discussion: |
| 395 | | * Need to figure out how to use NCG to define services without defining hosts and rely on Quattor for host definitions of hosts managed by Quattor |
| 396 | | * Create a small working group with a specific mailing list: quattor-monitoring |
| 397 | | * TCD will commit their change to the generic templates: no specific change |
| 398 | | * NIKHEF will document the change they had to make to generic templates to identify additional customizations needed |
| 399 | | * CNAF: currently the storage group has its own way of configuring Nagios with filecopy, difficult to change even if convinced |
| 400 | | * RAL interested but probably not in the short term |
| 401 | | * ULB/BEGrid started to look at it |
| 402 | | * Guillaume will fix the Nagios example in QWG repository |
| 403 | | * Potential scalability pb if nagios server profile need to depend on all the profiles for all the machines it monitors |
| 404 | | |
| | 405 | * Commands: wrapper for quattor commands on managed nodes and on AII server developped by David as an action after last workshop |
| | 406 | * https://sourceforge.net/apps/mediawiki/quattor/index.php?title=Command_Renaming |
| | 407 | * `quattor` command as a wrapper over ncm-ncd, ccm-fetch, ncm-query... |
| | 408 | * `quattor-installer` as a wrapper over aii-xxx |
| | 409 | * For AII, should not distinguish between local and remote (`installfe` and `shellfe`) |
| | 410 | * In both cases, symlinks created for each subcommands to allow to use shell completion (à la git) |
| | 411 | * Currently deployed by a template: David will try to build 2 separate RPMs (1 for managed nodes, 1 for AII server) |