Changes between Version 2 and Version 3 of Meetings/Workshops/20091104


Ignore:
Timestamp:
Nov 5, 2009, 4:18:01 PM (16 years ago)
Author:
/O=GRID-FR/C=FR/O=CNRS/OU=LAL/CN=Michel Jouvin
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • Meetings/Workshops/20091104

    v2 v3  
    288288Need to discuss how to enforce annotations for high-level services (features) and produce some tools to easily access this information.
    289289
     290
     291== Other Developments ==
     292
     293=== Quattor Asset Database - C. Tryantafyllydys ===
     294
     295AUTH needed a tool to ease everyday tasks for new sysadmins and allow to feed other tools with Quattor data
     296 * Eg. configure a monitoring system from Quattor
     297 
     298QAD main characteristics:
     299 * Written in Ruby
     300 * A few schema extensions:  /monitiroing, network "parent" (switch it depends on), rack
     301   * Need to be merged with CERN similar extensions
     302   * /monitoring is an alternative to the LEMON /monitoring
     303 * Not yet decided: access to SVN? read or write?
     304   * Currently using a post-commit hook to synchronize QAD db with SCDB deploys: impact on performance
     305   * Would like to remove this in the future and rely on XML but some high level information is currently missing: need to extend the schema
     306 * Allow to change some node characteristics: OS, IP... and redeploy
     307 * From the config db, can list nodes attached to a specific switch: rely on configured information
     308 * Can list all the VMs attached to a host (using the enclosure definition)
     309 
     310Need to make progress on using CDB2SQL back-end for producing/maintaining db and use it in all tools that need a build a db from the XML
     311 * Rename XML2SQL
     312   * CERN will provide an update on the status and the best version to start from
     313 * May look at document-oriented dbs, liek CacheDB
     314 
     315=== Remote Configuration with Quattor - N. Williams ===
     316
     317''Disclaimer: doesn't currently work with SCDB (because of dependency over CDB notification).''
     318
     319Goal: apply Quattor benefits to boxes that cannot run Quattor client by using delegation
     320 * A Linux box will acts as a delegate where  a configuration module will execute appropriate configuration commands
     321 * Combination of AII and and CCM/NCM/NCD
     322 
     323Current implementation:
     324 * quattor-remote-dispatcher (QRD): a tool running on Linux box and receiving CDB notification messages. It acts as a replacement for listend, cdispd, ccm-fetch, ncm-ncd
     325   * Configuration allows to define which part of the configuration are listened and what is the command to run
     326   * Can use different commands on different sets of nodes
     327   * Can define constraints on some part of the configuration to do different things based on some configuration state (for example state=build or production)
     328 * quattor-remote-configure: an AII equivalent allowing to produce a new configuration for the managed box and notify the remote dispatcher
     329   * Configuration of the managed device is  under /software/components
     330   * The actual component to use is defined by /system/components/namespace: default is NCM::NCD:: but can be defined to something specific to a device, eg. ESX, Netapp...
     331
     332Used to manage virtualization in Aquilon (for a WMware Hypervisor)
     333 * VMs are not associated with actual hosts (handled for example by VMware) but with clusters
     334   * A cluster is a  group of hosts running an hypervisor. There is an object template for each cluster.
     335   * Each virtual host (machine running an hypervisor) has also an object profile that allows its configuration out of the box
     336 * VMs are managed as normal machines
     337   
     338Plan to release this as soon as it is polished but will not be able to release the specific components as they are very MS specifics and sometimes rely on non public APIs.
     339
     340=== Virtualisation - D. O'Callaghan ===
     341
     342Currently used Hypervisors configured with Quattor:
     343 * Xen: TCD, NIKHEF, CERN, AUTH
     344 * OpenVZ: UAM
     345 * VMware: MS
     346 
     347Also in use but not managed by Quattor:
     348 * KVM: CNAF
     349 * Hyper-V: CERN
     350 
     351 VM cluster managers of interest: Platform/VMO, OpenNebula
     352 
     353Issues and requests:
     354 * Use libvirt, common configuration module for Linux-base hypervisors
     355   * Could look at what is done by Puppet or other config tools
     356 * Creation of images
     357 
     358QWG templates for basic virtualisation: mainly used at TCD (also in Senegal!)
     359 * Rely on ncm-xen, would be great if we had support KVM, probably using libvirt
     360 
     361=== Integration with Monitoring Systems - C. Tryantafyllydys ===
     362
     363Porblem: current Nagios configuration in QWG relies on everything being described (in particular probles) in the configuration. But EGEE and OAT developped NCG to do it more dynamically on a specific node. How to take advantage of this?
     364
     365NCD: generic tool to define Nagios configuration based on context
     366 * 2 main basic modules/entry points: NCG::SiteSet and NCG::SiteInfo::
     367 * Also several internal modules to define probes to use, ... that would benefit to receive information from Quattor
     368   * Exemple: configure all probes for a CE if the node is configured as a CE
     369 
     370Integration between NCG and XML doesn't scale as it is far too long. Need to pre-process data and this is done with QAD.
     371 * Currently more a proof of concept than a ready-to-use tool
     372 
     373Discussion:
     374 * Need to figure out how to use NCG to define services without defining hosts and rely on Quattor for host definitions of hosts managed by Quattor
     375 * Create a small working group with a specific mailing list: quattor-monitoring
     376 * TCD will commit their change to the generic templates: no specific change
     377 * NIKHEF will document the change they had to make to generic templates to identify additional customizations needed
     378 * CNAF: currently the storage group has its own way of configuring Nagios with filecopy, difficult to change even if convinced
     379 * RAL interested but probably not in the short term
     380 * ULB/BEGrid started to look at it
     381 * Guillaume will fix the Nagios example in QWG repository
     382 * Potential scalability pb if nagios server profile need to depend on all the profiles for all the machines it monitors
     383 
     384
     385== QUEST Proposal Status - M. Jouvin ==
     386
     387See [http://indico.cern.ch/getFile.py/access?contribId=3&sessionId=2&resId=1&materialId=slides&confId=67632 slides].
    290388
    291389== Improving Quattor's Accessibility - M. Jouvin ==
     
    305403   * One proposal is CMDB for ''Configuration Management DB''. Need more thinking
    306404   * May be not really an issue anymore with CERN left as the only site using CDB...
    307 
    308 == QUEST Proposal Status - M. Jouvin ==
    309 
    310 See [http://indico.cern.ch/getFile.py/access?contribId=3&sessionId=2&resId=1&materialId=slides&confId=67632 slides].
    311 
    312 == Other Developments ==
    313 
    314 === Quattor Asset Database - C. Tryantafyllydys ===
    315 
    316 AUTH needed a tool to ease everyday tasks for new sysadmins and allow to feed other tools with Quattor data
    317  * Eg. configure a monitoring system from Quattor
    318  
    319 QAD main characteristics:
    320  * Written in Ruby
    321  * A few schema extensions:  /monitiroing, network "parent" (switch it depends on), rack
    322    * Need to be merged with CERN similar extensions
    323    * /monitoring is an alternative to the LEMON /monitoring
    324  * Not yet decided: access to SVN? read or write?
    325    * Currently using a post-commit hook to synchronize QAD db with SCDB deploys: impact on performance
    326    * Would like to remove this in the future and rely on XML but some high level information is currently missing: need to extend the schema
    327  * Allow to change some node characteristics: OS, IP... and redeploy
    328  * From the config db, can list nodes attached to a specific switch: rely on configured information
    329  * Can list all the VMs attached to a host (using the enclosure definition)
    330  
    331 Need to make progress on using CDB2SQL back-end for producing/maintaining db and use it in all tools that need a build a db from the XML
    332  * Rename XML2SQL
    333    * CERN will provide an update on the status and the best version to start from
    334  * May look at document-oriented dbs, liek CacheDB
    335  
    336 === Remote Configuration with Quattor - N. Williams ===
    337 
    338 ''Disclaimer: doesn't currently work with SCDB (because of dependency over CDB notification).''
    339 
    340 Goal: apply Quattor benefits to boxes that cannot run Quattor client by using delegation
    341  * A Linux box will acts as a delegate where  a configuration module will execute appropriate configuration commands
    342  * Combination of AII and and CCM/NCM/NCD
    343  
    344 Current implementation:
    345  * quattor-remote-dispatcher (QRD): a tool running on Linux box and receiving CDB notification messages. It acts as a replacement for listend, cdispd, ccm-fetch, ncm-ncd
    346    * Configuration allows to define which part of the configuration are listened and what is the command to run
    347    * Can use different commands on different sets of nodes
    348    * Can define constraints on some part of the configuration to do different things based on some configuration state (for example state=build or production)
    349  * quattor-remote-configure: an AII equivalent allowing to produce a new configuration for the managed box and notify the remote dispatcher
    350    * Configuration of the managed device is  under /software/components
    351    * The actual component to use is defined by /system/components/namespace: default is NCM::NCD:: but can be defined to something specific to a device, eg. ESX, Netapp...
    352 
    353 Used to manage virtualization in Aquilon (for a WMware Hypervisor)
    354  * VMs are not associated with actual hosts (handled for example by VMware) but with clusters
    355    * A cluster is a  group of hosts running an hypervisor. There is an object template for each cluster.
    356    * Each virtual host (machine running an hypervisor) has also an object profile that allows its configuration out of the box
    357  * VMs are managed as normal machines
    358    
    359 Plan to release this as soon as it is polished but will not be able to release the specific components as they are very MS specifics and sometimes rely on non public APIs.
    360 
    361 === Virtualisation - D. O'Callaghan ===
    362 
    363 Currently used Hypervisors configured with Quattor:
    364  * Xen: TCD, NIKHEF, CERN, AUTH
    365  * OpenVZ: UAM
    366  * VMware: MS
    367  
    368 Also in use but not managed by Quattor:
    369  * KVM: CNAF
    370  * Hyper-V: CERN
    371  
    372  VM cluster managers of interest: Platform/VMO, OpenNebula
    373  
    374 Issues and requests:
    375  * Use libvirt, common configuration module for Linux-base hypervisors
    376    * Could look at what is done by Puppet or other config tools
    377  * Creation of images
    378  
    379 QWG templates for basic virtualisation: mainly used at TCD (also in Senegal!)
    380  * Rely on ncm-xen, would be great if we had support KVM, probably using libvirt
    381  
    382 === Integration with Monitoring Systems - C. Tryantafyllydys ===
    383 
    384 Porblem: current Nagios configuration in QWG relies on everything being described (in particular probles) in the configuration. But EGEE and OAT developped NCG to do it more dynamically on a specific node. How to take advantage of this?
    385 
    386 NCD: generic tool to define Nagios configuration based on context
    387  * 2 main basic modules/entry points: NCG::SiteSet and NCG::SiteInfo::
    388  * Also several internal modules to define probes to use, ... that would benefit to receive information from Quattor
    389    * Exemple: configure all probes for a CE if the node is configured as a CE
    390  
    391 Integration between NCG and XML doesn't scale as it is far too long. Need to pre-process data and this is done with QAD.
    392  * Currently more a proof of concept than a ready-to-use tool
    393  
    394 Discussion:
    395  * Need to figure out how to use NCG to define services without defining hosts and rely on Quattor for host definitions of hosts managed by Quattor
    396  * Create a small working group with a specific mailing list: quattor-monitoring
    397  * TCD will commit their change to the generic templates: no specific change
    398  * NIKHEF will document the change they had to make to generic templates to identify additional customizations needed
    399  * CNAF: currently the storage group has its own way of configuring Nagios with filecopy, difficult to change even if convinced
    400  * RAL interested but probably not in the short term
    401  * ULB/BEGrid started to look at it
    402  * Guillaume will fix the Nagios example in QWG repository
    403  * Potential scalability pb if nagios server profile need to depend on all the profiles for all the machines it monitors
    404  
     405 * Commands: wrapper for quattor commands on managed nodes and on AII server developped by David as an action after last workshop
     406   * https://sourceforge.net/apps/mediawiki/quattor/index.php?title=Command_Renaming
     407   * `quattor` command as a wrapper over ncm-ncd, ccm-fetch, ncm-query...
     408   *  `quattor-installer` as a wrapper over aii-xxx
     409     * For AII, should not distinguish between local and remote (`installfe` and `shellfe`)
     410   * In both cases, symlinks created for each subcommands to allow to use shell completion (à la git)
     411   * Currently deployed by a template: David will try to build 2 separate RPMs (1 for managed nodes, 1 for AII server)
    405412
    406413== Miscellanous ==