Changes between Version 2 and Version 3 of Meetings/Workshops/20071029


Ignore:
Timestamp:
Oct 30, 2007, 12:54:18 PM (17 years ago)
Author:
jouvin
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • Meetings/Workshops/20071029

    v2 v3  
    44[[TOC(inline)]]
    55
     6Agenda : http://indico.cern.ch/conferenceTimeTable.py?confId=20479
    67
    78== Site Reports ==
     
    185186 * Deprecate include with literal name in v8 : the change required is compatible with panc v6. Agreed.
    186187 * Introduce include DML without braces at the same time former include syntax is unsupported (v9).
     188 * CERN / CNAF agree to migrate to  panc v7 asap. For CERN, mainly the problem of assessing performance problems are fixed.
     189 * Interval of 6 months minimum between major releases. Support version N-1.
     190
     191=== CDB ===
     192
     193Changes in the last 6 months
     194 * cdb-cli : support for user-defined procedures, minimal support for disconnected operations, preparation for namespace support
     195  * CDB deployer : ease setup of a test configuration on a set of nodes.
     196    * Previously involved a lot of manual operations (mainly copy back and forth from production templates).
     197    * Take advantages of namespaces and loadpaths : a ''stage'' (area) is prepend to the template name by defining the appropriate loadpath
     198    * A (non-staged) meta-template defines the appropriate loadpath
     199    * CDB deploy command (template replication) allow to move a configuration from one stage to another one (e.g. : test, preprod, prod)
     200
     201Not done yet :
     202 * Visualization/navigation tool : PAN parser available but not yet integrated. Another option is to wait for panc v8.
     203 * SOAP eating too much memory..
     204 * Fine-grained locking
     205 * Common authentication service
     206 * Reverting committed changes
     207
     208Other issues :
     209 * ME leaving CERN in a couple of months...
     210
     211=== SCDB ===
     212
     213http://indico.cern.ch/materialDisplay.py?contribId=7&sessionId=3&materialId=slides&confId=20479
     214
     215Discussion :
     216 * Add script for generating HW templates from a CSV file into SCDB utilities
     217 * Allow Google to index Trac LCGQWG site
     218
     219=== Other Core Modules ===
     220
     221NotD : framework which allows users to authenticate using different mechanism and configure notifification using wassh-ssm
     222
     223No progress foreseen on CDB2SQL
     224 * Plan is to rewrite with a direct XML DB backend to avoid reparsing XML
     225 
     226Generic, plug-in based, login and authentication library : no real progress
     227 * Mainly for CDB, SWrep
     228
     229CCM :
     230 * Support for fetching foreign profiles added
     231 * De privilege CCM execution still pending
     232
     233NCM : lot of pending issues/requests. No progress.
     234 * cdispd :
     235   * Global activation switch
     236   * Restard after changes in modules
     237   * Improve handling of pre dependencies (Savanah 23814)
     238 * ncm-ncd :
     239    * Generic 'requirement' like package manager to allow a site to use its preferred package manager (Savanah 17681)
     240    * 'NOACTION_SUPPORTED' to allow dry run of a component (Savanah 20040)
     241    * --skip option to run all components except a few ones (Savanah 20619)
     242    * Support for RPM-less components (Savanah 21204)
     243
     244NCM components :
     245 * ncm-filesystems by Luis
     246   * Used at CERN for Xen-based vitrualization
     247 * lib/blockdevices : generic support for managing block devices in other components
     248 * ncm-rac for managing Oracle RAC
     249
     250PAN temmplates :
     251 * New schema for enclosures released
     252 * Fixes in set_partitions() for logical partitions
     253 * Support gLite WMS in schema
     254 * Support for md devices
     255 * Document Quattor standard PAN functions (push, npush,...) in some wiki page.
     256
     257SPMA/rpmt
     258 * SPMA should upgrade and not deinstall/install a package if arch and version is changed (Savanah 19934)
     259 * SPMA gives up at the first IO error (Savanah 29029) : let rpmt run until the end to get the full list of errors
     260
     261
     262== Other Developments ==
     263
     264=== QWG Templates ===
     265
     266See http://indico.cern.ch/materialDisplay.py?contribId=17&sessionId=4&materialId=slides&confId=20479
     267
     268
     269=== Xen Support ===
     270
     271Main tool for Xen support is ncm-xen :
     272 * Several updates since March
     273 * Actual configuration done by several variables (to be documented on QWG wiki) :
     274   * `XEN_GUESTS` : list of VMs to manage, corresponding to profiles
     275   * `XEN_BOOTLOADER` : pypxeboot, pygrub...
     276   * `XEN_BOOTARGS` : e.g. vif strings required to boot with pypxeboot..
     277   * `XEN_RAM`....
     278 * Future plans : delegate filesystem management to ncm-filesystems, support for other use cases (downloading images, use copy on write...)
     279 * Any need for supporting other VMMs (VMware...) ? Or a generic component ncm-virt ?
     280
     281Grid-Ireland experience :
     282 * PXE/KS install of host + guests : `install_server` machine type added. To be committed into standard templates
     283
     284=== Enclosures ===
     285
     286Enclosure description is required to address ore complex system configuration : multiple motherboards per case, blades, VMs...
     287 * Required to manage interventions : all nodes within an enclosure must be powered down
     288 * Generation of KS config file for all systems into the enclosure
     289 * '/hardware/enclosure' used to describe enclosure : CERN uses '/system/enclosure'. Agreement to use /system rather than /hardware as this is not necessarily real HW.
     290
     2912 options :
     292 * parent references children : already in use in Xen VMs
     293 * children reference parent
     294 * Both options are more or less incompatible : need to identify the main use cases.
     295   * For VMs, parent has to know about children : generate Xen config files, set up local backing store for VM FS...
     296   * For VMs, children don't car about their parents
     297   * The same apply to AII
     298 * "parent references children" seems to be the appropriate approach : CERN also uses it. Is there any missed use case for the other option ?
     299
     300=== Filesystems and Blockdevices ===
     301
     302See http://indico.cern.ch/materialDisplay.py?contribId=1&sessionId=5&materialId=slides&confId=20479
     303
     304Current layout (organized per HW disk) has several limitations :
     305 * Poor control of advanced options (tuning, raid spanning several disks)
     306 * SW raid is very difficult to describe with current schema
     307 * HW raid is impossible to control
     308
     309Design for proposed new model (Tim Bell/Andras Horvath) :
     310 * Separation between filesystems and block devices descriptions
     311 * Filesystems reference block devices
     312 * Several kind of block devices : block, hw_raid
     313 * Partitions are described inside block devices
     314 * LVM and SW raid defined separatly from block devices, referencing block devices
     315
     316Drawbacks and problems of proposed new model :
     317 * Too complex to implement : bi-directional inter-dependencies
     318 * Too close to "natural approach"
     319
     320New proposal (Luis) : pure top-down model
     321 * Filesystems referencing block devices
     322 * Block devices can be LVM volumes groups, SW raid, HW Raid, partitions, disks... Under /system/blockdevices. Several types of block devices available
     323 * A SW or LVM refers to another set of block devices (partitions, HW raid, disks...)
     324 * Partitions are a king of block devices referencing HW devices they seat in.
     325 * Very flexible and easy to extend
     326 * Currently allocated size is not checked against the (declared) disk size. Need to be added.
     327
     328This model has been implemented into ncm-filesystems and AII v2. Ready for integration into the main schema.
     329 * Need to confirm agreement from Tim and Andras
     330
     331=== ncm-filesystems ===
     332
     333Handles creation and modification of filesystems after installation
     334 * Required for several use cases, in particular VM management
     335 * Currently filesystems and blockdevices are described into component configuration : will be changed as soon as they have been included into the standard schema under /system.
     336
     337Uses a Perl class library (lib/blockdevices) to do the real jobs.
     338 * Same library is used by AII v2
     339
     340Future plans :
     341 * Quota management
     342 * Resizing operations
     343 * HW raid management
     344
     345=== AII ===
     346
     347AII 1.x status and problems :
     348 * KS support embedded into AII : difficult to support other installation systems (like JumpStart)
     349 * SW raid difficult
     350 * LVM support buggy
     351 * Current schema structure limits possibility for any improvement
     352
     353AII 2.0 design goals :
     354 * Support for installation method other than KS+PXE
     355 * Easier maintenance of Kickstart template : allow for easy integration of site customization
     356 * Support for complex device schemes
     357 * Command line (aii-shellfe) interface unchanged (only --notify has not been reimplemented)
     358 * AII is now just a plug-in loader (ncm-ncd model). 3 available plug-ins :
     359   * osinstall plug-in : KS installation
     360   * NBP plug-in : PXE configuration
     361   * DHCP plug-in : DHCP configuration
     362 * AII plug-ins inherit from NCM::Component and fetch profiles directly from (S)CDB
     363 * KS generator :
     364   * Uses ncm-lib-blockdevices, blockdevices/filesystems configured during %pre installation phase (better control of what is created/destroyed)
     365   * Uses hooks for local site customization : no more KS template. Ability to use and combine an arbitrary number of hooks. Organized by Kickstart phase (pre_install, post_install, post_reboot). Hooks must be implement as new methods for a NCM object subclass.
     366 * PXE generator : replaces PXE template and nbp-aii. Backward compatible, configuration relocated to /system/aii/nbp/pxelinux.
     367 
     368Future plans :
     369 * Privilege dropping : AII doesn't need to run as root for the most part.
     370 * Enclosure support : --recursive ?
     371
     372First production version may be ready by end of 2007.
     373