| 188 | * CERN / CNAF agree to migrate to panc v7 asap. For CERN, mainly the problem of assessing performance problems are fixed. |
| 189 | * Interval of 6 months minimum between major releases. Support version N-1. |
| 190 | |
| 191 | === CDB === |
| 192 | |
| 193 | Changes in the last 6 months |
| 194 | * cdb-cli : support for user-defined procedures, minimal support for disconnected operations, preparation for namespace support |
| 195 | * CDB deployer : ease setup of a test configuration on a set of nodes. |
| 196 | * Previously involved a lot of manual operations (mainly copy back and forth from production templates). |
| 197 | * Take advantages of namespaces and loadpaths : a ''stage'' (area) is prepend to the template name by defining the appropriate loadpath |
| 198 | * A (non-staged) meta-template defines the appropriate loadpath |
| 199 | * CDB deploy command (template replication) allow to move a configuration from one stage to another one (e.g. : test, preprod, prod) |
| 200 | |
| 201 | Not done yet : |
| 202 | * Visualization/navigation tool : PAN parser available but not yet integrated. Another option is to wait for panc v8. |
| 203 | * SOAP eating too much memory.. |
| 204 | * Fine-grained locking |
| 205 | * Common authentication service |
| 206 | * Reverting committed changes |
| 207 | |
| 208 | Other issues : |
| 209 | * ME leaving CERN in a couple of months... |
| 210 | |
| 211 | === SCDB === |
| 212 | |
| 213 | http://indico.cern.ch/materialDisplay.py?contribId=7&sessionId=3&materialId=slides&confId=20479 |
| 214 | |
| 215 | Discussion : |
| 216 | * Add script for generating HW templates from a CSV file into SCDB utilities |
| 217 | * Allow Google to index Trac LCGQWG site |
| 218 | |
| 219 | === Other Core Modules === |
| 220 | |
| 221 | NotD : framework which allows users to authenticate using different mechanism and configure notifification using wassh-ssm |
| 222 | |
| 223 | No progress foreseen on CDB2SQL |
| 224 | * Plan is to rewrite with a direct XML DB backend to avoid reparsing XML |
| 225 | |
| 226 | Generic, plug-in based, login and authentication library : no real progress |
| 227 | * Mainly for CDB, SWrep |
| 228 | |
| 229 | CCM : |
| 230 | * Support for fetching foreign profiles added |
| 231 | * De privilege CCM execution still pending |
| 232 | |
| 233 | NCM : lot of pending issues/requests. No progress. |
| 234 | * cdispd : |
| 235 | * Global activation switch |
| 236 | * Restard after changes in modules |
| 237 | * Improve handling of pre dependencies (Savanah 23814) |
| 238 | * ncm-ncd : |
| 239 | * Generic 'requirement' like package manager to allow a site to use its preferred package manager (Savanah 17681) |
| 240 | * 'NOACTION_SUPPORTED' to allow dry run of a component (Savanah 20040) |
| 241 | * --skip option to run all components except a few ones (Savanah 20619) |
| 242 | * Support for RPM-less components (Savanah 21204) |
| 243 | |
| 244 | NCM components : |
| 245 | * ncm-filesystems by Luis |
| 246 | * Used at CERN for Xen-based vitrualization |
| 247 | * lib/blockdevices : generic support for managing block devices in other components |
| 248 | * ncm-rac for managing Oracle RAC |
| 249 | |
| 250 | PAN temmplates : |
| 251 | * New schema for enclosures released |
| 252 | * Fixes in set_partitions() for logical partitions |
| 253 | * Support gLite WMS in schema |
| 254 | * Support for md devices |
| 255 | * Document Quattor standard PAN functions (push, npush,...) in some wiki page. |
| 256 | |
| 257 | SPMA/rpmt |
| 258 | * SPMA should upgrade and not deinstall/install a package if arch and version is changed (Savanah 19934) |
| 259 | * SPMA gives up at the first IO error (Savanah 29029) : let rpmt run until the end to get the full list of errors |
| 260 | |
| 261 | |
| 262 | == Other Developments == |
| 263 | |
| 264 | === QWG Templates === |
| 265 | |
| 266 | See http://indico.cern.ch/materialDisplay.py?contribId=17&sessionId=4&materialId=slides&confId=20479 |
| 267 | |
| 268 | |
| 269 | === Xen Support === |
| 270 | |
| 271 | Main tool for Xen support is ncm-xen : |
| 272 | * Several updates since March |
| 273 | * Actual configuration done by several variables (to be documented on QWG wiki) : |
| 274 | * `XEN_GUESTS` : list of VMs to manage, corresponding to profiles |
| 275 | * `XEN_BOOTLOADER` : pypxeboot, pygrub... |
| 276 | * `XEN_BOOTARGS` : e.g. vif strings required to boot with pypxeboot.. |
| 277 | * `XEN_RAM`.... |
| 278 | * Future plans : delegate filesystem management to ncm-filesystems, support for other use cases (downloading images, use copy on write...) |
| 279 | * Any need for supporting other VMMs (VMware...) ? Or a generic component ncm-virt ? |
| 280 | |
| 281 | Grid-Ireland experience : |
| 282 | * PXE/KS install of host + guests : `install_server` machine type added. To be committed into standard templates |
| 283 | |
| 284 | === Enclosures === |
| 285 | |
| 286 | Enclosure description is required to address ore complex system configuration : multiple motherboards per case, blades, VMs... |
| 287 | * Required to manage interventions : all nodes within an enclosure must be powered down |
| 288 | * Generation of KS config file for all systems into the enclosure |
| 289 | * '/hardware/enclosure' used to describe enclosure : CERN uses '/system/enclosure'. Agreement to use /system rather than /hardware as this is not necessarily real HW. |
| 290 | |
| 291 | 2 options : |
| 292 | * parent references children : already in use in Xen VMs |
| 293 | * children reference parent |
| 294 | * Both options are more or less incompatible : need to identify the main use cases. |
| 295 | * For VMs, parent has to know about children : generate Xen config files, set up local backing store for VM FS... |
| 296 | * For VMs, children don't car about their parents |
| 297 | * The same apply to AII |
| 298 | * "parent references children" seems to be the appropriate approach : CERN also uses it. Is there any missed use case for the other option ? |
| 299 | |
| 300 | === Filesystems and Blockdevices === |
| 301 | |
| 302 | See http://indico.cern.ch/materialDisplay.py?contribId=1&sessionId=5&materialId=slides&confId=20479 |
| 303 | |
| 304 | Current layout (organized per HW disk) has several limitations : |
| 305 | * Poor control of advanced options (tuning, raid spanning several disks) |
| 306 | * SW raid is very difficult to describe with current schema |
| 307 | * HW raid is impossible to control |
| 308 | |
| 309 | Design for proposed new model (Tim Bell/Andras Horvath) : |
| 310 | * Separation between filesystems and block devices descriptions |
| 311 | * Filesystems reference block devices |
| 312 | * Several kind of block devices : block, hw_raid |
| 313 | * Partitions are described inside block devices |
| 314 | * LVM and SW raid defined separatly from block devices, referencing block devices |
| 315 | |
| 316 | Drawbacks and problems of proposed new model : |
| 317 | * Too complex to implement : bi-directional inter-dependencies |
| 318 | * Too close to "natural approach" |
| 319 | |
| 320 | New proposal (Luis) : pure top-down model |
| 321 | * Filesystems referencing block devices |
| 322 | * Block devices can be LVM volumes groups, SW raid, HW Raid, partitions, disks... Under /system/blockdevices. Several types of block devices available |
| 323 | * A SW or LVM refers to another set of block devices (partitions, HW raid, disks...) |
| 324 | * Partitions are a king of block devices referencing HW devices they seat in. |
| 325 | * Very flexible and easy to extend |
| 326 | * Currently allocated size is not checked against the (declared) disk size. Need to be added. |
| 327 | |
| 328 | This model has been implemented into ncm-filesystems and AII v2. Ready for integration into the main schema. |
| 329 | * Need to confirm agreement from Tim and Andras |
| 330 | |
| 331 | === ncm-filesystems === |
| 332 | |
| 333 | Handles creation and modification of filesystems after installation |
| 334 | * Required for several use cases, in particular VM management |
| 335 | * Currently filesystems and blockdevices are described into component configuration : will be changed as soon as they have been included into the standard schema under /system. |
| 336 | |
| 337 | Uses a Perl class library (lib/blockdevices) to do the real jobs. |
| 338 | * Same library is used by AII v2 |
| 339 | |
| 340 | Future plans : |
| 341 | * Quota management |
| 342 | * Resizing operations |
| 343 | * HW raid management |
| 344 | |
| 345 | === AII === |
| 346 | |
| 347 | AII 1.x status and problems : |
| 348 | * KS support embedded into AII : difficult to support other installation systems (like JumpStart) |
| 349 | * SW raid difficult |
| 350 | * LVM support buggy |
| 351 | * Current schema structure limits possibility for any improvement |
| 352 | |
| 353 | AII 2.0 design goals : |
| 354 | * Support for installation method other than KS+PXE |
| 355 | * Easier maintenance of Kickstart template : allow for easy integration of site customization |
| 356 | * Support for complex device schemes |
| 357 | * Command line (aii-shellfe) interface unchanged (only --notify has not been reimplemented) |
| 358 | * AII is now just a plug-in loader (ncm-ncd model). 3 available plug-ins : |
| 359 | * osinstall plug-in : KS installation |
| 360 | * NBP plug-in : PXE configuration |
| 361 | * DHCP plug-in : DHCP configuration |
| 362 | * AII plug-ins inherit from NCM::Component and fetch profiles directly from (S)CDB |
| 363 | * KS generator : |
| 364 | * Uses ncm-lib-blockdevices, blockdevices/filesystems configured during %pre installation phase (better control of what is created/destroyed) |
| 365 | * Uses hooks for local site customization : no more KS template. Ability to use and combine an arbitrary number of hooks. Organized by Kickstart phase (pre_install, post_install, post_reboot). Hooks must be implement as new methods for a NCM object subclass. |
| 366 | * PXE generator : replaces PXE template and nbp-aii. Backward compatible, configuration relocated to /system/aii/nbp/pxelinux. |
| 367 | |
| 368 | Future plans : |
| 369 | * Privilege dropping : AII doesn't need to run as root for the most part. |
| 370 | * Enclosure support : --recursive ? |
| 371 | |
| 372 | First production version may be ready by end of 2007. |
| 373 | |