Changes between Version 1 and Version 2 of Meetings/Workshops/20111011


Ignore:
Timestamp:
Oct 14, 2011, 8:59:11 AM (14 years ago)
Author:
/O=GRID-FR/C=FR/O=CNRS/OU=LAL/CN=Michel Jouvin
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • Meetings/Workshops/20111011

    v1 v2  
    148148   * Use stand-up/review meetings to decide merging into trunk
    149149   
     150
     151== Round Tables ==
     152
     153=== Kickstart / AII === #AII
     154
     155Disk partitionning: current scheme has advantages (full control) but doesn't allow to format large boot disks (> 2GB)
     156 * Support for GPT partitions: need to check if Anaconda supports it
     157 * Using standard Anaconda partitionning features would help taking advantage of new features but should accept to give up some control
     158 * Need to check that reinstallation works in all configuration
     159 
     160Kickstart configuration is done by an AII plugin: could if necessary replace the plugin or add a hook
     161 * NIKHEF already ignores pre-script and produces the kickstart instructions required to use standard Anaconda formatting features
     162 * NIKHEF has written a hook disklayout that could be used as an alternative to aii-ks call to lib-blockdevices
     163 * In fact `aii-ks`/`lib-blockdevices` already generates (almost) the required information for Anaconda, just with `noformat`
     164 
     165Need to add as a standard AII function the function allowing to add a new hook whatever are the hooks already defined
     166 * Such an helper function already exists in hook for SINDES
     167 
     168`aii-ks`: really fix pbs mentionned in ticket #235 (See #235#comment4)
     169
     170`aii-pxelinux`: see if it possible to specify the MAC address to use rather than the interface name
     171 * From http://wiki.centos.org/TipsAndTricks/KickStart:
     172 {{{
     173 A third method works if you are doing PXE based installations. Then you add IPAPPEND 2 to the PXE configuration file and use ksdevice=bootif. In this case anaconda will use the interface that did the PXE boot (this does not necessarily needs to be the first one with a active link).
     174 }}}
     175 
     176=== SL/RHEL6 Support ===
     177
     178MAC address binding to interfaces: now done by `udev` rather than the interface config file.
     179 * May be a source of problems as `udev` config is done by Kickstart before interface config files are configured by Quattor with some risk of inconsistency
     180 * 1 possibility to explore: use non standard name in Quattor config and define `udev` rules appropriatly
     181   * Check impact on apps and operations
     182   
     183On compute nodes running a RHEL6 derivative, kernel option `nohz` should be turned `off`(`nohz=off`) in Grub config to get better performances (Stijn)
     184 * Scheduler option to make it more responsive on desktops
     185 * Define by defaults for RHEL6 derivative?
     186 
     187=== SPMA vs. YUM ===
     188
     189See Steve Traylen's https://twiki.cern.ch/twiki/bin/view/Main/SteveTraylen/Spma2Yum.
     190 * All SPMA features can now be implemented with YUM
     191 * Main drawback is the cost of the migration...
     192 
     193Several open questions
     194 * `ncm-yum requires some extensions not handled by anybody
     195 * What is the right level of detail in package list: proposed use of repository snapshots will probably not work accross sites to implement things like OS errata
     196 * We need to implement some kind of metadependency for a packager: something like a componenent alias name implemented in `ncm-ncd`
     197   * This may replace the need for SPMA as an abstract layer to the packager, something that doesn't work properly as the package description is quite tightly coupled with the actual packager, making a common schema difficult.
     198
     199=== Change Scheduling ===
     200
     201MS use cases
     202 * `spma` not allowed to run on a running host, can only run at reboot
     203   * `ncm-spma` runs but with the flag to run `spma` disabled to avoid preventing other components to run
     204 * Some actions cannot be scheduled without reinstalling: would be great to let the user knows that his changes will not be applied immediatly
     205   * May be handled at Aquilon (SCDB?) level: will require a partial diff of XML profiles: need to find a tool to do this
     206 
     207Partial configuration changes look difficult as Quattor is really designed to enforce consistency on a "all or nothing" basis.
     208
     209The most important feature would be the implementation of changes at controlled time.
     210
     211For SPMA, it'd be nice to establish a black list of packages that should not be upgraded except at boot time.
     212
     213`ncm-ncd`: add a mechanism to blacklist some components if some conditions are not met.
     214
     215List of possibile conditions/states (local time, boot context...) shoud be well defined and a library/common method should be available for all components to evaluate the state/condition the same way.
     216 * Exact actions done if a condition is met will remain the responsability of the components or `ncm-ncd`
     217 
     218Would be useful to let a component advertize that a change will require a reboot
     219 * Could be implemented through the status file produced when the component is run
     220 
     221=== Network Configuration ===
     222
     223MS main requirements
     224 * Bonding
     225 * policy routing
     226 * ethertools parameters
     227 
     228Some patches contributed for current `ncm-network` to handle this but would be good to rethink the schema
     229 * In particular the component configuration should be moved to `/software/components` and most of the things currently under `/system/network` should be moved there to allow an easier support of appliances and non Linux boxes
     230   * In particular most things currently under `/system/network/interfaces`
     231   * Some children of interfaces in the configuration should be probably at the same level (bonds, vlans...) in the same way we have blockdevices and filesystems
     232   * Gabor will initiate the discussion on the mailing list with a proposal
     233   
     234`ncm-network` rewrite: a basic version is available in SVN, ready for code review
     235 * https://quattor.svn.sourceforge.net/svnroot/quattor/branches/ncm-network-rewrite-maven
     236 * Try to deploy on test machines after initial code review
     237
     238=== QWG Global Variables ===
     239
     240Hot discussion about use of global variables in QWG
     241 * Contents unvalidated
     242 * Request to put them in the profile to get them validated
     243   * Michel strongly opposed as the state value at one point in time may be misleading: what is important is the value in the context of a given template, something that cannot be tracked in the XML profile
     244 * Side discussion: filecopy unvalided contents
     245 
     246Until we have a pan debugger (no concrete project anymore), some possibilities:
     247 * Ensure there is the appropriate debug statements in the template
     248 * Consider as bugs templates using a variable without any content validation (when it makes sense) and fix them!
     249 * See with Cal if a `dump_global_variables()` function would make sense in PAN to dump all global variable at one point in time
     250   * If only a subset of the variables are needed, can probably be done with a `debug()` statement
     251   * `dump_global_variable` could accept as an arguement a regexp to match the variable name to restrict to variables used in the template
     252   
     253Discuss with call the possibility to dislay with a Pan option the name of the template entered and wheter it is executed or ignored because it has the `unique` tag.
     254
     255Document the global variable "namespaces" used in QWG variables.
     256
     257=== Configuration Component Logging ===
     258
     259Document/agree on what is the expected level of default verbosity for a component (log/info) and what is the debug level to use for what.
     260
     261== Sprint 3 Review ==
     262
     263